Search

Now showing items 21-30 of 44

NCHLT isiXhosa Annotated Text Corpora

Martin Puttkammer; Martin Schlemmer; Ruan Bekker (North-West University; Centre for Text Technology (CTexT), 2014-05-30) ~ Resource Catalogue

Lemmatised, part of speech tagged and morphologically analysed corpora developed during the NCHLT Text project.

isiXhosa Genre Classification Corpus

Gerhard van Huyssteen; D.P. Snyman (Trifonius, 2013-06-19) ~ Resource Catalogue

Contains training and testing data for Genre Classification for isiXhosa.

African Wordnet: isiXhosa 1.0

African Wordnet Project (UNISA, 2017-06-20) ~ Resource Catalogue

Developed using the expand model with Princeton WordNet 2.0 as basis. Each wordnet contains synsets with at least the following fields:\nWord form (lemma; ...

DictionaryMaker

Marelie Davel; Maritali Tempest; Thomas Fogwill; Andries Janse van Rensburg; Pieter van Zyl; Francois Aucamp; Louis Joubert; Bryan McAlister (Meraka Institute, CSIR, 2013-07-15) ~ Resource Catalogue

The purpose of the DictionaryMaker system is to facilitate the creation of an electronic pronunciation dictionary in a target language, as originally ...

African Speech Technology isiXhosa Text Corpus

CatchWord Language and Speech Technologies (Pty) Ltd (North-West University; Stellenbosch University; University of Transkei; University of Free State (Qwa-Qwa campus); Rhodes University; University of KwaZulu-Natal; University of Western Cape, 2015-01-07) ~ Resource Catalogue

Monolingual text corpus developed during the African Speech Technology project.

NCHLT isiXhosa Speech Corpus

Charl van Heerden; Etienne Barnard; Jaco Badenhorst; Marelie Davel; Alta de Waal (Meraka Institute, CSIR; North-West University, 2014-07-08) ~ Resource Catalogue

Orthographically transcribed broadband speech corpus of approximately 56 hours, including a test suite of 8 speakers.

Lwazi II isiXhosa TTS Corpus

Daniel van Niekerk; Georg Schlünz (Meraka Institute, CSIR; North-West University, 2015-11-20) ~ Resource Catalogue

Orthographic and phonemically aligned transcriptions.

Speect

Daniel van Niekerk; Aby Louw (Meraka Institute, CSIR, 2013-07-15) ~ Resource Catalogue

Speect is a multilingual text-to-speech (TTS) system. It offers a full TTS system (text analysis which decodes the text, and speech synthesis, which ...

Lwazi III isiXhosa TTS Corpus

Aby Louw; Georg Schlünz (Meraka Institute, CSIR, 2016-06-17) ~ Resource Catalogue

Complete audio recordings with orthographic transcriptions. TTS corpus for standard SA dialect. This corpus was created to enable the building of a TTS voice.

Autshumato TMX Integrator

Martin Schlemmer; Wildrich Fourie (North-West University; Centre for Text Technology (CTexT), 2013-06-20) ~ Resource Catalogue

Utility to merge multiple translation memories over a network using Subversion

View previous page
1
2
3
4
5
View next page