Search
Now showing items 141-150 of 227
NCHLT Siswati Text Corpora
(North-West University; Centre for Text Technology (CTexT), 2014-05-30) ~ - Resource Catalogue
Collection of source text documents, genre classified text documents, raw corpus, clean corpus, lexicon, frequency list and named-entity lists developed ...
Autshumato Afrikaans-English Translation Memory
(North-West University; Centre for Text Technology (CTexT), 2013-06-19) ~ - Resource Catalogue
Translation memory from Afrikaans to English (EN-GB), in the government domain for use in the Autshumato ITE application.
Multilingual Arts & Culture Intermediate Phase Terminology List
(Terminology Coordination Section of the National Language Service, Department of Arts and Culture, 2017-03-03) ~ - Resource Index
550 English source terms with their equivalents in the ten other official South African languages. The list was compiled in collaboration with subject ...
Test treebank for the Tswana GF miniature resource grammar
(University of South Africa, 2018-03-01) ~ - Resource Index
A selection of 744 Tswana sentences with their GF parse trees
Afrikaans Radio News Speech Corpus
(Meraka Institute, CSIR; North-West University; Stellenbosch University; Stellenbosch Universtity, 2015-01-28) ~ - Resource Index
News bulletins purchased from the SABC. Data to be used for the development of a large vocabulary continuous speech recognition system for Afrikaans.
NCHLT isiNdebele Phrase Chunk Annotated Corpus
(North-West University; Centre for Text Technology (CTexT), 2016-04-29) ~ - Resource Catalogue
Phrase chunk annotated data for the NCHLT Text Resource Development: Phase II Project. The phrase chunk annotated data is a subset of the 50,000 tokens ...
Lwazi isiNdebele ASR corpus
(Meraka Institute, CSIR, 2013-04-02) ~ - Resource Catalogue
Complete audio recordings and orthographic transcriptions used for Lwazi speech recognition systems.
African Speech Technology isiXhosa Text Corpus
(North-West University; Stellenbosch University; University of Transkei; University of Free State (Qwa-Qwa campus); Rhodes University; University of KwaZulu-Natal; University of Western Cape, 2015-01-07) ~ - Resource Catalogue
Monolingual text corpus developed during the African Speech Technology project.
Lwazi Sesotho Pronunciation Dictionary
(Meraka Institute, CSIR, 2013-04-01) ~ - Resource Catalogue
General phonemic pronunciations for frequently occurring words in SA languages. Dictionaries were developed to be practically usable for speech technology ...
Lwazi II English TTS Corpus
(Meraka Institute, CSIR; North-West University, 2015-11-20) ~ - Resource Catalogue
Orthographic and phonemically aligned transcriptions.