Search
Now showing items 181-190 of 227
Autshumato English-Xitsonga Manually Translated Parallel Corpora
(North-West University; Centre for Text Technology (CTexT), 2014-12-12) ~ - Resource Catalogue
Aligned English-Xitsonga parallel corpus. The data is given as two seperate UTF-8 text files; with each segment on a newline.
Lwazi III isiXhosa TTS Corpus
(Meraka Institute, CSIR, 2016-06-17) ~ - Resource Catalogue
Complete audio recordings with orthographic transcriptions. TTS corpus for standard SA dialect. This corpus was created to enable the building of a TTS voice.
African Wordnet: Tshivenda 1.0
(UNISA, 2017-06-20) ~ - Resource Catalogue
Developed using the expand model with Princeton WordNet 2.0 as basis. Each wordnet contains synsets with at least the following fields:\nWord form (lemma; ...
NHN Zulu corpora
(University of the Witwatersrand, 2015-01-07) ~ - Resource Index
A first step to building a corpus of POS-annotated Zulu texts.
African Speech Technology Afrikaans-English Speech Corpus
(North-West University; Stellenbosch University; University of Transkei; University of Free State (Qwa-Qwa campus); Rhodes University; University of KwaZulu-Natal; University of Western Cape, 2014-12-10) ~ - Resource Catalogue
African Speech Technology speech and transcription data for the Afrikaans-English database. The "speech" directory contains English speech as spoken ...
Multilingual Election Terminology List
(Terminology Coordination Section of the National Language Service, Department of Arts and Culture, 2017-02-15) ~ - Resource Index
559 English source terms with their equivalents in the ten other official South African languages. The list was compiled in collaboration with subject ...
Siswati Custom Dictionary for Government Domain
(North-West University; Centre for Text Technology (CTexT), 2013-02-22) ~ - Resource Catalogue
Word list developed as a custom dictionary for use in the spelling checkers as part of the spelling checker project for the Department of Arts and ...
Tshivenda Genre Classification Corpus
(Trifonius, 2013-06-19) ~ - Resource Catalogue
Contains training and testing data for Genre Classification for Tshivenda.
Pretoria Tshivenda Corpus
(Department of African Languages - University of Pretoria, 2015-01-27) ~ - Resource Index
Collection of texts for general linguistic research, in particular for lexicography
Lwazi Sepedi ASR corpus
(Meraka Institute, CSIR, 2013-04-02) ~ - Resource Catalogue
Complete audio recordings and orthographic transcriptions used for Lwazi speech recognition systems.