Search
Now showing items 61-70 of 238
Lwazi Sesotho ASR corpus
(Meraka Institute, CSIR, 2013-04-02) - Resource Catalogue
Complete audio recordings and orthographic transcriptions used for Lwazi speech recognition systems.
NCHLT Sesotho Annotated Text Corpora
(North-West University; Centre for Text Technology (CTexT), 2014-05-30) - Resource Catalogue
Lemmatised, part-of-speech tagged and morphologically analysed corpora developed during the NCHLT Text project.
NCHLT English Text Corpora
(North-West University; Centre for Text Technology (CTexT), 2016-09-09) - Resource Catalogue
Collection consisting of a clean corpus, lexicon, frequency list and named-entity lists developed during the NCHLT Text project.
High quality TTS data for four South African languages (af, st, tn, xh)
(Google; North-West University, 2017) - Resource Catalogue
This data set contains high-quality, multi-speaker transcribed TTS audio data for four languages of South Africa: Afrikaans, Sesotho, Setswana and isiXhosa. ...
African Wordnet: isiZulu 1.0
(UNISA, 2017-06-20) - Resource Catalogue
Developed using the expand model with Princeton WordNet 2.0 as basis. Each wordnet contains synsets with at least the following fields: Word form (lemma; ...
Lwazi II Sesotho TTS Corpus
(Meraka Institute, CSIR; North-West University, 2015-11-20) - Resource Catalogue
Orthographic and phonemically aligned transcriptions.
isiZulu Custom Dictionary for Government Domain
(North-West University; Centre for Text Technology (CTexT), 2013-02-22) - Resource Catalogue
Word list developed as a custom dictionary for use in the spelling checkers as part of the spelling checker project for the Department of Arts and ...
NCHLT isiZulu Speech Corpus
(Meraka Institute, CSIR; North-West University, 2014-07-08) - Resource Catalogue
Orthographically transcribed broadband speech corpus of approximately 56 hours, including a test suite of 8 speakers.
NCHLT Setswana Auxiliary Speech Corpus
(CSIR Meraka Institute; North-West University, 2019-06-01) - Resource Catalogue
The corpus contains orthographically transcribed broadband speech in each of South Africa's eleven official languages. Transcriptions are provided in ...
Lwazi English Pronunciation Dictionary
(Meraka Institute, CSIR, 2013-04-01) - Resource Catalogue
General phonemic pronunciations for frequently occurring words in SA languages. Dictionaries were developed to be practically usable for speech technology ...