Now showing items 1-10 of 16
Lwazi Setswana ASR corpus
(Meraka Institute, CSIR, 2013-04-02) - Resource Catalogue
Complete audio recordings and orthographic transcriptions used for Lwazi speech recognition systems.
NCHLT-inlang Pronunciation Dictionaries
(Meraka Institute, CSIR; North-West University, 2014-07-04) - Resource Catalogue
Broad phonemic transcriptions for 15,000 generic words in each of 11 languages. Each dictionary has an associated rule set for generating pronunciations ...
Lwazi Setswana TTS corpus
(Meraka Institute, CSIR, 2013-03-27) - Resource Catalogue
Orthographic and phonemically aligned transcriptions.
High quality TTS data for four South African languages (af, st, tn, xh)
(Google; North-West University, 2017) - Resource Catalogue
This data set contains multi-speaker TTS high quality transcribed audio data for four languages of South Africa: Afrikaans, Sesotho, Setswana and isiXhosa. ...
Corpus of multilingual code-switched soap opera speech
(Stellenbosch University, 2020-02-28)
The corpus comprises 26.9 hours of annotated multilingual speech that contains examples of code-switching in isiZulu, isiXhosa, Setswana, Sesotho and ...
NCHLT Setswana Auxiliary Speech Corpus
(CSIR Meraka Institute; North-West University, 2019-06-01) - Resource Catalogue
The corpus contains orthographically transcribed broadband speech in each of South Africa's eleven official languages. Transcriptions are provided in ...
Lwazi II Setswana TTS Corpus
(Meraka Institute, CSIR; North-West University, 2015-11-20) - Resource Catalogue
Orthographic and phonemically aligned transcriptions.
Lwazi Setswana Pronunciation Dictionary
(Meraka Institute, CSIR, 2013-04-01) - Resource Catalogue
General phonemic pronunciations for frequently occurring words in SA languages. Dictionaries were developed to be practically usable for speech technology ...
PHONAAS
(North-West University; Centre for Text Technology (CTexT), 2015-06-30) - Resource Catalogue
PHONAAS is a graphical user interface (GUI) tool, written in Perl and GTK2, using the R programming language and PRAAT to extract vowel formant data.
W-NORM
(North-West University; Centre for Text Technology (CTexT), 2015-06-30) - Resource Catalogue
W-NORM is a graphical user interface (GUI), written in Perl and GTK2, for the Vowels 1.2 package. Vowels 1.2 is written in the R programming language ...