Search
Now showing items 1-10 of 24
NCHLT Sepedi Auxiliary Speech Corpus
(CSIR Meraka Institute; North-West University, 2019-06-01) ~ - Resource Catalogue
The corpus contains orthographically transcribed broadband speech in each of South Africa's eleven official languages. Transcriptions are provided in ...
Phonetic aligner
(Meraka Institute, CSIR, 2013-07-01) ~ - Resource Index
Scripts for automatic phonetic alignment of speech corpora using hidden markov models (HMMs).
Lwazi II Sepedi TTS Corpus
(Meraka Institute, CSIR; North-West University, 2015-11-20) ~ - Resource Catalogue
Orthographic and phonemically aligned transcriptions.
Lwazi Sepedi Pronunciation Dictionary
(Meraka Institute, CSIR, 2013-04-01) ~ - Resource Catalogue
General phonemic pronunciations for frequently occurring words in SA languages. Dictionaries were developed to be practically usable for speech technology ...
Sepedi Pronunciation Dictionary
(University of Limpopo (Turfloop Campus), 2015-01-27) ~ - Resource Index
A pronunciation dictionary for Sepedi ASR system was created from transcriptions of speech data collected.
NCHLT-inlang Pronunciation Dictionaries
(Meraka Institute, CSIR; North-West University, 2014-07-04) ~ - Resource Catalogue
Broad phonemic transcriptions for 15,000 generic words in each of 11 languages. Each dictionary has an associated rule set for generating pronunciations ...
Lwazi Sepedi TTS corpus
(Meraka Institute, CSIR, 2013-03-27) ~ - Resource Catalogue
Orthographic and phonemically aligned transcriptions
Verbtone Sepedi
(University of the Witwatersrand, 2015-01-27) ~ - Resource Index
Recordings of sentences with verb structures, showing one or two high tones on differing morphological constituent.
Sepedi Speech Corpora
(University of Limpopo (Turfloop Campus), 2015-01-27) ~ - Resource Index
A corpus of Sesotho sa Leboa telephone speech data collected from mother tongue speakers of the standard version of Sesotho sa Leboa for the purpose ...
NCHLT Sepedi Speech Corpus
(Meraka Institute, CSIR; North-West University, 2014-07-08) ~ - Resource Catalogue
Orthographically transcribed broadband speech corpus of approximately 56 hours, including a test suite of 8 speakers.