Now showing items 11-20 of 70
Multilingual Soccer Terminology List
(Terminology Coordination Section of the National Language Service, Department of Arts and Culture, 2017-03-03) ~ - Resource Index
297 English source terms with their equivalents in the ten other official South African languages. On the eve of the 2010 FIFA World Cup, the list was ...
Asterisk Nuance 1.4.
(Molo Afrika Speech Technologies, 2015-01-28) ~ - Resource Index
Integration of the commercial Nuance speech-recognition engine with the Asterisk open-source platform.
Multilingual Mathematics Terminology List (Grade R - 6)
(Terminology Coordination Section of the National Language Service, Department of Arts and Culture, 2017-03-03) ~ - Resource Index
984 English source terms with their equivalents in the ten other official South African languages. The list was compiled in collaboration with subject ...
NCHLT Siswati Auxiliary Speech Corpus
(CSIR Meraka Institute; North-West University, 2019-06-01) ~ - Resource Catalogue
The corpus contains orthographically transcribed broadband speech in each of South Africa's eleven official languages. Transcriptions are provided in ...
Lara2
(North-West University; Centre for Text Technology (CTexT), 2014-05-30) ~ - Resource Catalogue
Tool for annotating texts with lemma, part of speech and morphological analysis information
Mburisano Covid-19 multilingual corpus
(CSIR Voice Computing, 2020-12-04)
This corpus was created to aid development of the AwezaMed Covid-19 speech-to-speech mobile application. The project within which it was created, ...
Lwazi Siswati Pronunciation Dictionary
(Meraka Institute, CSIR, 2013-04-01) ~ - Resource Catalogue
General phonemic pronunciations for frequently occurring words in SA languages. Dictionaries were developed to be practically usable for speech technology ...
Lwazi Siswati TTS corpus
(Meraka Institute, CSIR, 2013-03-27) ~ - Resource Catalogue
Orthographic and phonemically aligned transcriptions
Spelt
(Translate.org.za, 2015-01-28) ~ - Resource Index
Spelt allows a linguist to classify surface forms of words. The word can be associated with a root form and with a word classification. The primary use ...
Monolingual Siswati Corpus
(North-West University - Centre for Text Technology (CTexT), 2022-03-31)
Monolingual corpus for Siswati. The data is given as a single UTF-8 text file, with each segment on a new line. The dataset contains existing data sourced ...