Browsing Language Resource Management Agency by Title
Filter by:
Now showing items 210-229 of 527
-
Lwazi Tshivenda ASR corpus
(Meraka Institute, CSIR, 2013-06-24) ~Resource Catalogue Complete audio recordings and orthographic transcriptions used for Lwazi speech recognition systems. -
Lwazi Tshivenda Pronunciation Dictionary
(Meraka Institute, CSIR, 2013-04-01) ~Resource Catalogue General phonemic pronunciations for frequently occurring words in SA languages. Dictionaries were developed to be practically usable for speech technology ... -
Lwazi Tshivenda TTS corpus
(Meraka Institute, CSIR, 2013-03-27) ~Resource Catalogue Orthographic and phonemically aligned transcriptions -
Lwazi Xitsonga ASR corpus
(Meraka Institute, CSIR, 2013-04-02) ~Resource Catalogue Complete audio recordings and orthographic transcriptions used for Lwazi speech recognition systems. -
Lwazi Xitsonga Pronunciation Dictionary
(Meraka Institute, CSIR, 2013-04-01) ~Resource Catalogue General phonemic pronunciations for frequently occurring words in SA languages. Dictionaries were developed to be practically usable for speech technology ... -
Lwazi Xitsonga TTS corpus
(Meraka Institute, CSIR, 2013-03-27) ~Resource Catalogue Orthographic and phonemically aligned transcriptions -
Mburisano Covid-19 multilingual corpus
(CSIR Voice Computing, 2020-12-04)This corpus was created to aid development of the AwezaMed Covid-19 speech-to-speech mobile application. The project within which it was created, ... -
Media24 Afrikaans Test Corpus
(Department of African Languages - University of Pretoria, 2015-01-27) ~Resource Index Collection of texts from Media24 newspaper texts for building lexicographic modules such as the Afrikaans Trajectories -
MELFA (Mobile E-Learning for Africa)
(CatchWord; Stellenbosch University; Sensus; Copenhagen University, 2013-07-01) ~Resource Index Embedded TTS on mobile phones providing speech output of full texts for various types of applications. -
MobiLex and SA Trilingual Wine Industry Dictionary
(Stellenbosch University; Winetech; SAWIS, 2018-02-28) ~Resource Index Trilingual LSP dictionary on cellphone and website Trilingual LSP dictionary on website -
Monolingual isiXhosa corpus
(North-West University - Centre for Text Technology (CTexT), 2019-11-30) ~Resource Catalogue Monolingual corpus for isiXhosa. The data is given as a single UTF-8 text file, with each segment on a newline. The dataset contains existing data ... -
Monolingual Siswati Corpus
(North-West University - Centre for Text Technology (CTexT), 2022-03-31)Monolingual corpus for SiSwati. The data is given as a single UTF-8 text file, with each segment on a newline. The dataset contains existing data sourced ... -
Morphologically annotated corpus for isiNdebele
(Centre for Text Technology (CTexT), 2024-01-31)NCHLT corpus of morphologically annotated tokens in isiNdebele converted to the tags used during phases 1 and 2 of the SADiLaR-II project. The data ... -
Morphologically annotated corpus for isiXhosa
(Centre for Text Technology (CTexT), 2024-01-31)NCHLT corpus of morphologically annotated tokens in isiXhosa converted to the tags used during phases 1 and 2 of the SADiLaR-II project. The data is ... -
Morphologically annotated corpus for isiZulu
(Centre for Text Technology (CTexT), 2024-01-31)NCHLT corpus of morphologically annotated tokens in isiZulu converted to the tags used during phases 1 and 2 of the SADiLaR-II project. The data is ... -
Morphologically annotated corpus for Sepedi
(Centre for Text Technology (CTexT), 2024-01-31)NCHLT corpus of morphologically annotated tokens in Sepedi converted to the tags used during phases 1 and 2 of the SADiLaR-II project. The data is ... -
Morphologically annotated corpus for Sesotho
(Centre for Text Technology (CTexT), 2024-01-31)NCHLT corpus of morphologically annotated tokens in Sesotho converted to the tags used during phases 1 and 2 of the SADiLaR-II project. The data is ... -
Morphologically annotated corpus for Setswana
(Centre for Text Technology (CTexT), 2024-01-31)NCHLT corpus of morphologically annotated tokens in Setswana converted to the tags used during phases 1 and 2 of the SADiLaR-II project. The data is ... -
Morphologically annotated corpus for Siswati
(Centre for Text Technology (CTexT), 2024-01-31)NCHLT corpus of morphologically annotated tokens in Siswati converted to the tags used during phases 1 and 2 of the SADiLaR-II project. The data is ... -
Morphologically annotated corpus for Tshivenḓa
(Centre for Text Technology (CTexT), 2024-01-31)NCHLT corpus of morphologically annotated tokens in Tshivenḓa converted to the tags used during phases 1 and 2 of the SADiLaR-II project. The data is ...