Search

Now showing items 11-20 of 74

NCHLT-inlang Pronunciation Dictionaries

Marelie Davel (Meraka Institute, CSIR; North-West University, 2014-07-04) ~ Resource Catalogue

Broad phonemic transcriptions for 15,000 generic words in each of 11 languages. Each dictionary has an associated rule set for generating pronunciations ...

Multilingual Soccer Terminology List

Unknown author (Terminology Coordination Section of the National Language Service, Department of Arts and Culture, 2017-03-03) ~ Resource Index

297 English source terms with their equivalents in the ten other official South African languages. On the eve of the 2010 FIFA World Cup, the list was ...

Asterisk Nuance 1.4.

Unknown author (Molo Afrika Speech Technologies, 2015-01-28) ~ Resource Index

Integration of commercial Nuance speech-recognition engine to the Asterisk open-source platform.

Multilingual Mathematics Terminology List (Grade R - 6)

Unknown author (Terminology Coordination Section of the National Language Service, Department of Arts and Culture, 2017-03-03) ~ Resource Index

984 English source terms with their equivalents in the ten other official South African languages. The list was compiled in collaboration with subject ...

Autshumato Xitsonga Monolingual Corpora

Wikus Pienaar; Wildrich Fourie; Cindy McKellar (North-West University; Centre for Text Technology (CTexT), 2014-12-12) ~ Resource Catalogue

Xitsonga monolingual corpus as deliverable of the Autshumato project. The data is given as a UTF-8 text file; with each sentence on a newline.

Lwazi Xitsonga TTS corpus

Daniel van Niekerk; Etienne Barnard; Marelie Davel; Aby Louw; Alta de Waal (Meraka Institute, CSIR, 2013-03-27) ~ Resource Catalogue

Orthographic and phonemically aligned transcriptions

Lara2

Martin Puttkammer; Martin Schlemmer (North-West University; Centre for Text Technology (CTexT), 2014-05-30) ~ Resource Catalogue

Tool for annotating texts with lemma, part of speech and morphological analysis information

Autshumato English-Xitsonga Parallel Corpora

Wikus Pienaar; Wildrich Fourie; Cindy McKellar (North-West University; Centre for Text Technology (CTexT), 2014-12-11) ~ Resource Catalogue

Aligned English-Xitsonga parallel corpus. The data is given as two seperate UTF-8 text files; with each segment on a newline.

Mburisano Covid-19 multilingual corpus

Marais, Laurette (CSIR Voice Computing, 2020-12-04)

This corpus was created to aid development of the AwezaMed Covid-19 speech-to-speech mobile application. The project within which it was created, ...

NCHLT Xitsonga Speech Corpus

Charl van Heerden; Etienne Barnard; Jaco Badenhorst; Marelie Davel; Alta de Waal (Meraka Institute, CSIR; North-West University, 2014-07-08) ~ Resource Catalogue

Orthographically transcribed broadband speech corpus of approximately 56 hours, including a test suite of 8 speakers.

View previous page
1
2
3
4
5
. . .
8
View next page