Search
Now showing items 21-30 of 55
IsiXhosa multi-speaker TTS corpus
(MuST, NWU, 2018-02-27) ~ - Resource Index
The aim of this corpus was to investigate the implementation of a high-quality TTS system using multiple voices recorded using a low-cost process (i.e. ...
Multilingual Information Communication Technology Terminology List
(Terminology Coordination Section of the National Language Service, Department of Arts and Culture, 2017-03-03) ~ - Resource Index
132 English source terms with their equivalents in the ten other official South African languages. Originally initiated by the Department of Communications, ...
South African Broadcast News (SABN) Corpus
(Stellenbosch University; CSIR, 2018-02-27) ~ - Resource Index
The corpus consists of approximately 20 hours of audio recordings from one of the country's main radio news channels, SAFM. Bulletins ...
Tagger Parameter file for RF-Tagger (Schmid and Laws 2005)
(Institute for Information Science and Natural Language Processing, University of Hildesheim, Germany, 2018-02-21) ~ - Resource Index
The tagger parameter file is trained on an excerpt of the Pretoria Sepedi Corpus (D. Prinsloo, University of Pretoria): Here, about 5000 tokens were ...
Multilingual HIV/AIDS Terminology List
(Terminology Coordination Section of the National Language Service, Department of Arts and Culture, 2017-02-15) ~ - Resource Index
586 English source terms with their equivalents in the ten other official South African languages. The list was compiled in collaboration with subject ...
DSAE Citations Database
(Dictionary Unit for South African Languages, 2015-01-28) ~ - Resource Index
Citations contained in A Dictionary of South African English on Historical Principles (1996) covering English usage in SA up to 1995, plus new material ...
Multilingual Natural Sciences & Technology Terminology List (Grade 4 - 6)
(Terminology Coordination Section of the National Language Service, Department of Arts and Culture, 2017-03-03) ~ - Resource Index
2756 English source terms with their equivalents in the ten other official South African languages. The list was populated from terms excerpted from ...
CTexT Multilingual Text Corpora
(North-West University; Centre for Text Technology (CTexT), 2015-02-03) ~ - Resource Index
Document level aligned corpora for machine translation purposes.
Unisa South African Spoken and Signed Language Corpus
(University of South Africa, 2018-02-28) ~ - Resource Index
This resource comprises annotated transcriptions of audio and video segments of the Xhosa section of the spoken corpus project SOUTHTALK (Southern African ...
UNISA English/Zulu Parallel Corpus
(University of South Africa, 2018-02-28) ~ - Resource Index
The resource comprises sentence aligned and tokenized parallel text in English and Zulu. The text was extracted from the following sources: an adapted ...