Search
Now showing items 31-40 of 85
CTexT Multilingual Text Corpora
(North-West University; Centre for Text Technology (CTexT), 2015-02-03) ~ - Resource Index
Document level aligned corpora for machine translation purposes.
Lwazi II Cross-lingual Proper Name Corpus
(Meraka Institute, CSIR; North-West University, 2015-11-20) ~ - Resource Catalogue
Prompted audio recordings of personal names in different languages, produced by 20 speakers with different language backgrounds.
Generic Bilingual Academic Wordlist with Definitions
(ICELDA; SADiLaR, 2021)
The academic wordlist has been developed to serve as a resource to students to assist them to better understand words used within the information they ...
Qfrency TTS Afrikaans Maryna recordings
(CSIR, 2018-03-07) ~ - Resource Index
Studio quality recordings of text-to-speech data in Afrikaans and some English utterances. Professional Afrikaans first language voice artist.
AST Prototype Demonstrators
(CatchWord, 2013-07-01) ~ - Resource Index
South African English prototype of an automated hotel information and reservation system developed during the AST project. The prototype system was ...
Afrikaans-English-Xhosa-Zulu Aid on Cellphone
(Pharos Dictionaries, 2013-07-01) ~ - Resource Index
Translations for words in 4 languages
Qfrency TTS Afrikaans Kobus recordings
(CSIR, 2018-03-07) ~ - Resource Index
Studio quality recordings of text-to-speech data in Afrikaans and some English utterances. Professional Afrikaans first language voice artist.
South African Directory Enquiries (SADE) Name Corpus
(North-West University; Molo Afrika Speech Technologies; IntSyst Labs CC, 2015-09-07) ~ - Resource Catalogue
"Audio and tagged orthographic transcriptions of South African names produced by first-language speakers of 4 languages: Afrikaans, English, isiZulu, ...
Human Language Technology Audit 2017/18
(CSIR, 2018-08-31)
This document reports on all work conducted in the 2017/18 Audit of human language technology (HLT) resources available in South Africa project. The ...
Autshumato English-Afrikaans Parallel Corpora
(CTexT® (Centre for Text Technology, North-West University), 2022-09-30)
Aligned parallel corpora for the language pair English-Afrikaans. The data is given as two separate UTF-8 text files, with each aligned segment on a ...