Now showing items 21-30 of 53
CTexT Multilingual Text Corpora
(North-West University; Centre for Text Technology (CTexT), 2015-02-03) ~ - Resource Index
Document-level aligned corpora for machine translation purposes.
Afrikaans Custom Dictionary for Government Domain
(North-West University; Centre for Text Technology (CTexT), 2013-02-22) ~ - Resource Catalogue
Word list developed as a custom dictionary for use in the spelling checkers as part of the spelling checker project for the Department of Arts and ...
NCHLT Afrikaans Named Entity Annotated Corpus
(North-West University; Centre for Text Technology (CTexT), 2016-04-29) ~ - Resource Catalogue
Named entity annotated data from the NCHLT Text Resource Development: Phase II Project, annotated with PERSON, LOCATION, ORGANISATION and MISCELLANEOUS tags.
Lexi
(WAT (Afrikaans NLU), 2015-01-28) ~ - Resource Index
Citation database that replaces and supplements old index cards. Contains citations and source references. Citations are used in WAT as necessary.
Lwazi II Cross-lingual Proper Name Corpus
(Meraka Institute, CSIR; North-West University, 2015-11-20) ~ - Resource Catalogue
Prompted audio recordings of personal names in different languages, produced by 20 speakers with different language backgrounds.
Qfrency TTS Afrikaans Maryna recordings
(CSIR, 2018-03-07) ~ - Resource Index
Studio-quality recordings of text-to-speech data in Afrikaans and some English utterances. Professional Afrikaans first-language voice artist.
Afrikaans Genre Classification Corpus
(Trifonius, 2013-06-19) ~ - Resource Catalogue
Contains training and testing data for Genre Classification for Afrikaans.
Qfrency TTS Afrikaans Kobus recordings
(CSIR, 2018-03-07) ~ - Resource Index
Studio-quality recordings of text-to-speech data in Afrikaans and some English utterances. Professional Afrikaans first-language voice artist.
Lwazi Afrikaans Pronunciation Dictionary
(Meraka Institute, CSIR, 2013-04-01) ~ - Resource Catalogue
General phonemic pronunciations for frequently occurring words in SA languages. Dictionaries were developed to be practically usable for speech technology ...
South African Directory Enquiries (SADE) Name Corpus
(North-West University; Molo Afrika Speech Technologies; IntSyst Labs CC, 2015-09-07) ~ - Resource Catalogue
Audio and tagged orthographic transcriptions of South African names produced by first-language speakers of 4 languages: Afrikaans, English, isiZulu, ...