Search
Now showing items 41-50 of 227
NCHLT Sesotho Speech Corpus
(Meraka Institute, CSIR; North-West University, 2014-07-08) ~ - Resource Catalogue
Orthographically transcribed broadband speech corpus of approximately 56 hours, including a test suite of 8 speakers.
NCHLT isiZulu Named Entity Annotated Corpus
(North-West University; Centre for Text Technology (CTexT), 2016-04-29) ~ - Resource Catalogue
Named entity annotated data from the NCHLT Text Resource Development: Phase II Project, annotated with PERSON, LOCATION, ORGANISATION and MISCELLANEOUS tags.
NCHLT isiXhosa Named Entity Annotated Corpus
(North-West University; Centre for Text Technology (CTexT), 2016-04-29) ~ - Resource Catalogue
Named entity annotated data from the NCHLT Text Resource Development: Phase II Project, annotated with PERSON, LOCATION, ORGANISATION and MISCELLANEOUS tags.
Lwazi III isiZulu TTS Corpus
(Meraka Institute, CSIR, 2016-06-17) ~ - Resource Catalogue
Complete audio recordings with orthographic transcriptions. TTS corpus for standard SA dialect. This corpus was created to enable the building of a TTS voice.
NCHLT Setswana Annotated Text Corpora
(North-West University; Centre for Text Technology (CTexT), 2014-05-30) ~ - Resource Catalogue
Lemmatised, part of speech tagged and morphologically analysed corpora developed during the NCHLT Text project.
Lwazi Setswana TTS corpus
(Meraka Institute, CSIR, 2013-03-27) ~ - Resource Catalogue
Orthographic and phonemically aligned transcriptions
Multilingual Mathematics Terminology List (Grade R - 6)
(Terminology Coordination Section of the National Language Service, Department of Arts and Culture, 2017-03-03) ~ - Resource Index
984 English source terms with their equivalents in the ten other official South African languages. The list was compiled in collaboration with subject ...
Pretoria Sepedi Corpus (Gold Standard)
(Department of African Languages - University of Pretoria, 2015-01-27) ~ - Resource Index
A section of the Pretoria Sepedi Corpus for POS, manually checked for POS tags.
Tshivenda Speech Corpus
(University of Limpopo (Turfloop Campus), 2015-01-27) ~ - Resource Index
A telephone speech database for Tshivenda ASR system was created from transcriptions of speech data collected. Prompted and some some speech
African Speech Technology Afrikaans-Afrikaans Speech Corpus
(North-West University; Stellenbosch University; University of Transkei; University of Free State (Qwa-Qwa campus); Rhodes University; University of KwaZulu-Natal; University of Western Cape, 2014-12-10) ~ - Resource Catalogue
African Speech Technology speech and transcription data for the Afrikaans-Afrikaans database. The "speech" directory contains Afrikaans speech as spoken ...