Search
Now showing items 41-50 of 87
NCHLT isiNdebele Phrase Chunk Annotated Corpus
(North-West University; Centre for Text Technology (CTexT), 2016-04-29) ~ - Resource Catalogue
Phrase chunk annotated data for the NCHLT Text Resource Development: Phase II Project. The phrase chunk annotated data is a subset of the 50,000 tokens ...
Lwazi isiNdebele ASR corpus
(Meraka Institute, CSIR, 2013-04-02) ~ - Resource Catalogue
Complete audio recordings and orthographic transcriptions used for Lwazi speech recognition systems.
Rhonda
(CSIR, 2018-03-02) ~ - Resource Index
Rhonda is a speech to speech translation system that connects individual HLTs together in a secure and accessible way. A user may access individual HLTs ...
Autshumato Monolingual isiNdebele Corpus
(North-West University; Centre for Text Technology (CTexT), 2021-01-31)
Monolingual corpus for isiNdebele. The data is given as a single UTF-8 text file, with each segment on a newline.
NCHLT isiNdebele Named Entity Annotated Corpus
(North-West University; Centre for Text Technology (CTexT), 2016-04-29) ~ - Resource Catalogue
Named entity annotated data from the NCHLT Text Resource Development: Phase II Project, annotated with PERSON, LOCATION, ORGANISATION and MISCELLANEOUS tags.
NCHLT isiNdebele Morphological Decomposer
(North-West University; Centre for Text Technology (CTexT), 2014-05-30) ~ - Resource Catalogue
Morphological decomposer developed during the NCHLT Text project.
NCHLT isiNdebele Auxiliary Speech Corpus
(CSIR Meraka Institute; North-West University, 2019-06-01) ~ - Resource Catalogue
The corpus contains orthographically transcribed broadband speech in each of South Africa's eleven official languages. Transcriptions are provided in ...
Speect
(Meraka Institute, CSIR, 2013-07-15) ~ - Resource Catalogue
Speect is a multilingual text-to-speech (TTS) system. It offers a full TTS system (text analysis which decodes the text, and speech synthesis, which ...
Multilingual Life Orientation Intermediate Phase Terminology List
(Terminology Coordination Section of the National Language Service, Department of Arts and Culture, 2017-03-03) ~ - Resource Index
1628 English source terms with their equivalents in the ten other official South African languages. The terms were excerpted from life orientation ...
Multilingual Parliamentary / Political Terminology List
(Terminology Coordination Section of the National Language Service, Department of Arts and Culture, 2017-03-03) ~ - Resource Index
502 English source terms with their equivalents in the ten other official South African languages. The project built on a 2003 initiative of the national ...