Search
Now showing items 21-30 of 57
Autshumato Text Anonymiser
(North-West University; Centre for Text Technology (CTexT), 2013-06-20) ~ - Resource Catalogue
Anonymises text by classifying and replacing sensitive information such as person names, business names, place names, monetary values, phone numbers, ...
Autshumato English-isiZulu Parallel Corpora
(North-West University; Centre for Text Technology (CTexT), 2013-06-19) ~ - Resource Catalogue
Parallel corpora aligned on sentence level through a combination of automatic and manual alignment techniques. The parallel corpora were obtained from ...
SADE v.1.0 Platform
(North-West University; Molo Afrika Speech Technologies; IntSyst Labs CC, 2015-09-07) ~ - Resource Catalogue
End-to-end directoy enquiries application (using Asterisk, UniMRPC and Kaldi). The municipality hotline example is implemented as an Asterisk Gateway ...
Lwazi II Cross-lingual Proper Name Corpus
(Meraka Institute, CSIR; North-West University, 2015-11-20) ~ - Resource Catalogue
Prompted audio recordings of personal names in different languages, produced by 20 speakers with different language backgrounds.
Bilingual English-isiXhosa corpus
(North-West University - Centre for Text Technology (CTexT), 2019-11-30) ~ - Resource Catalogue
Aligned parallel corpora for the following language pair: English-isiXhosa.
The data is given as two separate UTF-8 text files, with each segment on a ...
South African Directory Enquiries (SADE) Name Corpus
(North-West University; Molo Afrika Speech Technologies; IntSyst Labs CC, 2015-09-07) ~ - Resource Catalogue
"Audio and tagged orthographic transcriptions of South African names produced by first-language speakers of 4 languages: Afrikaans, English, isiZulu, ...
African Speech Technology Black-English Speech Corpus
(North-West University; Stellenbosch University; University of Transkei; University of Free State (Qwa-Qwa campus); Rhodes University; University of KwaZulu-Natal; University of Western Cape, 2014-12-11) ~ - Resource Catalogue
African Speech Technology speech and transcription data for the Black-English database. The "speech" directory contains English speech as spoken by ...
SPCS Speech Corpus
(Council for Scientific and Industrial Research; North-West University, 2015-11-25) ~ - Resource Catalogue
Broadband speech corpus of approximately 10 hours and the corresponding transcriptions.
The development process of the corpus involved the recording ...
DictionaryMaker
(Meraka Institute, CSIR, 2013-07-15) ~ - Resource Catalogue
The purpose of the DictionaryMaker system is to facilitate the creation of an electronic pronunciation dictionary in a target language, as originally ...
Autshumato Afrikaans-English Translation Memory
(North-West University; Centre for Text Technology (CTexT), 2013-06-19) ~ - Resource Catalogue
Translation memory from Afrikaans to English (EN-GB), in the government domain for use in the Autshumato ITE application.