Search
Now showing items 41-50 of 58
Qfrency TTS phone mappings
(CSIR, 2018-03-02) ~ - Resource Index
TTS phone mappings between IPA, XSAMPA and our Qfrency internal format, standardised across all 11 SA languages. To be used in conjunction with the Lwazi ...
CorpusCatcher
(Translate.org.za, 2015-01-28) ~ - Resource Index
Corpus Catcher is a tool that is designed to crawl the web to retrieve data for inclusion in a corpus. It makes use of seed documents/wordlists to ...
Final year high school examination texts of South African home and first additional language subjects
(South African Centre for Digital Language Resources, 2022-11-16)
This data collection consists of reading comprehension and summary
writing texts. The texts comprise of the final year high school exam
texts for ...
Multilingual Illustrated Dictionary with interactive games
(Centre for Text Technology (CTexT); Pharos Dictionaries, 2013-07-01) ~ - Resource Index
Multilingual Illustrated Dictionary with interactive games and pronunciation for 7 of SA's official languages
Bukantswe Sesotho-English Bilingual Dictionary
(North-West University, 2016-07-07) ~ - Resource Catalogue
Bilingual English-Sesotho dictionary. This dataset represents a basic Sesotho dictionary compiled in the creation of a Sesotho language resource. The ...
PSearch 1.1.
(North-West University; Centre for Text Technology (CTexT); Tilburg Centre for Cognition and Communication, 2015-01-30) ~ - Resource Index
PSearch is based on Paramsearch, a tool created by Antal van den Bosch for automatic algorithmic parameter optimisation for TiMBL and other machine ...
EtsaTrans
(University of the Free State, 2015-01-28) ~ - Resource Index
The EtsaTrans machine translation system was build with the intention to do translation of documents from a source language to a target language. Currently ...
Woefzela
(Meraka Institute, CSIR, 2014-07-04) ~ - Resource Catalogue
The primary purpose of the Woefzela software application is to record a list of prompts by a number of different speakers. The resultant output is then ...
Autshumato English-Sesotho Parallel Corpora
(CTexT® (Centre for Text Technology, North-West University), 2022-09-30)
Aligned parallel corpora for the language pair English-Sesotho. The data is given as two separate UTF-8 text files, with each aligned segment on a ...
CTexTools 2
(North-West University, Centre for Text Technology (CTexT); South African Department of Arts and Culture, 2018-05-24) ~ - Resource Catalogue
CTexTools is a corpus query and manipulation tool primarily for the official South African languages. The tool supports the creation of frequency and ...