Search
Now showing items 31-40 of 47
NCHLT Part of Speech Taggers
(North-West University; Centre for Text Technology (CTexT), 2014-05-30) ~ - Resource Catalogue
Part of speech taggers developed during the NCHLT Text project.
Available for the following languages: Afrikaans, English, isiNdebele, isiXhosa, isiZulu, ...
Lwazi II Sotho Pronunciation Dictionaries
(Meraka Institute, CSIR; North-West University, 2015-11-20) ~ - Resource Catalogue
Pronunciation dictionaries for Sepedi, Sesotho and Setswana with and without affricates, as well as the maps that were used to split the affricates into ...
NCHLT Sesotho Morphological Decomposer
(North-West University; Centre for Text Technology (CTexT), 2014-05-30) ~ - Resource Catalogue
Morphological decomposer developed during the NCHLT Text project.
NCHLT Sesotho Phrase Chunk Annotated Corpus
(North-West University; Centre for Text Technology (CTexT), 2016-04-29) ~ - Resource Catalogue
Phrase chunk annotated data for the NCHLT Text Resource Development: Phase II Project. The phrase chunk annotated data is a subset of the 50,000 tokens ...
Bukantswe Sesotho-English Bilingual Dictionary
(North-West University, 2016-07-07) ~ - Resource Catalogue
Bilingual English-Sesotho dictionary. This dataset represents a basic Sesotho dictionary compiled in the creation of a Sesotho language resource. The ...
Woefzela
(Meraka Institute, CSIR, 2014-07-04) ~ - Resource Catalogue
The primary purpose of the Woefzela software application is to record a list of prompts by a number of different speakers. The resultant output is then ...
NCHLT Sesotho Auxiliary Speech Corpus
(CSIR Meraka Institute; North-West University, 2019-06-01) ~ - Resource Catalogue
The corpus contains orthographically transcribed broadband speech in each of South Africa's eleven official languages. Transcriptions are provided in ...
CTexTools 2
(North-West University, Centre for Text Technology (CTexT); South African Department of Arts and Culture, 2018-05-24) ~ - Resource Catalogue
CTexTools is a corpus query and manipulation tool primarily for the official South African languages. The tool supports the creation of frequency and ...
Autshumato Machine Translation Evaluation Set
(North-West University; Centre for Text Technology (CTexT); Department of Arts and Culture, South Africa, 2017-12-15) ~ - Resource Catalogue
Comparable evaluation data for use in automatic machine translation evaluations. The evaluation set consists of 500 sentences translated separately by ...
Sesotho function word speech data
(Centre for Text Technology, North-West University, 2019-05-28) ~ - Resource Catalogue
The primary aim of this speech data set was to study the role of tone in the function word "ke" in the minimal pairs "ke motho" and in the function word ...