Search
Now showing items 221-230 of 238
Autshumato Machine Translation Evaluation Set
(North-West University; Centre for Text Technology (CTexT); Department of Arts and Culture, South Africa, 2017-12-15) ~ - Resource Catalogue
Comparable evaluation data for use in automatic machine translation evaluations. The evaluation set consists of 500 sentences translated separately by ...
Afrikaans text unit identification data
(Centre for Text Technology, North-West University, 2006) ~ - Resource Catalogue
This dataset was developed during a masters degree and used in the development of a text unit identifier capable of tagging sentences, named-entities, ...
Setswana Test suite and Treebank
(North-West University, 2018-03-27) ~ - Resource Catalogue
The main aim of the PhD study "A computational syntactic analysis of Setswana"(AS Berg, May 2018) is the computational syntactic analysis of the Setswana ...
Sesotho function word speech data
(Centre for Text Technology, North-West University, 2019-05-28) ~ - Resource Catalogue
The primary aim of this speech data set was to study the role of tone in the function word "ke" in the minimal pairs "ke motho" and in the function word ...
Afribooms Afrikaans Dependency Treebank
(North-West University; Centre for Text Technology (CTexT); Katholieke Universiteit Leuven (Belgium), 2015-02-10) ~ - Resource Catalogue
This is the annotated corpus developed for Afrikaans for the Afribooms project. The corpus includes annotations for lemma, part-of-speech (POS) and ...
Read Afrikaans Normal/ Read Afrikaans Fast
(Centre for Text Technology, North-West University, 2019-05-28) ~ - Resource Catalogue
The corpus contains speech of 127 mother tongue speakers of Afrikaans. Every speaker was asked to read a text fragment from a book or a newspaper (about ...
Lagos-NWU Yoruba Speech Corpus
(North-West University; Centre for Text Technology (CTexT); University of Lagos (Nigeria), 2015-02-06) ~ - Resource Catalogue
This speech corpus consisting of 16 female speakers and 17 male speakers was recorded in Lagos, Nigeria for the purpose of speech recognition research. ...
Sesotho vowel speech data set
(Centre for Text Technology, North-West University, 2019-05-28) ~ - Resource Catalogue
The primary aim of this speech dataset was to collect a representative set of words in which all the Sesotho vowels are present. Some of them are ...
Sesotho tone data set
(Centre for Text Technology, North-West University, 2019-05-28) ~ - Resource Catalogue
These recordings are of male and female speakers (11 for tasks 1 and 2; 10 for task 3) of the QwaQwa region (Eastern Free State). Ages of the speakers ...
W-NORM
(North-West University; Centre for Text Technology (CTexT), 2015-06-30) ~ - Resource Catalogue
W-NORM is a graphical user interface (GUI), written in Perl and GTK2, for the Vowels 1.2 package. Vowels 1.2 is written in the R programming language ...