Search
Now showing items 201-210 of 227
Qfrency TTS phone mappings
(CSIR, 2018-03-02) ~ - Resource Index
TTS phone mappings between IPA, XSAMPA and our Qfrency internal format, standardised across all 11 SA languages. To be used in conjunction with the Lwazi ...
Lwazi II Sotho Pronunciation Dictionaries
(Meraka Institute, CSIR; North-West University, 2015-11-20) ~ - Resource Catalogue
Pronunciation dictionaries for Sepedi, Sesotho and Setswana with and without affricates, as well as the maps that were used to split the affricates into ...
African Wordnet: Sesotho sa Leboa 1.0
(UNISA, 2017-06-20) ~ - Resource Catalogue
Developed using the expand model with Princeton WordNet 2.0 as basis. Each wordnet contains synsets with at least the following fields:\nWord form (lemma; ...
Dictionary of South African English
(Dictionary Unit for South African English; Rhodes University, 2018-02-05) ~ - Resource Index
Full online edition of A Dictionary of South African English on Historical Principles (Silva et al, Oxford University Press, 1996), showing language of ...
NCHLT isiNdebele Text Corpora
(North-West University; Centre for Text Technology (CTexT), 2014-05-30) ~ - Resource Catalogue
Collection of source text documents, genre classified text documents, raw corpus, clean corpus, lexicon, frequency list and named-entity lists developed ...
NCHLT Xitsonga Text Corpora
(North-West University; Centre for Text Technology (CTexT), 2014-05-30) ~ - Resource Catalogue
Collection of source text documents, genre classified text documents, raw corpus, clean corpus, lexicon, frequency list and named-entity lists developed ...
Pretoria Sepedi Corpus POS tagged
(Department of African Languages - University of Pretoria, 2015-01-27) ~ - Resource Index
The tagged Pretoria Sepedi Corpus for part-of-speech (POS) tagging. For grammtical anlysis morphological analysis , lexical , syntax
NCHLT Sesotho Phrase Chunk Annotated Corpus
(North-West University; Centre for Text Technology (CTexT), 2016-04-29) ~ - Resource Catalogue
Phrase chunk annotated data for the NCHLT Text Resource Development: Phase II Project. The phrase chunk annotated data is a subset of the 50,000 tokens ...
NCHLT isiXhosa Text Corpora
(North-West University; Centre for Text Technology (CTexT), 2014-05-30) ~ - Resource Catalogue
Collection of source text documents, genre classified text documents, raw corpus, clean corpus, lexicon, frequency list and named-entity lists developed ...
Bukantswe Sesotho-English Bilingual Dictionary
(North-West University, 2016-07-07) ~ - Resource Catalogue
Bilingual English-Sesotho dictionary. This dataset represents a basic Sesotho dictionary compiled in the creation of a Sesotho language resource. The ...