Now showing items 91-100 of 122
MobiLex and SA Trilingual Wine Industry Dictionary
(Stellenbosch University; Winetech; SAWIS, 2018-02-28) ~ - Resource Index
Trilingual LSP dictionary on cellphone and website
Trilingual LSP dictionary on website
Autshumato English-Xitsonga Manually Translated Parallel Corpora
(North-West University; Centre for Text Technology (CTexT), 2014-12-12) ~ - Resource Catalogue
Aligned English-Xitsonga parallel corpus. The data is given as two separate UTF-8 text files, with each segment on a new line.
African Wordnet: Tshivenda 1.0
(UNISA, 2017-06-20) ~ - Resource Catalogue
Developed using the expand model with Princeton WordNet 2.0 as basis. Each wordnet contains synsets with at least the following fields: Word form (lemma; ...
NHN Zulu corpora
(University of the Witwatersrand, 2015-01-07) ~ - Resource Index
A first step to building a corpus of POS-annotated Zulu texts.
Multilingual Election Terminology List
(Terminology Coordination Section of the National Language Service, Department of Arts and Culture, 2017-02-15) ~ - Resource Index
559 English source terms with their equivalents in the ten other official South African languages. The list was compiled in collaboration with subject ...
Siswati Custom Dictionary for Government Domain
(North-West University; Centre for Text Technology (CTexT), 2013-02-22) ~ - Resource Catalogue
Word list developed as a custom dictionary for use in the spelling checkers as part of the spelling checker project for the Department of Arts and ...
Tshivenda Genre Classification Corpus
(Trifonius, 2013-06-19) ~ - Resource Catalogue
Contains training and testing data for Genre Classification for Tshivenda.
Pretoria Tshivenda Corpus
(Department of African Languages - University of Pretoria, 2015-01-27) ~ - Resource Index
Collection of texts for general linguistic research, in particular for lexicography
NCHLT Sepedi Phrase Chunk Annotated Corpus
(North-West University; Centre for Text Technology (CTexT), 2016-04-29) ~ - Resource Catalogue
Phrase chunk annotated data for the NCHLT Text Resource Development: Phase II Project. The phrase chunk annotated data is a subset of the 50,000 tokens ...
Autshumato Multilingual Word and Phrase Translations
(North-West University; Centre for Text Technology (CTexT), 2016-01-20) ~ - Resource Catalogue
Word and phrase lists aligned from English to the other official South African languages.