Search
Now showing items 21-30 of 62
Multilingual Information Communication Technology Terminology List
(Terminology Coordination Section of the National Language Service, Department of Arts and Culture, 2017-03-03) ~ - Resource Index
132 English source terms with their equivalents in the ten other official South African languages. Originally initiated by the Department of Communications, ...
Hyphenator 1.0.
(North-West University; Centre for Text Technology (CTexT), 2015-01-30) ~ - Resource Index
Rule-based hyphenator which can be implemented in any NLP system. It takes as input a string, and produces as output an analysed string, without any ...
Multilingual HIV/AIDS Terminology List
(Terminology Coordination Section of the National Language Service, Department of Arts and Culture, 2017-02-15) ~ - Resource Index
586 English source terms with their equivalents in the ten other official South African languages. The list was compiled in collaboration with subject ...
NCHLT isiZulu Lemmatiser
(North-West University; Centre for Text Technology (CTexT), 2014-05-30) ~ - Resource Catalogue
Lemmatiser developed during the NCHLT Text project.
\n\n
Available in the Readme.txt - Input format: Text data (encoding: UTF8 without BOM), one ...
Multilingual Natural Sciences & Technology Terminology List (Grade 4 - 6)
(Terminology Coordination Section of the National Language Service, Department of Arts and Culture, 2017-03-03) ~ - Resource Index
2756 English source terms with their equivalents in the ten other official South African languages. The list was populated from terms excerpted from ...
Autshumato Text Anonymiser
(North-West University; Centre for Text Technology (CTexT), 2013-06-20) ~ - Resource Catalogue
Anonymises text by classifying and replacing sensitive information such as person names, business names, place names, monetary values, phone numbers, ...
CTexT Multilingual Text Corpora
(North-West University; Centre for Text Technology (CTexT), 2015-02-03) ~ - Resource Index
Document level aligned corpora for machine translation purposes.
UNISA English/Zulu Parallel Corpus
(University of South Africa, 2018-02-28) ~ - Resource Index
The resource comprises sentence aligned and tokenized parallel text in English and Zulu. The text was extracted from the following sources: an adapted ...
Autshumato English-isiZulu Parallel Corpora
(North-West University; Centre for Text Technology (CTexT), 2013-06-19) ~ - Resource Catalogue
Parallel corpora aligned on sentence level through a combination of automatic and manual alignment techniques. The parallel corpora were obtained from ...
Translate.org.za isiZulu - isiXhosa Corpus 2012
(Translate.org.za, 2013-06-19) ~ - Resource Catalogue
isiZulu-isiXhosa translation memory.