Now showing items 11-20 of 54
Sepedi Grapheme-to-Phoneme Converter
(University of South Africa, 2015-01-28) ~ - Resource Index
Converting morphemes of Sesotho sa Leboa to phonological representations.
Autshumato English-Sesotho sa Leboa Translation Memory
(North-West University; Centre for Text Technology (CTexT), 2013-06-19) ~ - Resource Catalogue
Translation memory from English (EN-GB) to Sesotho sa Leboa, in the government domain for use in the Autshumato ITE application.
Lara2
(North-West University; Centre for Text Technology (CTexT), 2014-05-30) ~ - Resource Catalogue
Tool for annotating texts with lemma, part of speech and morphological analysis information
Autshumato English-Sesotho sa Leboa Parallel Corpora
(North-West University; Centre for Text Technology (CTexT), 2013-06-19) ~ - Resource Catalogue
Parallel corpora aligned on sentence level through a combination of automatic and manual alignment techniques. The parallel corpora were obtained from ...
NCHLT Sepedi Lemmatiser
(North-West University; Centre for Text Technology (CTexT), 2014-05-30) ~ - Resource Catalogue
Lemmatiser developed during the NCHLT Text project.
Available in the Readme.txt - Input format: Text data (encoding: UTF8 without BOM), one ...
Multilingual Information Communication Technology Terminology List
(Terminology Coordination Section of the National Language Service, Department of Arts and Culture, 2017-03-03) ~ - Resource Index
132 English source terms with their equivalents in the ten other official South African languages. Originally initiated by the Department of Communications, ...
Hyphenator 1.0.
(North-West University; Centre for Text Technology (CTexT), 2015-01-30) ~ - Resource Index
Rule-based hyphenator which can be implemented in any NLP system. It takes as input a string, and produces as output an analysed string, without any ...
Tagger Parameter file for RF-Tagger (Schmid and Laws 2005)
(Institute for Information Science and Natural Language Processing, University of Hildesheim, Germany, 2018-02-21) ~ - Resource Index
The tagger parameter file was trained on an excerpt of the Pretoria Sepedi Corpus (D. Prinsloo, University of Pretoria). About 5000 tokens were ...
Multilingual HIV/AIDS Terminology List
(Terminology Coordination Section of the National Language Service, Department of Arts and Culture, 2017-02-15) ~ - Resource Index
586 English source terms with their equivalents in the ten other official South African languages. The list was compiled in collaboration with subject ...
NCHLT Sepedi Morphological Decomposer
(North-West University; Centre for Text Technology (CTexT), 2014-05-30) ~ - Resource Catalogue
Morphological decomposer developed during the NCHLT Text project.