Search
Now showing items 21-30 of 34
Habakuk
(North-West University; Centre for Text Technology (CTexT), 2015-01-30) ~ - Resource Index
Habakuk is a rule-based hyphenator for Afrikaans, which can be implemented in any NLP system. It takes as input a string, and produces as output an ...
NCHLT isiNdebele Morphological Decomposer
(North-West University; Centre for Text Technology (CTexT), 2014-05-30) ~ - Resource Catalogue
Morphological decomposer developed during the NCHLT Text project.
Sepedi Tokeniser
(University of South Africa, 2015-01-28) ~ - Resource Index
Pre-processing for Sesotho sa Leboa morphology as a disjunctively written language. (morphemes are already separated) as pre-cursor for morphological ...
Afrikaans Chunker
(University of South Africa, 2015-01-28) ~ - Resource Index
Chunker for Afrikaans based on memory-based machine learning.
Calomo
(North-West University; Centre for Text Technology (CTexT), 2015-01-30) ~ - Resource Index
Calomo is a hyphenator for Afrikaans, which can be implemented in any NLP system. It takes as input a string, and produces as output an analysed string, ...
Afrikaans Trajectories
(Department of African Languages - University of Pretoria, 2015-01-28) ~ - Resource Index
Frequency analysis of Afrikaans words in the Media24 Archive from 1980-2003 presented in the form of total frequency per word in 5 year intervals as ...
Morphosyntactic Drag-and-Drop Tagger for isiXhosa
(University of South Africa; Gothenburg University, 2015-01-28) ~ - Resource Index
The tagger and the tag set were developed under the auspices of the Spoken Language Corpus Project (Unisa & Gothenburg University) as part of the ...
NCHLT isiXhosa Morphological Decomposer
(North-West University; Centre for Text Technology (CTexT), 2014-05-30) ~ - Resource Catalogue
Morphological decomposer developed during the NCHLT Text project.
NCHLT Tshivenda Lemmatiser
(North-West University; Centre for Text Technology (CTexT), 2014-05-30) ~ - Resource Catalogue
Lemmatiser developed during the NCHLT Text project.
\n\n
Available in the Readme.txt - Input format: Text data (encoding: UTF8 without BOM), one ...
KALAS
(North-West University; Centre for Text Technology (CTexT), 2015-01-30) ~ - Resource Index
KALAS is a rule-based compound analyser for Afrikaans, to be used for the detection of word boundaries within compounds. It takes as input a string, and ...