Now showing items 1-10 of 13
Format Normaliser 1.0.
(North-West University; Centre for Text Technology (CTexT), 2015-01-30) ~ - Resource Index
Normalises input files to plain text (UTF-8), replaces smart quotes with straight quotes, removes empty lines, etc.
Ragel
(North-West University; Centre for Text Technology (CTexT), 2015-01-30) ~ - Resource Index
Ragel was developed by using traditional methods for stemming/lemmatisation (i.e. affix stripping), and consists of language-specific rules for identifying ...
Homophone Disambiguator 1.0.
(North-West University; Centre for Text Technology (CTexT), 2015-01-30) ~ - Resource Index
Developed to disambiguate Afrikaans homophones.
CKarma
(North-West University; Centre for Text Technology (CTexT), 2015-01-30) ~ - Resource Index
CKarma is a compound analyser for Afrikaans, to be used for the detection of word boundaries within compounds. It takes as input a string, and produces ...
Sepedi Grapheme-to-Phoneme Converter
(University of South Africa, 2015-01-28) ~ - Resource Index
Converts morphemes of Sesotho sa Leboa to phonological representations.
Hyphenator 1.0.
(North-West University; Centre for Text Technology (CTexT), 2015-01-30) ~ - Resource Index
Rule-based hyphenator which can be implemented in any NLP system. It takes as input a string, and produces as output an analysed string, without any ...
Habakuk
(North-West University; Centre for Text Technology (CTexT), 2015-01-30) ~ - Resource Index
Habakuk is a rule-based hyphenator for Afrikaans, which can be implemented in any NLP system. It takes as input a string, and produces as output an ...
Sepedi Tokeniser
(University of South Africa, 2015-01-28) ~ - Resource Index
Pre-processing for the morphology of Sesotho sa Leboa, a disjunctively written language (morphemes are already separated), as a precursor for morphological ...
Afrikaans Chunker
(University of South Africa, 2015-01-28) ~ - Resource Index
Chunker for Afrikaans based on memory-based machine learning.
Calomo
(North-West University; Centre for Text Technology (CTexT), 2015-01-30) ~ - Resource Index
Calomo is a hyphenator for Afrikaans, which can be implemented in any NLP system. It takes as input a string, and produces as output an analysed string, ...
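The Format Normaliser entry above describes converting input to UTF-8 text, replacing smart quotes with straight quotes and removing empty lines. The kind of normalisation described might look roughly like the sketch below. This is not the CTexT implementation: the function name `normalise_text`, the quote table and the choice to treat whitespace-only lines as empty are all assumptions made for illustration.

```python
# Hypothetical sketch of the normalisation steps named in the listing;
# not the actual Format Normaliser code.

# Assumed mapping of common "smart" quotes to their straight equivalents.
SMART_QUOTES = {
    "\u2018": "'",   # left single quotation mark
    "\u2019": "'",   # right single quotation mark
    "\u201c": '"',   # left double quotation mark
    "\u201d": '"',   # right double quotation mark
}


def normalise_text(raw: bytes, encoding: str = "utf-8") -> str:
    """Decode bytes, straighten quotes, and drop empty lines."""
    # Undecodable bytes become U+FFFD rather than raising an error.
    text = raw.decode(encoding, errors="replace")
    text = text.translate(str.maketrans(SMART_QUOTES))
    # Strip trailing whitespace; whitespace-only lines count as empty here.
    kept = [line.rstrip() for line in text.splitlines()]
    kept = [line for line in kept if line]
    return "\n".join(kept) + ("\n" if kept else "")
```

Calling `normalise_text(open(path, "rb").read())` on a file (with `path` standing in for any input file) returns a string that can be written back out with `encoding="utf-8"`.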