Format Normaliser 1.0.
(North-West University; Centre for Text Technology (CTexT), 2015-01-30) Resource Index
Normalises input files to plain text (txt) encoded as UTF-8, replaces smart quotes with straight quotes, removes empty lines, etc.
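The normalisation steps named above can be sketched roughly as follows. This is an illustrative sketch only, not the Format Normaliser's actual code: the function name `normalise_text`, the `source_encoding` parameter and the quote table are assumptions made for the example, and the steps the description leaves under "etc." are not covered.

```python
# Hypothetical sketch of the described steps; not the tool's real implementation.

# Assumed mapping of common "smart" quotes to their straight equivalents.
SMART_QUOTES = {
    "\u2018": "'",  # left single quotation mark
    "\u2019": "'",  # right single quotation mark
    "\u201c": '"',  # left double quotation mark
    "\u201d": '"',  # right double quotation mark
}


def normalise_text(raw: bytes, source_encoding: str = "utf-8") -> bytes:
    """Return UTF-8 text with straight quotes and no empty lines."""
    # Decode from the (assumed known) source encoding.
    text = raw.decode(source_encoding)
    # Replace smart quotes with straight quotes.
    text = text.translate(str.maketrans(SMART_QUOTES))
    # Split on any line ending, strip trailing whitespace, drop empty lines.
    lines = [line.rstrip() for line in text.splitlines()]
    kept = [line for line in lines if line]
    if not kept:
        return b""
    # Re-encode as UTF-8 with Unix line endings.
    return ("\n".join(kept) + "\n").encode("utf-8")
```

For example, a Windows-1252 file would be passed as `normalise_text(data, "cp1252")`. Detecting the source encoding automatically is left out of this sketch.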
Sepedi Grapheme-to-Phoneme Converter
(University of South Africa, 2015-01-28) Resource Index
Converts morphemes of Sesotho sa Leboa to phonological representations.
Hyphenator 1.0.
(North-West University; Centre for Text Technology (CTexT), 2015-01-30) Resource Index
Rule-based hyphenator that can be integrated into any NLP system. It takes a string as input and produces an analysed string as output, without any ...
Sepedi Part of Speech Tagger
(Department of African Languages - University of Pretoria, 2015-01-28) Resource Index
Statistical part-of-speech tagger for Sesotho sa Leboa, compiled with the stochastic tagger of Helmut Schmid and supported by noun and verb guessing modules, tokenizer, ...
Sepedi Tokeniser
(University of South Africa, 2015-01-28) Resource Index
Pre-processing for the morphology of Sesotho sa Leboa, a disjunctively written language (morphemes are already separated), as a precursor for morphological ...