Search
Now showing items 1-10 of 19
Format Normaliser 1.0.
(North-West University; Centre for Text Technology (CTexT), 2015-01-30) ~ - Resource Index
Normalises input files to txt, utf8, replaces smart quotes with straight quotes, removes empty lines, etc.
Ragel
(North-West University; Centre for Text Technology (CTexT), 2015-01-30) ~ - Resource Index
Ragel was developed by using traditional methods for stemming/lemmatisation (i.e. affix stripping), and consists of language-specific rules for identifying ...
Homophone Disambiguatior 1.0.
(North-West University; Centre for Text Technology (CTexT), 2015-01-30) ~ - Resource Index
Developed to disambiguate Afrikaans homophones.
TTS - South African English
(CatchWord, 2015-01-28) ~ - Resource Index
Domain independent TTS system embedded in mobile phone platforms (PDAs, smart phones). , HMM-based speech synthesis. English is male voice. For use in ...
CKarma
(North-West University; Centre for Text Technology (CTexT), 2015-01-30) ~ - Resource Index
CKarma is a compound analyser for Afrikaans, to be used for the detection of word boundaries within compounds. It takes as input a string, and produces ...
Sepedi Grapheme-to-Phoneme Converter
(University of South Africa, 2015-01-28) ~ - Resource Index
Converting morphemes of Sesotho sa Leboa to phonological representations.
Morphosyntactic Tag Set for isiXhosa
(University of South Africa; Gothenburg University, 2015-01-28) ~ - Resource Index
The tagger and the tag set were developed under the auspices of the Spoken Language Corpus Project (Unisa & Gothenburg University) as part of the ...
Hyphenator 1.0.
(North-West University; Centre for Text Technology (CTexT), 2015-01-30) ~ - Resource Index
Rule-based hyphenator which can be implemented in any NLP system. It takes as input a string, and produces as output an analysed string, without any ...
South African English Language Pack
(CatchWord, 2015-01-28) ~ - Resource Index
Speech recognition language pack (language models grammars, lexicons, ) for use with TeliSpeech ™ of Telisma
Sepedi Part of Speech Tagger
(Department of African Languages - University of Pretoria, 2015-01-28) ~ - Resource Index
Sesotho sa Leboa part of speech statistical tagger compiled with stochastic tagger of Helmut Scmidt supported by noun and verb guessing modules, tokenizer, ...