Search
Now showing items 1-10 of 73
Format Normaliser 1.0.
(North-West University; Centre for Text Technology (CTexT), 2015-01-30) ~ - Resource Index
Normalises input files to txt, utf8, replaces smart quotes with straight quotes, removes empty lines, etc.
Ragel
(North-West University; Centre for Text Technology (CTexT), 2015-01-30) ~ - Resource Index
Ragel was developed by using traditional methods for stemming/lemmatisation (i.e. affix stripping), and consists of language-specific rules for identifying ...
Pharos Speltoetser en Woordafbreker
(Pharos Dictionaries, 2013-07-01) ~ - Resource Index
Corrects typing and spelling errors and hyphenate words correctly
UNISA Multilingual Corpus
(University of South Africa, 2018-02-28) ~ - Resource Index
The resource comprises a diverse selection of TEI P5 marked up documents from institutional origin, written in Tswana, Afrikaans and English. This is ...
Afrikaans multi-speaker TTS corpus
(MuST, NWU, 2018-02-27) ~ - Resource Index
The aim of this corpus was to investigate the implementation of a high-quality TTS system using multiple voices recorded using a low-cost process (i.e. ...
Phonetic aligner
(Meraka Institute, CSIR, 2013-07-01) ~ - Resource Index
Scripts for automatic phonetic alignment of speech corpora using hidden markov models (HMMs).
TurboAnnotate1.0
(Centre for Text Technology (CTexT), 2013-07-01) ~ - Resource Index
TurboAnnotate is a user-friendly annotating environment (i.e. tool) for bootstrapping linguistic data for machine-learning purposes, or for manually ...
GNApp (VSep2009)
(Meraka Institute, CSIR, 2013-07-01) ~ - Resource Index
An augmentative and alternate communication (AAC) device which generates synthesised (or pre-recorded) speech as output based on icons. Available as a ...
South African Fonts
(Translate.org.za, 2015-01-28) ~ - Resource Index
The South African fonts collection is a set of open fonts that cover all characters needed by all 11 South African languages. The fonts ensure that all ...
Homophone Disambiguatior 1.0.
(North-West University; Centre for Text Technology (CTexT), 2015-01-30) ~ - Resource Index
Developed to disambiguate Afrikaans homophones.