Search
NCHLT Afrikaans Annotated Text Corpora
(North-West University; Centre for Text Technology (CTexT), 2014-05-30) - Resource Catalogue
Lemmatised, part of speech tagged and morphologically analysed corpora developed during the NCHLT Text project.
Habakuk
(North-West University; Centre for Text Technology (CTexT), 2015-01-30) - Resource Index
Habakuk is a rule-based hyphenator for Afrikaans, which can be implemented in any NLP system. It takes as input a string, and produces as output an ...
Lwazi II Afrikaans TTS Corpus
(Meraka Institute, CSIR; North-West University, 2015-11-20) - Resource Catalogue
Orthographic and phonemically aligned transcriptions
Afrikaanse Speltoetser 3.1
(Centre for Text Technology (CTexT), 2013-07-01) - Resource Index
Afrikaans spelling checker that is compatible with Microsoft Office 2000 and up. This version of CTexT's well-known spelling checker for Afrikaans now ...
AuCoPro Semantics Dataset
(North-West University; Centre for Text Technology (CTexT); CLiPS Research Center, University of Antwerp, Belgium, 2015-01-07) - Resource Catalogue
The AuCoPro Semantics dataset serves for the automatic semantic analysis of compounds. It contains semantically annotated noun-noun compounds (NN) from ...
Afrikaans 20,000 Word Rule Set (v.2007)
(Meraka Institute, CSIR, 2015-02-05) - Resource Index
Grapheme-to-phoneme rule set based on a 20,000-word Afrikaans pronunciation dictionary.
Afrikaans Chunker
(University of South Africa, 2015-01-28) - Resource Index
Chunker for Afrikaans based on memory-based machine learning.
N|uu language archive
(South African Centre for Digital Language Resources, 2022-11-15)
This data collection contains recordings and transcriptions of the N|uu language. This includes N|uu recordings, South African Nama and a local variety ...
Autshumato English-Afrikaans Parallel Corpora
(North-West University; Centre for Text Technology (CTexT), 2013-06-19) - Resource Catalogue
Parallel corpora aligned on sentence level through a combination of automatic and manual alignment techniques. The parallel corpora were obtained from ...
Lwazi Afrikaans TTS corpus
(Meraka Institute, CSIR, 2013-03-27) - Resource Catalogue
Orthographic and phonemically aligned transcriptions