CTexT fastText Skipgram String Embeddings
(Centre for Text Technology (CTexT), 2022-01-10)
The CTexT Afrikaans fastText Skipgram String Embeddings is a 300-dimensional Afrikaans embedding model based on the Skipgram fastText architecture that ...
Habakuk
(North-West University; Centre for Text Technology (CTexT), 2015-01-30) - Resource Index
Habakuk is a rule-based hyphenator for Afrikaans that can be integrated into any NLP system. It takes a string as input and produces as output an ...
Lwazi II Afrikaans TTS Corpus
(Meraka Institute, CSIR; North-West University, 2015-11-20) - Resource Catalogue
Orthographic and phonemically aligned transcriptions
Afrikaanse Speltoetser 3.1
(Centre for Text Technology (CTexT), 2013-07-01) - Resource Index
Afrikaans spelling checker that is compatible with Microsoft Office 2000 and later. This version of CTexT's well-known spelling checker for Afrikaans now ...
AuCoPro Semantics Dataset
(North-West University; Centre for Text Technology (CTexT); CLiPS Research Center, University of Antwerp, Belgium, 2015-01-07) - Resource Catalogue
The AuCoPro Semantics dataset supports the automatic semantic analysis of compounds. It contains semantically annotated noun-noun compounds (NN) from ...
Afrikaans 20,000 Word Rule Set (v.2007)
(Meraka Institute, CSIR, 2015-02-05) - Resource Index
Grapheme-to-phoneme rule set based on a 20,000-word Afrikaans pronunciation dictionary.
Afrikaans Chunker
(University of South Africa, 2015-01-28) - Resource Index
Chunker for Afrikaans based on memory-based machine learning.
NCHLT Afrikaans word2vec-Skipgram embeddings
(North-West University; Centre for Text Technology (CTexT), 2023-05-01)
Static word embeddings for the Skipgram flavour of the word2vec (w2v) architecture (Mikolov et al., 2013). The embedding provides real-valued vector ...
NCHLT Afrikaans FLAIR-backward embeddings
(North-West University; Centre for Text Technology (CTexT), 2023-05-01)
Contextual word/string embeddings for the backward flavour of the FLAIR architecture (Akbik et al., 2018). The embedding provides real-valued vector ...
N|uu language archive
(South African Centre for Digital Language Resources, 2022-11-15)
This data collection contains recordings and transcriptions of the N|uu language. This includes N|uu recordings, South African Nama and a local variety ...