Search
Now showing items 61-70 of 79
CGE's Afrikaans Gender Terminology List
(Commission for Gender Equality (CGE), 2021-04)
CGE's Afrikaans Gender Terminology List is a list of terms, either words or phrases, related to the promotion of gender equality. All 436 words or phrases ...
NCHLT Afrikaans GloVe embeddings
(North-West University; Centre for Text Technology (CTexT), 2023-05-01)
Static word embedding model based on the Global Vectors architecture (Pennington et al., 2014). The embeddings provide real-valued vector representations ...
Woefzela
(Meraka Institute, CSIR, 2014-07-04) ~ - Resource Catalogue
The primary purpose of the Woefzela software application is to record a list of prompts by a number of different speakers. The resultant output is then ...
Autshumato ITE
(North-West University; Centre for Text Technology (CTexT), 2013-06-19) ~ - Resource Catalogue
Integrated Translation Environment. Combines multiple translation tools into one environment.
USAf National Language Resources Audit 2023
(South African Centre for Digital Language Resources, 2023-10)
This report documents the findings of a comprehensive language resources audit conducted by the South African Centre for Digital Language Resources ...
Generic Multilingual Academic Wordlists with Definitions
(SADiLaR; ICELDA, 2022)
This multilingual generic academic wordlist has been developed to serve as a resource to students to assist with building a vocabulary and decoding ...
CTexTools 2
(North-West University, Centre for Text Technology (CTexT); South African Department of Arts and Culture, 2018-05-24) ~ - Resource Catalogue
CTexTools is a corpus query and manipulation tool primarily for the official South African languages. The tool supports the creation of frequency and ...
Autshumato Machine Translation Evaluation Set
(North-West University; Centre for Text Technology (CTexT); Department of Arts and Culture, South Africa, 2017-12-15) ~ - Resource Catalogue
Comparable evaluation data for use in automatic machine translation evaluations. The evaluation set consists of 500 sentences translated separately by ...
Afrikaans text unit identification data
(Centre for Text Technology, North-West University, 2006) ~ - Resource Catalogue
This dataset was developed during a masters degree and used in the development of a text unit identifier capable of tagging sentences, named-entities, ...
CTexT Afrikaans fastText CBoW String Embeddings
(Centre for Text Technology (CTexT), 2022-01-10)
The CTexT Afrikaans fastText CBoW String Embeddings is a 300 dimensional Afrikaans embedding model based on the Contunious Bag of Words fastText ...