Search
Now showing items 1-10 of 30
AuCoPro Splitting Dataset
(North-West University; Centre for Text Technology (CTexT); Tilburg Centre for Cognition and Communication, 2015-01-07) ~ - Resource Catalogue
The AuCoPro Splitting dataset contains compounds annotated with their compound boundaries and linking morphemes for Afrikaans and Dutch.
UNISA Multilingual Corpus
(University of South Africa, 2018-02-28) ~ - Resource Index
The resource comprises a diverse selection of TEI P5 marked up documents from institutional origin, written in Tswana, Afrikaans and English. This is ...
Afrikaans Pronunciation Dictionary
(Meraka Institute, CSIR; Stellenbosch University, 2015-01-28) ~ - Resource Index
Pronunciation dictionary compiled from Lwazi ASR dictionary and the "Taalkommissie Korpus". Dictionary to be used for the development of a large vocabulary ...
NCHLT Afrikaans Text Corpora
(North-West University; Centre for Text Technology (CTexT), 2014-05-30) ~ - Resource Catalogue
Collection of source text documents, genre classified text documents, raw corpus, clean corpus, lexicon, frequency list and named-entity lists developed ...
Multilingual Soccer Terminology List
(Terminology Coordination Section of the National Language Service, Department of Arts and Culture, 2017-03-03) ~ - Resource Index
297 English source terms with their equivalents in the ten other official South African languages. On the eve of the 2010 FIFA World Cup, the list was ...
Multilingual Mathematics Terminology List (Grade R - 6)
(Terminology Coordination Section of the National Language Service, Department of Arts and Culture, 2017-03-03) ~ - Resource Index
984 English source terms with their equivalents in the ten other official South African languages. The list was compiled in collaboration with subject ...
Autshumato English-Afrikaans Translation Memory
(North-West University; Centre for Text Technology (CTexT), 2013-06-19) ~ - Resource Catalogue
Translation memory from English (EN-GB) to Afrikaans, in the government domain for use in the Autshumato ITE application.
Afrikaans Part of Speech Data
(North-West University; Centre for Text Technology (CTexT), 2015-01-30) ~ - Resource Index
POS annotated data used to train POS tagger. The tagset was specifically designed for Afrikaans and consists of 139 pos-tags.
Afrikaans Wordnet 1.0
(North-West University; Centre for Text Technology (CTexT), 2015-02-05) ~ - Resource Index
The Afrikaans WordNet is a lexical reference source displaying some similarities with dictionaries and thesauri. Together with ALEXANDER, our other ...
Multilingual Information Communication Technology Terminology List
(Terminology Coordination Section of the National Language Service, Department of Arts and Culture, 2017-03-03) ~ - Resource Index
132 English source terms with their equivalents in the ten other official South African languages. Originally initiated by the Department of Communications, ...