Search
African Speech Technology Afrikaans Text Corpus
(North-West University; Stellenbosch University; University of Transkei; University of Free State (Qwa-Qwa campus); Rhodes University; University of KwaZulu-Natal; University of Western Cape, 2015-01-07) ~ - Resource Catalogue
Monolingual text corpus developed during the African Speech Technology project.
Little Dictionary on Cellphone
(Pharos Dictionaries, 2013-07-01) ~ - Resource Index
Afrikaans-English dictionary with 30 000 lemmas, suitable for the general market.
NCHLT Afrikaans fastText-CBoW embeddings
(North-West University; Centre for Text Technology (CTexT), 2023-05-01)
Static word and subword embeddings for the continuous bag of words (CBoW) flavour of the fastText architecture (Bojanowski et al., 2017).
The embedding ...
KALAS
(North-West University; Centre for Text Technology (CTexT), 2015-01-30) ~ - Resource Index
KALAS is a rule-based compound analyser for Afrikaans, to be used for the detection of word boundaries within compounds. It takes as input a string, and ...
Final year high school examination texts of South African home and first additional language subjects
(South African Centre for Digital Language Resources, 2022-11-16)
This data collection consists of reading comprehension and summary writing texts. The texts comprise the final year high school exam texts for ...
NCHLT Part of Speech Taggers
(North-West University; Centre for Text Technology (CTexT), 2014-05-30) ~ - Resource Catalogue
Part of speech taggers developed during the NCHLT Text project.
Available for the following languages: Afrikaans, English, isiNdebele, isiXhosa, isiZulu, ...
WAT quotation collection
(N/A, 2022-10-14)
Collection of short quotations/excerpts from a variety of books (fiction, non-fiction & academic).
Multilingual Illustrated Dictionary with interactive games
(Centre for Text Technology (CTexT); Pharos Dictionaries, 2013-07-01) ~ - Resource Index
Multilingual Illustrated Dictionary with interactive games and pronunciation for 7 of SA's official languages
Autshumato Monolingual Afrikaans Corpus
(CTexT® (Centre for Text Technology, North-West University), 2022-09-30)
Monolingual corpus for Afrikaans. The data is given as a single UTF-8 text file, with each segment on a new line. The data was specifically selected and ...
NCHLT Afrikaans Phrase Chunk Annotated Corpus
(North-West University; Centre for Text Technology (CTexT), 2016-04-29) ~ - Resource Catalogue
Phrase chunk annotated data for the NCHLT Text Resource Development: Phase II Project. The phrase chunk annotated data is a subset of the 50,000 tokens ...