SADiLaR Language Resource Repository: Recent submissions
Now showing items 121-130 of 529
-
Afrikaans morphological evaluative constructions dataset
(North-West University, 2022-11-04)A dataset of Afrikaans morphological evaluative constructions (MECs) and their word frequency classes. The MECs have been compiled using extracted ... -
WAT quotation collection
(N/A, 2022-10-14)Collection of short quotations/excerpts from a variety of books (fiction, non-fiction & academic). -
WAT word slips collection
(Bureau of the WAT, 2022-10-14)Approximately 3,5 million index cards containing Afrikaans words (lemmas), meanings and expressions, as well as example sentences (quotations) that has ... -
N|uu language archive
(Collins, Chris; Sands, Bonny; Jones, Kerry, 2022-08-11)This collection contains information that forms the basis of the N|uu dictionary which contains a word list for N|uu with translations into Afrikaans, ... -
CGE's Sesotho Gender Terminology List
(Commission for Gender Equality (CGE), 2018)CGE's Sesotho Gender Terminology List is a list of terms, either words or phrases, related to the promotion of gender equality. All 446 words or phrases ... -
Proof of concept: Afrikaans English Venda E-dictionary
(Published as a Lexonomy dictionary (https://www.lexonomy.eu/POCVenEngAfr/), 2022-03-04)This proof of concept is a result of an experiment to compile a trilingual e-dictionary for Afrikaans, Venda and English. It includes 613 items and is ... -
Bilingual English-Siswati Corpus
(North-West University - Centre for Text Technology (CTexT), 2022-03-31)Aligned parallel corpora for the following language pair: English-SiSwati. The data is given as four separate UTF-8 text files, with each segment on a ... -
Monolingual Siswati Corpus
(North-West University - Centre for Text Technology (CTexT), 2022-03-31)Monolingual corpus for SiSwati. The data is given as a single UTF-8 text file, with each segment on a newline. The dataset contains existing data sourced ... -
South African Multilingual Learner Corpus of Academic Texts (SAMuLCAT)
(ICELDA; SADiLaR, 2021)NOTE: THIS HAS BEEN SUPERSEDED. See https://hdl.handle.net/20.500.12185/585 The South African Multilingual Learner Corpus of Academic Texts (SAMuLCAT) ... -
Sesotho syllabification systems
(South African Centre for Digital Language Resources, 2022-02-03)This package contains two syllabification systems for Sesotho (rule-based and TeX-based).