Welcome to the Language Resource Management Agency of SADiLaR. This repository provides access to all of the collections, data sets, tools and other language resources that are distributed by SADiLaR.

The repository will eventually replace all of the functionality of the original RMA site, and all of the resources available from the RMA will also be available from this repository.

Select a community to browse its collections.

Language Resource Management Agency [411]
  • Final year high school examination texts of South African home and first additional language subjects 

    Sibeko, Johannes, et al. (South African Centre for Digital Language Resources, 2022-11-16)
    This data collection consists of reading comprehension and summary writing texts. The texts comprise the final year high school exam texts for ...
  • N|uu language archive 

    Collins, Christopher T, et al. (South African Centre for Digital Language Resources, 2022-11-15)
    This data collection contains recordings and transcriptions of the N|uu language. This includes N|uu recordings, South African Nama and a local variety ...
  • Afrikaans morphological evaluative constructions dataset 

    Benito, Trollip (North-West University, 2022-11-04)
    A dataset of Afrikaans morphological evaluative constructions (MECs) and their word frequency classes. The MECs have been compiled using extracted ...
  • WAT quotation collection 

    Bureau of the WAT (N/A, 2022-10-14)
    Collection of short quotations/excerpts from a variety of books (fiction, non-fiction & academic).
  • WAT word slips collection 

    Bureau of the WAT (Bureau of the WAT, 2022-10-14)
    Approximately 3.5 million index cards containing Afrikaans words (lemmas), meanings and expressions, as well as example sentences (quotations) that have ...
  • N|uu language archive 

    Collins, Chris, et al. (Collins, Chris; Sands, Bonny; Jones, Kerry, 2022-08-11)
    This collection contains information that forms the basis of the N|uu dictionary which contains a word list for N|uu with translations into Afrikaans, ...
  • CGE's Sesotho Gender Terminology List 

    Commission for Gender Equality (CGE), et al. (Commission for Gender Equality (CGE), 2018)
    CGE's Sesotho Gender Terminology List is a list of terms, either words or phrases, related to the promotion of gender equality. All 446 words or phrases ...
  • Proof of concept: Afrikaans English Venda E-dictionary 

    Bosch, Sonja, et al. (Published as a Lexonomy dictionary (https://www.lexonomy.eu/POCVenEngAfr/), 2022-03-04)
    This proof of concept is a result of an experiment to compile a trilingual e-dictionary for Afrikaans, Venda and English. It includes 613 items and is ...
  • Bilingual English-Siswati Corpus 

    McKellar, Cindy (North-West University - Centre for Text Technology (CTexT), 2022-03-31)
    Aligned parallel corpora for the following language pair: English-SiSwati. The data is given as four separate UTF-8 text files, with each segment on a ...
  • Monolingual Siswati Corpus 

    McKellar, Cindy (North-West University - Centre for Text Technology (CTexT), 2022-03-31)
    Monolingual corpus for SiSwati. The data is given as a single UTF-8 text file, with each segment on a new line. The dataset contains existing data sourced ...
