In 2009, the South African National HLT Network (NHN) funded a technology audit that was conducted to form a clear profile of the research and development activities in the human language technology field in South Africa. This audit was used as the basis for the RMA Index, which is a list of South African resources with the relevant metadata (information such as developer details and specifications). Some of these resources are included in the RMA Catalogue, and are therefore available for download.

Collections in this community

  • Resource Catalogue [243]

    A collection of language resources available for download from the RMA of SADiLaR. The collection mostly consists of resources developed with funding from the Department of Arts and Culture.
  • Resource Index [381]

    A collection of language resource metadata mostly collected during the NHN funded technology audit of 2009, as well as the SADiLaR technology audit of 2018. Not all resources in this collection are available for download.
  • Student Data Repository [6]

    A collection of language resources available as part of the output of post-graduate study programs

Recent Submissions

  • English-IsiNdebele Glossary of Medical Terms 

    Malele, Nomsebenzi (University of South Africa (UNISA), 2021-09-01)
    This is the PhD project, where English-isiNdebele glossary of medical terms was compiled by a PhD candidate.
  • Linguistically enriched corpora for conjunctively written South African languages 

    Puttkammer, Martin, et al. (North-West University, Centre for Language Technology (CTexT), 2021-09)
    This resource contains linguistically annotated data for four official South African languages with a conjunctive orthography from the Nguni family ...
  • Corpus of multilingual code-switched soap opera speech 

    van der Westhuizen, Ewald, et al. (Stellenbosch University, 2020-02-28)
    The corpus comprises 26.9 hours of annotated multilingual speech that contains examples of code-switching in isiZulu, isiXhosa, Setswana, Sesotho and ...
  • COVID-19 Multilingual Terminology 

    City of Tshwane, et al. (City of Tshwane; South African Centre for Digital Language Resources (SADiLaR); Department of Science and Innovation; Pan South African Language Board (PanSALB), 2021-07)
    COVID-19 multilingual terminology list document in all the South African languages. The development of this terminology list was initiated by City of ...
  • CGE's Afrikaans Gender Terminology List 

    Commission for Gender Equality (CGE), et al. (Commission for Gender Equality (CGE), 2021-04)
    CGE's Afrikaans Gender Terminology List is a list of terms, either words or phrases, related to the promotion of gender equality. All 436 words or phrases ...
  • Human Language Technology Audit 2017/18 

    Moors, Carmen, et al. (CSIR, 2018-08-31)
    This document reports on all work conducted in the 2017/18 Audit of human language technology (HLT) resources available in South Africa project. The ...
  • Code-switching among bilingual Afrikaans-Dutch children 

    Rabe, Monique (North-West University, 2021)
    To describe the code-switching in the speech of pre-school, bilingual Afrikaans-Dutch children, novel language data of eight preschool children between ...
  • Generic Bilingual Academic Wordlist with Definitions 

    ICELDA, et al. (ICELDA; SADiLaR, 2021)
    The academic wordlist has been developed to serve as a resource to students to assist them to better understand words used within the information they ...
  • Description of N|uu 

    Sands, Bonny, et al. (Bonny Sands, 2015-10-06)
    Recordings of dictionary entries for a pan-dialectal dictionary of the N|uu language (Eastern and Western dialects) made by Bonny Sands, Johanna Brugman, ...
  • Mburisano Covid-19 multilingual corpus 

    Marais, Laurette (CSIR Voice Computing, 2020-12-04)
    This corpus was created to aid development of the AwezaMed Covid-19 speech-to-speech mobile application. The project within which it was created, ...

View more