In 2009, the South African National HLT Network (NHN) funded a technology audit to build a clear profile of research and development activity in human language technology (HLT) in South Africa. The audit formed the basis of the RMA Index, a list of South African resources with their metadata (such as developer details and specifications). Some of these resources are also included in the RMA Catalogue and are therefore available for download.

Collections in this community

  • Resource Catalogue [215]

    A collection of language resources available for download from the RMA of SADiLaR. The collection mostly consists of resources developed with funding from the Department of Arts and Culture.
  • Resource Index [351]

    A collection of language resource metadata, mostly collected during the NHN-funded technology audit of 2009 and the SADiLaR technology audit of 2018. Not all resources in this collection are available for download.

Recent Submissions

  • Autshumato Machine Translation Evaluation Set 

    McKellar, Cindy Arlene (North-West University; Centre for Text Technology (CTexT); Department of Arts and Culture, South Africa, 2017-12-15) ~ Resource Catalogue
    Comparable evaluation data for use in automatic machine translation evaluations. The evaluation set consists of 500 sentences translated separately by ...
  • Qfrency TTS phone mappings 

    Unknown author (CSIR, 02 03 2018) ~ Resource Index
    TTS phone mappings between IPA, XSAMPA and our Qfrency internal format, standardised across all 11 SA languages. To be used in conjunction with the Lwazi ...
  • Qfrency TTS Afrikaans Maryna recordings 

    Unknown author (CSIR, 07 03 2018) ~ Resource Index
    Studio-quality recordings of text-to-speech data in Afrikaans, with some English utterances, by a professional voice artist who is a first-language Afrikaans speaker.
  • Qfrency TTS Afrikaans Kobus recordings 

    Unknown author (CSIR, 07 03 2018) ~ Resource Index
    Studio-quality recordings of text-to-speech data in Afrikaans, with some English utterances, by a professional voice artist who is a first-language Afrikaans speaker.
  • GF Miniature Resource for Tswana 

    Laurette Marais, Meraka, et al. (HLT Research Group, Meraka Institute, CSIR, 06 03 2018) ~ Resource Index
    This miniature resource grammar parses and generates main clause sentences in various tenses, moods and aspects in Tswana. The lexicon is limited, but ...
  • NCHLT Text Web Services 

    Roald Eiselen (SADiLaR; North-West University, 01 03 2018) ~ Resource Index
    A web service that provides access to seven core technologies in ten South African languages, including: * Tokenisers * Sentence separators * ...
  • Autshumato Machine Translation Web Service (MTWS) 

    Wildrich Fourie, et al. (Centre for Text Technology; North-West University, 01 03 2018) ~ Resource Index
    The MTWS is a unified interface through which anyone can gain access to the MT systems developed in the Autshumato project. It can provide sentence, ...
  • TsnMorph 

    Laurette Pretorius, et al. (University of South Africa, 01 03 2018) ~ Resource Index
    Finite-state morphological analyser for Tswana, based on the Xerox toolkit and compatible with foma.
  • ZulMorph 

    Laurette Pretorius, et al. (University of South Africa, 01 03 2018) ~ Resource Index
    Finite-state morphological analyser for Zulu, based on the Xerox toolkit and compatible with foma.
  • Test treebank for the LFG/XLE treebank 

    Unknown author (University of South Africa, 01 03 2018) ~ Resource Index
    A selection of 828 Tswana sentences with their LFG/XLE parse trees.
