Filter by:

Now showing items 71-90 of 350

Filter options

    • Denominal adjectives in Afrikaans dataset 

      Trollip, Benito (South African Centre for Digital Language Resources, 2020-05-15) ~ Resource Catalogue
      This dataset contain a collection of Afrikaans denominal adjectives that were extracted from the Virtual Institute for Afrikaans' corpus portal. The ...
    • DictionaryMaker 

      Marelie Davel, et al. (Meraka Institute, CSIR, 2013-07-15) ~ Resource Catalogue
      The purpose of the DictionaryMaker system is to facilitate the creation of an electronic pronunciation dictionary in a target language, as originally ...
    • Ex Machina: Using NLP and statistical learning models to model eyewitness statements and choosing behaviour 

      Nortje, Alicia, et al. (Sadilar, 2019-05-07)
      This curated database includes data from various of empirical studies where eyewitness statements and descriptions were collected. The original studies, ...
    • Generic Bilingual Academic Wordlist with Definitions 

      ICELDA, et al. (ICELDA; SADiLaR, 2021)
      The academic wordlist has been developed to serve as a resource to students to assist them to better understand words used within the information they ...
    • Generic Multilingual Academic Wordlists with Definitions 

      Van Dyk, Tobie (SADiLaR; ICELDA, 2022)
      This multilingual generic academic wordlist has been developed to serve as a resource to students to assist with building a vocabulary and decoding ...
    • High quality TTS data for four South African languages (af, st, tn, xh) 

      Unknown author (Google; North-West University, 2017) ~ Resource Catalogue
      This data set contains multi-speaker TTS high quality transcribed audio data for four languages of South Africa: Afrikaans, Sesotho, Setswana and isiXhosa. ...
    • Human Language Technology Audit 2017/18 

      Moors, Carmen, et al. (CSIR, 2018-08-31)
      This document reports on all work conducted in the 2017/18 Audit of human language technology (HLT) resources available in South Africa project. The ...
    • isiNdebele Custom Dictionary for Government Domain 

      Martin Puttkammer, et al. (North-West University; Centre for Text Technology (CTexT), 2013-02-22) ~ Resource Catalogue
      Word list developed as a custom dictionary for use in the spelling checkers as part of the spelling checker project for the Department of Arts and ...
    • isiNdebele Genre Classification Corpus 

      Gerhard van Huyssteen, et al. (Trifonius, 2013-06-19) ~ Resource Catalogue
      Contains training and testing data for Genre Classification for isiNdebele.
    • isiXhosa Custom Dictionary for Government Domain 

      Martin Puttkammer, et al. (North-West University; Centre for Text Technology (CTexT), 2013-02-22) ~ Resource Catalogue
      Word list developed as a custom dictionary for use in the spelling checkers as part of the spelling checker project for the Department of Arts and ...
    • isiXhosa Genre Classification Corpus 

      Gerhard van Huyssteen, et al. (Trifonius, 2013-06-19) ~ Resource Catalogue
      Contains training and testing data for Genre Classification for isiXhosa.
    • isiZulu Custom Dictionary for Government Domain 

      Martin Puttkammer, et al. (North-West University; Centre for Text Technology (CTexT), 2013-02-22) ~ Resource Catalogue
      Word list developed as a custom dictionary for use in the spelling checkers as part of the spelling checker project for the Department of Arts and ...
    • isiZulu Genre Classification Corpus 

      Gerhard van Huyssteen, et al. (Trifonius, 2013-06-19) ~ Resource Catalogue
      Contains training and testing data for Genre Classification for isiZulu.
    • Lagos-NWU Yoruba Speech Corpus 

      Daniel van Niekerk, et al. (North-West University; Centre for Text Technology (CTexT); University of Lagos (Nigeria), 2015-02-06) ~ Resource Catalogue
      This speech corpus consisting of 16 female speakers and 17 male speakers was recorded in Lagos, Nigeria for the purpose of speech recognition research. ...
    • Lara2 

      Martin Puttkammer, et al. (North-West University; Centre for Text Technology (CTexT), 2014-05-30) ~ Resource Catalogue
      Tool for annotating texts with lemma, part of speech and morphological analysis information
    • Lwazi Afrikaans ASR corpus 

      Charl van Heerden, et al. (Meraka Institute, CSIR, 2013-04-02) ~ Resource Catalogue
      Complete audio recordings and orthographic transcriptions used for Lwazi speech recognition systems.
    • Lwazi Afrikaans Pronunciation Dictionary 

      Marelie Davel (Meraka Institute, CSIR, 2013-04-01) ~ Resource Catalogue
      General phonemic pronunciations for frequently occurring words in SA languages. Dictionaries were developed to be practically usable for speech technology ...
    • Lwazi Afrikaans TTS corpus 

      Daniel van Niekerk, et al. (Meraka Institute, CSIR, 2013-03-27) ~ Resource Catalogue
      Orthographic and phonemically aligned transcriptions
    • Lwazi English ASR corpus 

      Charl van Heerden, et al. (Meraka Institute, CSIR, 2013-04-02) ~ Resource Catalogue
      Complete audio recordings and orthographic transcriptions used for Lwazi speech recognition systems.
    • Lwazi English Pronunciation Dictionary 

      Marelie Davel (Meraka Institute, CSIR, 2013-04-01) ~ Resource Catalogue
      General phonemic pronunciations for frequently occurring words in SA languages. Dictionaries were developed to be practically usable for speech technology ...