Filter by:

Now showing items 214-233 of 526

Filter options

    • Lwazi Xitsonga TTS corpus 

      Daniel van Niekerk, et al. (Meraka Institute, CSIR, 2013-03-27) ~ Resource Catalogue
      Orthographic and phonemically aligned transcriptions
    • Mburisano Covid-19 multilingual corpus 

      Marais, Laurette (CSIR Voice Computing, 2020-12-04)
      This corpus was created to aid development of the AwezaMed Covid-19 speech-to-speech mobile application. The project within which it was created, ...
    • Media24 Afrikaans Test Corpus 

      Unknown author (Department of African Languages - University of Pretoria, 2015-01-27) ~ Resource Index
      Collection of texts from Media24 newspaper texts for building lexicographic modules such as the Afrikaans Trajectories
    • MELFA (Mobile E-Learning for Africa) 

      Unknown author (CatchWord; Stellenbosch University; Sensus; Copenhagen University, 2013-07-01) ~ Resource Index
      Embedded TTS on mobile phones providing speech output of full texts for various types of applications.
    • MobiLex and SA Trilingual Wine Industry Dictionary 

      Unknown author (Stellenbosch University; Winetech; SAWIS, 2018-02-28) ~ Resource Index
      Trilingual LSP dictionary on cellphone and website Trilingual LSP dictionary on website
    • Monolingual isiXhosa corpus 

      McKellar, Cindy (North-West University - Centre for Text Technology (CTexT), 2019-11-30) ~ Resource Catalogue
      Monolingual corpus for isiXhosa. The data is given as a single UTF-8 text file, with each segment on a newline. The dataset contains existing data ...
    • Monolingual Siswati Corpus 

      McKellar, Cindy (North-West University - Centre for Text Technology (CTexT), 2022-03-31)
      Monolingual corpus for SiSwati. The data is given as a single UTF-8 text file, with each segment on a newline. The dataset contains existing data sourced ...
    • Morphologically annotated corpus for isiNdebele 

      Gaustad, Tanja (Centre for Text Technology (CTexT), 2024-01-31)
      NCHLT corpus of morphologically annotated tokens in isiNdebele converted to the tags used during phases 1 and 2 of the SADiLaR-II project. The data ...
    • Morphologically annotated corpus for isiXhosa 

      Gaustad, Tanja (Centre for Text Technology (CTexT), 2024-01-31)
      NCHLT corpus of morphologically annotated tokens in isiXhosa converted to the tags used during phases 1 and 2 of the SADiLaR-II project. The data is ...
    • Morphologically annotated corpus for isiZulu 

      Gaustad, Tanja (Centre for Text Technology (CTexT), 2024-01-31)
      NCHLT corpus of morphologically annotated tokens in isiZulu converted to the tags used during phases 1 and 2 of the SADiLaR-II project. The data is ...
    • Morphologically annotated corpus for Sepedi 

      Gaustad, Tanja (Centre for Text Technology (CTexT), 2024-01-31)
      NCHLT corpus of morphologically annotated tokens in Sepedi converted to the tags used during phases 1 and 2 of the SADiLaR-II project. The data is ...
    • Morphologically annotated corpus for Sesotho 

      Gaustad, Tanja (Centre for Text Technology (CTexT), 2024-01-31)
      NCHLT corpus of morphologically annotated tokens in Sesotho converted to the tags used during phases 1 and 2 of the SADiLaR-II project. The data is ...
    • Morphologically annotated corpus for Setswana 

      Gaustad, Tanja (Centre for Text Technology (CTexT), 2024-01-31)
      NCHLT corpus of morphologically annotated tokens in Setswana converted to the tags used during phases 1 and 2 of the SADiLaR-II project. The data is ...
    • Morphologically annotated corpus for Siswati 

      Gaustad, Tanja (Centre for Text Technology (CTexT), 2024-01-31)
      NCHLT corpus of morphologically annotated tokens in Siswati converted to the tags used during phases 1 and 2 of the SADiLaR-II project. The data is ...
    • Morphologically annotated corpus for Tshivenḓa 

      Gaustad, Tanja (Centre for Text Technology (CTexT), 2024-01-31)
      NCHLT corpus of morphologically annotated tokens in Tshivenḓa converted to the tags used during phases 1 and 2 of the SADiLaR-II project. The data is ...
    • Morphologically annotated corpus for Xitsonga 

      Gaustad, Tanja (Centre for Text Technology (CTexT), 2024-01-31)
      NCHLT corpus of morphologically annotated tokens in Xitsonga converted to the tags used during phases 1 and 2 of the SADiLaR-II project. The data is ...
    • Morphosyntactic Drag-and-Drop Tagger for isiXhosa 

      Unknown author (University of South Africa; Gothenburg University, 2015-01-28) ~ Resource Index
      The tagger and the tag set were developed under the auspices of the Spoken Language Corpus Project (Unisa & Gothenburg University) as part of the ...
    • Morphosyntactic Tag Set for isiXhosa 

      Unknown author (University of South Africa; Gothenburg University, 2015-01-28) ~ Resource Index
      The tagger and the tag set were developed under the auspices of the Spoken Language Corpus Project (Unisa & Gothenburg University) as part of the ...
    • Multilingual Arts & Culture Intermediate Phase Terminology List 

      Unknown author (Terminology Coordination Section of the National Language Service, Department of Arts and Culture, 2017-03-03) ~ Resource Index
      550 English source terms with their equivalents in the ten other official South African languages. The list was compiled in collaboration with subject ...
    • Multilingual Election Terminology List 

      Unknown author (Terminology Coordination Section of the National Language Service, Department of Arts and Culture, 2017-02-15) ~ Resource Index
      559 English source terms with their equivalents in the ten other official South African languages. The list was compiled in collaboration with subject ...