Now showing items 53-72 of 333

    • CGE's Afrikaans Gender Terminology List 

      Commission for Gender Equality (CGE), et al. (Commission for Gender Equality (CGE), 2021-04)
      CGE's Afrikaans Gender Terminology List is a list of terms, either words or phrases, related to the promotion of gender equality. All 436 words or phrases ...
    • Core technologies for conjunctively written South African languages 

      Du Toit, Jaco, et al. (North-West University, Centre for Language Technology (CTexT), 2021-03-31)
      During this SADiLaR funded project, enriched corpora for the four official South African languages with a conjunctive orthography, i.e. isiNdebele ...
    • Corpus of multilingual code-switched soap opera speech 

      van der Westhuizen, Ewald, et al. (Stellenbosch University, 2020-02-28)
      The corpus comprises 26.9 hours of annotated multilingual speech that contains examples of code-switching in isiZulu, isiXhosa, Setswana, Sesotho and ...
    • COVID-19 Multilingual Terminology 

      City of Tshwane, et al. (City of Tshwane; South African Centre for Digital Language Resources (SADiLaR); Department of Science and Innovation; Pan South African Language Board (PanSALB), 2021-07)
      COVID-19 multilingual terminology list document in all the South African languages. The development of this terminology list was initiated by City of ...
    • CTexT Afrikaans fastText CBoW String Embeddings 

      Eiselen, Roald (Centre for Text Technology (CTexT), 2022-01-10)
      The CTexT Afrikaans fastText CBoW String Embeddings is a 300-dimensional Afrikaans embedding model based on the Continuous Bag of Words fastText ...
    • CTexT Afrikaans FLAIR Named Entity Recognition model 

      Eiselen, Roald (Centre for Text Technology (CTexT), 2022-01-10)
      The CTexT Afrikaans FLAIR Named Entity Recognition model is a neural NER model based on the FLAIR framework (Akbik et al. 2019), and includes Afrikaans ...
    • CTexT Afrikaans FLAIR Part of Speech tagger model 

      Eiselen, Roald (Centre for Text Technology (CTexT), 2022-01-10)
      The CTexT Afrikaans FLAIR Part of Speech tagger model is a neural part of speech tagger model based on the FLAIR framework (Akbik et al. 2019), and ...
    • CTexT Afrikaans FLAIR String Embeddings 

      Eiselen, Roald (Centre for Text Technology (CTexT), 2022-01-10)
      The CTexT Afrikaans FLAIR String Embeddings are two Afrikaans embedding models based on the FLAIR architecture (Akbik et al. 2018, 2019) that provide ...
    • CTexT Afrikaans GloVe Word Embeddings 

      Eiselen, Roald (Centre for Text Technology (CTexT), 2022-01-10)
      The CTexT Afrikaans GloVe Word Embeddings is a 300-dimensional Afrikaans embedding model based on the Global Vectors architecture (Pennington, 2014) ...
    • CTexT Alignment Interface 

      Ismail Lavangee (North-West University; Centre for Text Technology (CTexT), 2013-06-21) ~ Resource Catalogue
      Utility application for the manual alignment of source texts.
    • CTexT Alignment Interface Pro 

      Ismail Lavangee (North-West University; Centre for Text Technology (CTexT), 2013-06-21) ~ Resource Catalogue
      Utility application for the manual alignment of source texts. Pro version allows for the editing of the segments.
    • CTexT fastText Skipgram String Embeddings 

      Eiselen, Roald (Centre for Text Technology (CTexT), 2022-01-10)
      The CTexT Afrikaans fastText Skipgram String Embeddings is a 300-dimensional Afrikaans embedding model based on the Skipgram fastText architecture that ...
    • CTexTools 

      Martin Puttkammer, et al. (North-West University; Centre for Text Technology (CTexT), 2014-05-30) ~ Resource Catalogue
      Corpus query and manipulation tool for performing tokenisation and sentencisation; extracting frequency list and word list; searching; and extracting ...
    • CTexTools 2 

      Eiselen, Roald, et al. (North-West University, Centre for Text Technology (CTexT); South African Department of Arts and Culture, 2018-05-24) ~ Resource Catalogue
      CTexTools is a corpus query and manipulation tool primarily for the official South African languages. The tool supports the creation of frequency and ...
    • Denominal adjectives in Afrikaans dataset 

      Trollip, Benito (South African Centre for Digital Language Resources, 2020-05-15) ~ Resource Catalogue
      This dataset contains a collection of Afrikaans denominal adjectives that were extracted from the Virtual Institute for Afrikaans' corpus portal. The ...
    • DictionaryMaker 

      Marelie Davel, et al. (Meraka Institute, CSIR, 2013-07-15) ~ Resource Catalogue
      The purpose of the DictionaryMaker system is to facilitate the creation of an electronic pronunciation dictionary in a target language, as originally ...
    • Generic Bilingual Academic Wordlist with Definitions 

      ICELDA, et al. (ICELDA; SADiLaR, 2021)
      The academic wordlist has been developed to serve as a resource to students to assist them to better understand words used within the information they ...
    • Generic Multilingual Academic Wordlists with Definitions 

      Van Dyk, Tobie (SADiLaR; ICELDA, 2022)
      This multilingual generic academic wordlist has been developed to serve as a resource to students to assist with building a vocabulary and decoding ...
    • High quality TTS data for four South African languages (af, st, tn, xh) 

      Unknown author (Google; North-West University, 2017) ~ Resource Catalogue
      This data set contains multi-speaker TTS high quality transcribed audio data for four languages of South Africa: Afrikaans, Sesotho, Setswana and isiXhosa. ...
    • Human Language Technology Audit 2017/18 

      Moors, Carmen, et al. (CSIR, 2018-08-31)
      This document reports on all work conducted in the 2017/18 Audit of human language technology (HLT) resources available in South Africa project. The ...