Filter by:

Now showing items 75-94 of 411

Filter options

    • Calomo 

      Menno van Zaanen (North-West University; Centre for Text Technology (CTexT), 2015-01-30) ~ Resource Index
      Calomo is a hyphenator for Afrikaans, which can be implemented in any NLP system. It takes as input a string, and produces as output an analysed string, ...
    • CGE's Afrikaans Gender Terminology List 

      Commission for Gender Equality (CGE), et al. (Commission for Gender Equality (CGE), 2021-04)
      CGE's Afrikaans Gender Terminology List is a list of terms, either words or phrases, related to the promotion of gender equality. All 436 words or phrases ...
    • CGE's Sesotho Gender Terminology List 

      Commission for Gender Equality (CGE), et al. (Commission for Gender Equality (CGE), 2018)
      CGE's Sesotho Gender Terminology List is a list of terms, either words or phrases, related to the promotion of gender equality. All 446 words or phrases ...
    • CKarma 

      Unknown author (North-West University; Centre for Text Technology (CTexT), 2015-01-30) ~ Resource Index
      CKarma is a compound analyser for Afrikaans, to be used for the detection of word boundaries within compounds. It takes as input a string, and produces ...
    • Code-switching among bilingual Afrikaans-Dutch children 

      Rabe, Monique (North-West University, 2021)
      To describe the code-switching in the speech of pre-school, bilingual Afrikaans-Dutch children, novel language data of eight preschool children between ...
    • Combination Tagger 

      Unknown author (North-West University; Centre for Text Technology (CTexT), 2015-01-30) ~ Resource Index
      The combination tagger framework uses MBT, SVM, MXPOST and TnT. Each tagger receives a weight by which it can vote for a tag.
    • CompanyCall v1.0 

      Unknown author (, 2013-07-01) ~ Resource Index
      Routes calls based on spoken company name, south african names, ses ASR name re ??, which domain) directory assistance, 08606companycall.co.za.
    • Core technologies for conjunctively written South African languages 

      Du Toit, Jaco, et al. (North-West University, Centre for Language Technology (CTexT), 2021-03-31)
      During this SADiLaR funded project, enriched corpora for the four official South African languages with a conjunctive orthography, i.e. isiNdebele ...
    • Corpus of multilingual code-switched soap opera speech 

      van der Westhuizen, Ewald, et al. (Stellenbosch University, 2020-02-28)
      The corpus comprises 26.9 hours of annotated multilingual speech that contains examples of code-switching in isiZulu, isiXhosa, Setswana, Sesotho and ...
    • CorpusCatcher 

      Unknown author (Translate.org.za, 2015-01-28) ~ Resource Index
      Corpus Catcher is a tool that is designed to crawl the web to retrieve data for inclusion in a corpus. It makes use of seed documents/wordlists to ...
    • COVID-19 Multilingual Terminology 

      City of Tshwane, et al. (City of Tshwane; South African Centre for Digital Language Resources (SADiLaR); Department of Science and Innovation; Pan South African Language Board (PanSALB), 2021-07)
      COVID-19 multilingual terminology list document in all the South African languages. The development of this terminology list was initiated by City of ...
    • CTexT Afrikaans fastText CBoW String Embeddings 

      Eiselen, Roald (Centre for Text Technology (CTexT), 2022-01-10)
      The CTexT Afrikaans fastText CBoW String Embeddings is a 300 dimensional Afrikaans embedding model based on the Contunious Bag of Words fastText ...
    • CTexT Afrikaans FLAIR Named Entity Recognition model 

      Eiselen, Roald (Centre for Text Technology (CTexT), 2022-01-10)
      The CTexT Afrikaans FLAIR Named Entity Recognition model is a neural NER model based on the FLAIR framework (Akbik et al. 2019), and includes Afrikaans ...
    • CTexT Afrikaans FLAIR Part of Speech tagger model 

      Eiselen, Roald (Centre for Text Technology (CTexT), 2022-01-10)
      The CTexT Afrikaans FLAIR Part of Speech tagger model is a neural part of speech tagger model based on the FLAIR framework (Akbik et al. 2019), and ...
    • CTexT Afrikaans FLAIR String Embeddings 

      Eiselen, Roald (Centre for Text Technology (CTexT), 2022-01-10)
      The CTexT Afrikaans FLAIR String Embeddings are two Afrikaans embedding models based on the FLAIR architecture (Akbik et al. 2018, 2019) that provides ...
    • CTexT Afrikaans GloVe Word Embeddings 

      Eiselen, Roald (Centre for Text Technology (CTexT), 2022-01-10)
      The CTexT Afrikaans GloVe Word Embeddings is a 300 dimensional Afrikaans embedding model based on the Global Vectors architecture (Pennington, 2014) ...
    • CTexT Afrikaanse Grammatikatoetser 1.0 

      Unknown author (Centre for Text Technology (CTexT), 2013-07-01) ~ Resource Index
      The CTexT Afrikaanse Grammatikatoetser is the first Afrikaans grammar checker for Microsoft Office, and can identify a number of grammatical and style errors.
    • CTexT Afrikaanse Tesourus 1.0 

      Unknown author (Centre for Text Technology (CTexT), 2013-07-01) ~ Resource Index
      The CTexT Afrikaanse Thesaurus is based on the Groot Afrikaans Tesourus of De Stadler.
    • CTexT Alignment Interface 

      Ismail Lavangee (North-West University; Centre for Text Technology (CTexT), 2013-06-21) ~ Resource Catalogue
      Utility application for the manual alignment of source texts.
    • CTexT Alignment Interface Pro 

      Ismail Lavangee (North-West University; Centre for Text Technology (CTexT), 2013-06-21) ~ Resource Catalogue
      Utility application for the manual alignment of source texts. Pro version allows for the editing of the segments.