Filter by:

Now showing items 1-20 of 240

Filter options

    • Afribooms Afrikaans Dependency Treebank 

      Daniel van Niekerk (North-West University; Centre for Text Technology (CTexT); Katholieke Universiteit Leuven (Belgium), 2015-02-10) ~ Resource Catalogue
      This is the annotated corpus developed for Afrikaans for the Afribooms project. The corpus includes annotations for lemma, part-of-speech (POS) and ...
    • African Speech Technology Afrikaans Text Corpus 

      CatchWord Language and Speech Technologies (Pty) Ltd (North-West University; Stellenbosch University; University of Transkei; University of Free State (Qwa-Qwa campus); Rhodes University; University of KwaZulu-Natal; University of Western Cape, 2015-01-07) ~ Resource Catalogue
      Monolingual text corpus developed during the African Speech Technology project.
    • African Speech Technology English Text Corpus 

      CatchWord Language and Speech Technologies (Pty) Ltd (North-West University; Stellenbosch University; University of Transkei; University of Free State (Qwa-Qwa campus); Rhodes University; University of KwaZulu-Natal; University of Western Cape, 2015-01-07) ~ Resource Catalogue
      Monolingual text corpus developed during the African Speech Technology project.
    • African Speech Technology isiXhosa Text Corpus 

      CatchWord Language and Speech Technologies (Pty) Ltd (North-West University; Stellenbosch University; University of Transkei; University of Free State (Qwa-Qwa campus); Rhodes University; University of KwaZulu-Natal; University of Western Cape, 2015-01-07) ~ Resource Catalogue
      Monolingual text corpus developed during the African Speech Technology project.
    • African Speech Technology isiZulu Text Corpus 

      CatchWord Language and Speech Technologies (Pty) Ltd (North-West University; Stellenbosch University; University of Transkei; University of Free State (Qwa-Qwa campus); Rhodes University; University of KwaZulu-Natal; University of Western Cape, 2015-01-07) ~ Resource Catalogue
      Monolingual text corpus developed during the African Speech Technology project.
    • African Wordnet: isiXhosa 1.0 

      African Wordnet Project (UNISA, 2017-06-20) ~ Resource Catalogue
      Developed using the expand model with Princeton WordNet 2.0 as basis. Each wordnet contains synsets with at least the following fields:\nWord form (lemma; ...
    • African Wordnet: isiZulu 1.0 

      African Wordnet Project (UNISA, 2017-06-20) ~ Resource Catalogue
      Developed using the expand model with Princeton WordNet 2.0 as basis. Each wordnet contains synsets with at least the following fields:\nWord form (lemma; ...
    • African Wordnet: Sesotho sa Leboa 1.0 

      African Wordnet Project (UNISA, 2017-06-20) ~ Resource Catalogue
      Developed using the expand model with Princeton WordNet 2.0 as basis. Each wordnet contains synsets with at least the following fields:\nWord form (lemma; ...
    • African Wordnet: Setswana 1.0 

      African Wordnet Project (UNISA, 2017-06-20) ~ Resource Catalogue
      Developed using the expand model with Princeton WordNet 2.0 as basis.Each wordnet contains synsets with at least the following fields:\nWord form (lemma; ...
    • African Wordnet: Tshivenda 1.0 

      African Wordnet Project (UNISA, 2017-06-20) ~ Resource Catalogue
      Developed using the expand model with Princeton WordNet 2.0 as basis. Each wordnet contains synsets with at least the following fields:\nWord form (lemma; ...
    • Afrikaans Chunker 

      Unknown author (University of South Africa, 2015-01-28) ~ Resource Index
      Chunker for Afrikaans based on memory-based machine learning.
    • Afrikaans Custom Dictionary for Government Domain 

      Gerhard van Huyssteen, et al. (North-West University; Centre for Text Technology (CTexT), 2013-02-22) ~ Resource Catalogue
      Word list developed as a custom dictionary for use in the spelling checkers as part of the spelling checker project for the Department of Arts and ...
    • Afrikaans Genre Classification Corpus 

      Gerhard van Huyssteen, et al. (Trifonius, 2013-06-19) ~ Resource Catalogue
      Contains training and testing data for Genre Classification for Afrikaans.
    • Afrikaans lexical blends dataset 

      Trollip, Benito, et al. (North-West University, 2023-12)
      This a dataset of Afrikaans blend constructions that have been collected and analysed using the Levenshtein distance metric. This dataset serves as the ...
    • Afrikaans linking element dataset 

      Trollip, EB (North-West University, 2019) ~ Resource Catalogue
      (Afrikaans follows English) This data set was compiled for a study in which the possible semantic content of Afrikaans linking elements was investigated. ...
    • Afrikaans Part of Speech Data 

      Unknown author (North-West University; Centre for Text Technology (CTexT), 2015-01-30) ~ Resource Index
      POS annotated data used to train POS tagger. The tagset was specifically designed for Afrikaans and consists of 139 pos-tags.
    • Afrikaans Pronunciation Dictionary 

      Unknown author (Meraka Institute, CSIR; Stellenbosch University, 2015-01-28) ~ Resource Index
      Pronunciation dictionary compiled from Lwazi ASR dictionary and the "Taalkommissie Korpus". Dictionary to be used for the development of a large vocabulary ...
    • Afrikaans speaking children's first lexical items 

      Brink, Nina (North-West University, 2018-05-17) ~ Resource Catalogue
      Data collected for a master's study in Afrikaans linguistics. The data consist of the first lexical items of 21 Afrikaans speaking children. The lexical ...
    • Afrikaans text unit identification data 

      Puttkammer, Martin (Centre for Text Technology, North-West University, 2006) ~ Resource Catalogue
      This dataset was developed during a masters degree and used in the development of a text unit identifier capable of tagging sentences, named-entities, ...
    • Afrikaans TnT-Tagger 

      Unknown author (North-West University; Centre for Text Technology (CTexT), 2015-01-30) ~ Resource Index
      The Afrikaans TnT-tagger is a part of speech tagger that can be used to add part of speech tags to Afrikaans texts.The tagger is an Afrikaans version ...