Filter by:

Now showing items 70-89 of 526

Filter options

    • Autshumato Machine Translation Web Service (MTWS) 

      Wildrich Fourie, et al. (Centre for Text Technology; North-West University, 2018-03-01) ~ Resource Index
      The MTWS is a unified interface through which anyone can gain access to the MT systems developed in the Autshumato project. It can provide sentence, ...
    • Autshumato Monolingual Afrikaans Corpus 

      McKellar, Cindy (CTexT® (Centre for Text Technology, North-West University), 2022-09-30)
      Monolingual corpus for Afrikaans. The data is given as a single UTF-8 text file, with each segment on a newline. The data was specifically selected and ...
    • Autshumato Monolingual isiNdebele Corpus 

      McKellar, Cindy (North-West University; Centre for Text Technology (CTexT), 2021-01-31)
      Monolingual corpus for isiNdebele. The data is given as a single UTF-8 text file, with each segment on a newline.
    • Autshumato Monolingual isiZulu Corpus 

      McKellar, Cindy (CTexT® (Centre for Text Technology, North-West University), 2022-09-30)
      Monolingual corpus for isiZulu. The data is given as a single UTF-8 text file, with each segment on a newline. The data was specifically selected and ...
    • Autshumato Monolingual Sepedi Corpus 

      McKellar, Cindy (CTexT® (Centre for Text Technology, North-West University), 2022-09-30)
      Monolingual corpus for Sepedi. The data is given as a single UTF-8 text file, with each segment on a newline. The data was specifically selected and ...
    • Autshumato Monolingual Sesotho Corpus 

      McKellar, Cindy (CTexT® (Centre for Text Technology, North-West University), 2022-09-30)
      Monolingual corpus for Sesotho. The data is given as a single UTF-8 text file, with each segment on a newline. The data was specifically selected and ...
    • Autshumato Monolingual Setswana Corpus 

      McKellar, Cindy (CTexT® (Centre for Text Technology, North-West University), 2022-09-30)
      Monolingual corpus for Setswana. The data is given as a single UTF-8 text file, with each segment on a newline. The data was specifically selected and ...
    • Autshumato Monolingual Tshivenḓa Corpus 

      McKellar, Cindy (North-West University; Centre for Text Technology (CTexT), 2020-09-30)
      Monolingual corpus for Tshivenḓa. The data is given as a single UTF-8 text file, with each segment on a newline.
    • Autshumato Monolingual Tshivenḓa Corpus 

      McKellar, Cindy (North-West University; Centre for Text Technology (CTexT), 2023-12-12)
      Monolingual corpus for Tshivenḓa. The data is given as a single UTF-8 text file, with each segment on a newline.
    • Autshumato Monolingual Xitsonga Corpus 

      McKellar, Cindy (CTexT® (Centre for Text Technology, North-West University), 2022-09-30)
      Monolingual corpus for Xitsonga. The data is given as a single UTF-8 text file, with each segment on a newline. The data was specifically selected and ...
    • Autshumato Multilingual Word and Phrase Translations 

      Martin Puttkammer, et al. (North-West University; Centre for Text Technology (CTexT), 2016-01-20) ~ Resource Catalogue
      Word and phrase lists aligned from English to the other official South African languages.
    • Autshumato PDF Text Extractor 

      Wildrich Fourie (North-West University; Centre for Text Technology (CTexT), 2013-06-20) ~ Resource Catalogue
      Utility application for extracting text out of a PDF document. The pages can also be extracted as images.
    • Autshumato Sesotho sa Leboa-English Translation Memory 

      Cindy McKellar, et al. (North-West University; Centre for Text Technology (CTexT), 2013-06-19) ~ Resource Catalogue
      Translation memory from Sesotho sa Leboa to English (EN-GB), in the government domain for use in the Autshumato ITE application.
    • Autshumato Setswana Monolingual Corpora 

      Cindy McKellar (North-West University; Centre for Text Technology (CTexT), 2016-10-28) ~ Resource Catalogue
      Setswana monolingual corpus as a deliverable of the Autshumato project. The data is given as a UTF-8 text file; with each sentence on a new line.
    • Autshumato Text Anonymiser 

      Martin Schlemmer, et al. (North-West University; Centre for Text Technology (CTexT), 2013-06-20) ~ Resource Catalogue
      Anonymises text by classifying and replacing sensitive information such as person names, business names, place names, monetary values, phone numbers, ...
    • Autshumato TMS 

      Martin Schlemmer, et al. (North-West University; Centre for Text Technology (CTexT), 2013-06-19) ~ Resource Catalogue
      Terminology Management System. Web application used by Terminologists and Administrators to capture, edit and export terminology.
    • Autshumato TMX Integrator 

      Martin Schlemmer, et al. (North-West University; Centre for Text Technology (CTexT), 2013-06-20) ~ Resource Catalogue
      Utility to merge multiple translation memories over a network using Subversion
    • Autshumato Xitsonga Frequency Word List 

      Wikus Pienaar, et al. (North-West University; Centre for Text Technology (CTexT), 2014-12-12) ~ Resource Catalogue
      A list of the most frequent Xitsonga words as deliverable of the Autshumato project.
    • Autshumato Xitsonga Monolingual Corpora 

      Wikus Pienaar, et al. (North-West University; Centre for Text Technology (CTexT), 2014-12-12) ~ Resource Catalogue
      Xitsonga monolingual corpus as deliverable of the Autshumato project. The data is given as a UTF-8 text file; with each sentence on a newline.
    • Bambara Monolingual Children First Language Acquisition (Babbling & First Words) 

      CISSE, Ibrahima Abdoul Hayou (Ibrahima Abdoul Hayou CISSE, 2010)
      Dataset contains videos of children interacting with caregivers. Languages included: Bambara/Bamanakan/Dioula/Mande