Filter by:

Now showing items 36-55 of 232

Filter options

    • Autshumato English-Setswana Parallel Corpora 

      Cindy McKellar (North-West University; Centre for Text Technology (CTexT), 2016-10-28) ~ Resource Catalogue
      Aligned English-Setswana parallel corpus. This set contains data that was translated by professional translators, data that was sourced as translated ...
    • Autshumato English-Xitsonga Manually Translated Parallel Corpora 

      Wikus Pienaar, et al. (North-West University; Centre for Text Technology (CTexT), 2014-12-12) ~ Resource Catalogue
      Aligned English-Xitsonga parallel corpus. The data is given as two seperate UTF-8 text files; with each segment on a newline.
    • Autshumato English-Xitsonga Parallel Corpora 

      Wikus Pienaar, et al. (North-West University; Centre for Text Technology (CTexT), 2014-12-11) ~ Resource Catalogue
      Aligned English-Xitsonga parallel corpus. The data is given as two seperate UTF-8 text files; with each segment on a newline.
    • Autshumato isiZulu-English Translation Memory 

      Cindy McKellar, et al. (North-West University; Centre for Text Technology (CTexT), 2013-06-19) ~ Resource Catalogue
      Translation memory from IsiZulu to English (EN-GB), in the government domain for use in the Autshumato ITE application.
    • Autshumato ITE 

      Martin Schlemmer, et al. (North-West University; Centre for Text Technology (CTexT), 2013-06-19) ~ Resource Catalogue
      Integrated Translation Environment. Combines multiple translation tools into one environment.
    • Autshumato Machine Translation Evaluation Set 

      McKellar, Cindy Arlene (North-West University; Centre for Text Technology (CTexT); Department of Arts and Culture, South Africa, 2017-12-15) ~ Resource Catalogue
      Comparable evaluation data for use in automatic machine translation evaluations. The evaluation set consists of 500 sentences translated separately by ...
    • Autshumato Multilingual Word and Phrase Translations 

      Martin Puttkammer, et al. (North-West University; Centre for Text Technology (CTexT), 2016-01-20) ~ Resource Catalogue
      Word and phrase lists aligned from English to the other official South African languages.
    • Autshumato PDF Text Extractor 

      Wildrich Fourie (North-West University; Centre for Text Technology (CTexT), 2013-06-20) ~ Resource Catalogue
      Utility application for extracting text out of a PDF document. The pages can also be extracted as images.
    • Autshumato Sesotho sa Leboa-English Translation Memory 

      Cindy McKellar, et al. (North-West University; Centre for Text Technology (CTexT), 2013-06-19) ~ Resource Catalogue
      Translation memory from Sesotho sa Leboa to English (EN-GB), in the government domain for use in the Autshumato ITE application.
    • Autshumato Setswana Monolingual Corpora 

      Cindy McKellar (North-West University; Centre for Text Technology (CTexT), 2016-10-28) ~ Resource Catalogue
      Setswana monolingual corpus as a deliverable of the Autshumato project. The data is given as a UTF-8 text file; with each sentence on a new line.
    • Autshumato Text Anonymiser 

      Martin Schlemmer, et al. (North-West University; Centre for Text Technology (CTexT), 2013-06-20) ~ Resource Catalogue
      Anonymises text by classifying and replacing sensitive information such as person names, business names, place names, monetary values, phone numbers, ...
    • Autshumato TMS 

      Martin Schlemmer, et al. (North-West University; Centre for Text Technology (CTexT), 2013-06-19) ~ Resource Catalogue
      Terminology Management System. Web application used by Terminologists and Administrators to capture, edit and export terminology.
    • Autshumato TMX Integrator 

      Martin Schlemmer, et al. (North-West University; Centre for Text Technology (CTexT), 2013-06-20) ~ Resource Catalogue
      Utility to merge multiple translation memories over a network using Subversion
    • Autshumato Xitsonga Frequency Word List 

      Wikus Pienaar, et al. (North-West University; Centre for Text Technology (CTexT), 2014-12-12) ~ Resource Catalogue
      A list of the most frequent Xitsonga words as deliverable of the Autshumato project.
    • Autshumato Xitsonga Monolingual Corpora 

      Wikus Pienaar, et al. (North-West University; Centre for Text Technology (CTexT), 2014-12-12) ~ Resource Catalogue
      Xitsonga monolingual corpus as deliverable of the Autshumato project. The data is given as a UTF-8 text file; with each sentence on a newline.
    • Bukantswe Sesotho-English Bilingual Dictionary 

      J. A. K. Olivier (North-West University, 2016-07-07) ~ Resource Catalogue
      Bilingual English-Sesotho dictionary. This dataset represents a basic Sesotho dictionary compiled in the creation of a Sesotho language resource. The ...
    • CTexT Alignment Interface 

      Ismail Lavangee (North-West University; Centre for Text Technology (CTexT), 2013-06-21) ~ Resource Catalogue
      Utility application for the manual alignment of source texts.
    • CTexT Alignment Interface Pro 

      Ismail Lavangee (North-West University; Centre for Text Technology (CTexT), 2013-06-21) ~ Resource Catalogue
      Utility application for the manual alignment of source texts. Pro version allows for the editing of the segments.
    • CTexTools 

      Martin Puttkammer, et al. (North-West University; Centre for Text Technology (CTexT), 2014-05-30) ~ Resource Catalogue
      Corpus query and manipulation tool for performing tokenisation and sentencisation; extracting frequency list and word list; searching; and extracting ...
    • CTexTools 2 

      Eiselen, Roald, et al. (North-West University, Centre for Text Technology (CTexT); South African Department of Arts and Culture, 2018-05-24) ~ Resource Catalogue
      CTexTools is a corpus query and manipulation tool primarily for the official South African languages. The tool supports the creation of frequency and ...