Filter by:

Now showing items 55-74 of 427

Filter options

    • Autshumato English-Sepedi Paralle Corpora 

      McKellar, Cindy (CTexT® (Centre for Text Technology, North-West University), 2022-09-30)
      Aligned parallel corpora for the language pair English-Sepedi. The data is given as two separate UTF-8 text files, with each aligned segment on a newline. ...
    • Autshumato English-Sesotho Parallel Corpora 

      McKellar, Cindy (CTexT® (Centre for Text Technology, North-West University), 2022-09-30)
      Aligned parallel corpora for the language pair English-Sesotho. The data is given as two separate UTF-8 text files, with each aligned segment on a ...
    • Autshumato English-Sesotho sa Leboa Parallel Corpora 

      D.P. Snyman, et al. (North-West University; Centre for Text Technology (CTexT), 2013-06-19) ~ Resource Catalogue
      Parallel corpora aligned on sentence level through a combination of automatic and manual alignment techniques. The parallel corpora were obtained from ...
    • Autshumato English-Sesotho sa Leboa Translation Memory 

      Cindy McKellar, et al. (North-West University; Centre for Text Technology (CTexT), 2013-06-19) ~ Resource Catalogue
      Translation memory from English (EN-GB) to Sesotho sa Leboa, in the government domain for use in the Autshumato ITE application.
    • Autshumato English-Setswana Parallel Corpora 

      Cindy McKellar (North-West University; Centre for Text Technology (CTexT), 2016-10-28) ~ Resource Catalogue
      Aligned English-Setswana parallel corpus. This set contains data that was translated by professional translators, data that was sourced as translated ...
    • Autshumato English-Setswana Parallel Corpora 

      McKellar, Cindy (CTexT® (Centre for Text Technology, North-West University), 2022-09-30)
      Aligned parallel corpora for the language pair English-Setswana. The data is given as two separate UTF-8 text files, with each aligned segment on a ...
    • Autshumato English-Tshivenḓa Parallel Corpora 

      McKellar, Cindy (North-West University; Centre for Text Technology (CTexT), 2020-09-30)
      Aligned parallel corpora for the following language pair: English-Tshivenḓa. Data was crawled from various multilingual government websites, sourced ...
    • Autshumato English-Xitsonga Manually Translated Parallel Corpora 

      Wikus Pienaar, et al. (North-West University; Centre for Text Technology (CTexT), 2014-12-12) ~ Resource Catalogue
      Aligned English-Xitsonga parallel corpus. The data is given as two seperate UTF-8 text files; with each segment on a newline.
    • Autshumato English-Xitsonga Parallel Corpora 

      Wikus Pienaar, et al. (North-West University; Centre for Text Technology (CTexT), 2014-12-11) ~ Resource Catalogue
      Aligned English-Xitsonga parallel corpus. The data is given as two seperate UTF-8 text files; with each segment on a newline.
    • Autshumato English-Xitsonga Parallel Corpora 

      McKellar, Cindy (CTexT® (Centre for Text Technology, North-West University), 2022-09-30)
      Aligned parallel corpora for the language pair English-Xitsonga. The data is given as two separate UTF-8 text files, with each aligned segment on a ...
    • Autshumato isiZulu-English Translation Memory 

      Cindy McKellar, et al. (North-West University; Centre for Text Technology (CTexT), 2013-06-19) ~ Resource Catalogue
      Translation memory from IsiZulu to English (EN-GB), in the government domain for use in the Autshumato ITE application.
    • Autshumato ITE 

      Martin Schlemmer, et al. (North-West University; Centre for Text Technology (CTexT), 2013-06-19) ~ Resource Catalogue
      Integrated Translation Environment. Combines multiple translation tools into one environment.
    • Autshumato Machine Translation Evaluation Set 

      McKellar, Cindy Arlene (North-West University; Centre for Text Technology (CTexT); Department of Arts and Culture, South Africa, 2017-12-15) ~ Resource Catalogue
      Comparable evaluation data for use in automatic machine translation evaluations. The evaluation set consists of 500 sentences translated separately by ...
    • Autshumato Machine Translation Web Service (MTWS) 

      Wildrich Fourie, et al. (Centre for Text Technology; North-West University, 2018-03-01) ~ Resource Index
      The MTWS is a unified interface through which anyone can gain access to the MT systems developed in the Autshumato project. It can provide sentence, ...
    • Autshumato Monolingual Afrikaans Corpus 

      McKellar, Cindy (CTexT® (Centre for Text Technology, North-West University), 2022-09-30)
      Monolingual corpus for Afrikaans. The data is given as a single UTF-8 text file, with each segment on a newline. The data was specifically selected and ...
    • Autshumato Monolingual isiNdebele Corpus 

      McKellar, Cindy (North-West University; Centre for Text Technology (CTexT), 2021-01-31)
      Monolingual corpus for isiNdebele. The data is given as a single UTF-8 text file, with each segment on a newline.
    • Autshumato Monolingual isiZulu Corpus 

      McKellar, Cindy (CTexT® (Centre for Text Technology, North-West University), 2022-09-30)
      Monolingual corpus for isiZulu. The data is given as a single UTF-8 text file, with each segment on a newline. The data was specifically selected and ...
    • Autshumato Monolingual Sepedi Corpus 

      McKellar, Cindy (CTexT® (Centre for Text Technology, North-West University), 2022-09-30)
      Monolingual corpus for Sepedi. The data is given as a single UTF-8 text file, with each segment on a newline. The data was specifically selected and ...
    • Autshumato Monolingual Sesotho Corpus 

      McKellar, Cindy (CTexT® (Centre for Text Technology, North-West University), 2022-09-30)
      Monolingual corpus for Sesotho. The data is given as a single UTF-8 text file, with each segment on a newline. The data was specifically selected and ...
    • Autshumato Monolingual Setswana Corpus 

      McKellar, Cindy (CTexT® (Centre for Text Technology, North-West University), 2022-09-30)
      Monolingual corpus for Setswana. The data is given as a single UTF-8 text file, with each segment on a newline. The data was specifically selected and ...