Browsing by Title
Now showing items 70-89 of 528
Autshumato Machine Translation Evaluation Set
(North-West University; Centre for Text Technology (CTexT); Department of Arts and Culture, South Africa, 2017-12-15) ~Resource Catalogue Comparable evaluation data for use in automatic machine translation evaluations. The evaluation set consists of 500 sentences translated separately by ...
Autshumato Machine Translation Web Service (MTWS)
(Centre for Text Technology; North-West University, 2018-03-01) ~Resource Index The MTWS is a unified interface through which anyone can gain access to the MT systems developed in the Autshumato project. It can provide sentence, ...
Autshumato Monolingual Afrikaans Corpus
(CTexT® (Centre for Text Technology, North-West University), 2022-09-30) Monolingual corpus for Afrikaans. The data is given as a single UTF-8 text file, with each segment on a newline. The data was specifically selected and ...
Autshumato Monolingual isiNdebele Corpus
(North-West University; Centre for Text Technology (CTexT), 2021-01-31) Monolingual corpus for isiNdebele. The data is given as a single UTF-8 text file, with each segment on a newline.
Autshumato Monolingual isiZulu Corpus
(CTexT® (Centre for Text Technology, North-West University), 2022-09-30) Monolingual corpus for isiZulu. The data is given as a single UTF-8 text file, with each segment on a newline. The data was specifically selected and ...
Autshumato Monolingual Sepedi Corpus
(CTexT® (Centre for Text Technology, North-West University), 2022-09-30) Monolingual corpus for Sepedi. The data is given as a single UTF-8 text file, with each segment on a newline. The data was specifically selected and ...
Autshumato Monolingual Sesotho Corpus
(CTexT® (Centre for Text Technology, North-West University), 2022-09-30) Monolingual corpus for Sesotho. The data is given as a single UTF-8 text file, with each segment on a newline. The data was specifically selected and ...
Autshumato Monolingual Setswana Corpus
(CTexT® (Centre for Text Technology, North-West University), 2022-09-30) Monolingual corpus for Setswana. The data is given as a single UTF-8 text file, with each segment on a newline. The data was specifically selected and ...
Autshumato Monolingual Tshivenḓa Corpus
(North-West University; Centre for Text Technology (CTexT), 2020-09-30) Monolingual corpus for Tshivenḓa. The data is given as a single UTF-8 text file, with each segment on a newline.
Autshumato Monolingual Tshivenḓa Corpus
(North-West University; Centre for Text Technology (CTexT), 2023-12-12) Monolingual corpus for Tshivenḓa. The data is given as a single UTF-8 text file, with each segment on a newline.
Autshumato Monolingual Xitsonga Corpus
(CTexT® (Centre for Text Technology, North-West University), 2022-09-30) Monolingual corpus for Xitsonga. The data is given as a single UTF-8 text file, with each segment on a newline. The data was specifically selected and ...
Autshumato Multilingual Word and Phrase Translations
(North-West University; Centre for Text Technology (CTexT), 2016-01-20) ~Resource Catalogue Word and phrase lists aligned from English to the other official South African languages.
Autshumato PDF Text Extractor
(North-West University; Centre for Text Technology (CTexT), 2013-06-20) ~Resource Catalogue Utility application for extracting text out of a PDF document. The pages can also be extracted as images.
Autshumato Sesotho sa Leboa-English Translation Memory
(North-West University; Centre for Text Technology (CTexT), 2013-06-19) ~Resource Catalogue Translation memory from Sesotho sa Leboa to English (EN-GB), in the government domain for use in the Autshumato ITE application.
Autshumato Setswana Monolingual Corpora
(North-West University; Centre for Text Technology (CTexT), 2016-10-28) ~Resource Catalogue Setswana monolingual corpus as a deliverable of the Autshumato project. The data is given as a UTF-8 text file, with each sentence on a new line. NOTE: ...
Autshumato Text Anonymiser
(North-West University; Centre for Text Technology (CTexT), 2013-06-20) ~Resource Catalogue Anonymises text by classifying and replacing sensitive information such as person names, business names, place names, monetary values, phone numbers, ...
Autshumato TMS
(North-West University; Centre for Text Technology (CTexT), 2013-06-19) ~Resource Catalogue Terminology Management System. Web application used by terminologists and administrators to capture, edit and export terminology.
Autshumato TMX Integrator
(North-West University; Centre for Text Technology (CTexT), 2013-06-20) ~Resource Catalogue Utility to merge multiple translation memories over a network using Subversion.
Autshumato Xitsonga Frequency Word List
(North-West University; Centre for Text Technology (CTexT), 2014-12-12) ~Resource Catalogue A list of the most frequent Xitsonga words as a deliverable of the Autshumato project.
Autshumato Xitsonga Monolingual Corpora
(North-West University; Centre for Text Technology (CTexT), 2014-12-12) ~Resource Catalogue Xitsonga monolingual corpus as a deliverable of the Autshumato project. The data is given as a UTF-8 text file, with each sentence on a new line. NOTE: ...