Search
Now showing items 11-20 of 25
Autshumato English-Setswana Parallel Corpora
(North-West University; Centre for Text Technology (CTexT), 2016-10-28) ~ - Resource Catalogue
Aligned English-Setswana parallel corpus. This set contains data that was translated by professional translators, data that was sourced as translated ...
Autshumato Text Anonymiser
(North-West University; Centre for Text Technology (CTexT), 2013-06-20) ~ - Resource Catalogue
Anonymises text by classifying and replacing sensitive information such as person names, business names, place names, monetary values, phone numbers, ...
Autshumato English-isiZulu Parallel Corpora
(North-West University; Centre for Text Technology (CTexT), 2013-06-19) ~ - Resource Catalogue
Parallel corpora aligned on sentence level through a combination of automatic and manual alignment techniques. The parallel corpora were obtained from ...
Autshumato Afrikaans-English Translation Memory
(North-West University; Centre for Text Technology (CTexT), 2013-06-19) ~ - Resource Catalogue
Translation memory from Afrikaans to English (EN-GB), in the government domain for use in the Autshumato ITE application.
Autshumato Monolingual isiNdebele Corpus
(North-West University; Centre for Text Technology (CTexT), 2021-01-31)
Monolingual corpus for isiNdebele. The data is given as a single UTF-8 text file, with each segment on a newline.
Autshumato Sesotho sa Leboa-English Translation Memory
(North-West University; Centre for Text Technology (CTexT), 2013-06-19) ~ - Resource Catalogue
Translation memory from Sesotho sa Leboa to English (EN-GB), in the government domain for use in the Autshumato ITE application.
Autshumato isiZulu-English Translation Memory
(North-West University; Centre for Text Technology (CTexT), 2013-06-19) ~ - Resource Catalogue
Translation memory from IsiZulu to English (EN-GB), in the government domain for use in the Autshumato ITE application.
Autshumato English-Afrikaans Parallel Corpora
(North-West University; Centre for Text Technology (CTexT), 2013-06-19) ~ - Resource Catalogue
Parallel corpora aligned on sentence level through a combination of automatic and manual alignment techniques. The parallel corpora were obtained from ...
Autshumato English-Xitsonga Manually Translated Parallel Corpora
(North-West University; Centre for Text Technology (CTexT), 2014-12-12) ~ - Resource Catalogue
Aligned English-Xitsonga parallel corpus. The data is given as two seperate UTF-8 text files; with each segment on a newline.
Autshumato TMX Integrator
(North-West University; Centre for Text Technology (CTexT), 2013-06-20) ~ - Resource Catalogue
Utility to merge multiple translation memories over a network using Subversion