Browsing Resource Index by Project "Autshumato"
Filter by:
Now showing items 1-20 of 25
-
Autshumato Afrikaans-English Translation Memory
(North-West University; Centre for Text Technology (CTexT), 2013-06-19) ~Resource Catalogue Translation memory from Afrikaans to English (EN-GB), in the government domain for use in the Autshumato ITE application. -
Autshumato English-Afrikaans Parallel Corpora
(North-West University; Centre for Text Technology (CTexT), 2013-06-19) ~Resource Catalogue Parallel corpora aligned on sentence level through a combination of automatic and manual alignment techniques. The parallel corpora were obtained from ... -
Autshumato English-Afrikaans Translation Memory
(North-West University; Centre for Text Technology (CTexT), 2013-06-19) ~Resource Catalogue Translation memory from English (EN-GB) to Afrikaans, in the government domain for use in the Autshumato ITE application. -
Autshumato English-isiNdebele Parallel Corpora
(North-West University; Centre for Text Technology (CTexT), 2021-01-31)Aligned parallel corpora for the following language pair: English-isiNdebele. Data was crawled from various multilingual government websites, sourced ... -
Autshumato English-isiZulu Parallel Corpora
(North-West University; Centre for Text Technology (CTexT), 2013-06-19) ~Resource Catalogue Parallel corpora aligned on sentence level through a combination of automatic and manual alignment techniques. The parallel corpora were obtained from ... -
Autshumato English-isiZulu Translation Memory
(North-West University; Centre for Text Technology (CTexT), 2013-06-19) ~Resource Catalogue Translation memory from English (EN-GB) to isiZulu, in the government domain for use in the Autshumato ITE application. -
Autshumato English-Sesotho sa Leboa Parallel Corpora
(North-West University; Centre for Text Technology (CTexT), 2013-06-19) ~Resource Catalogue Parallel corpora aligned on sentence level through a combination of automatic and manual alignment techniques. The parallel corpora were obtained from ... -
Autshumato English-Sesotho sa Leboa Translation Memory
(North-West University; Centre for Text Technology (CTexT), 2013-06-19) ~Resource Catalogue Translation memory from English (EN-GB) to Sesotho sa Leboa, in the government domain for use in the Autshumato ITE application. -
Autshumato English-Setswana Parallel Corpora
(North-West University; Centre for Text Technology (CTexT), 2016-10-28) ~Resource Catalogue Aligned English-Setswana parallel corpus. This set contains data that was translated by professional translators, data that was sourced as translated ... -
Autshumato English-Tshivenḓa Parallel Corpora
(North-West University; Centre for Text Technology (CTexT), 2020-09-30)Aligned parallel corpora for the following language pair: English-Tshivenḓa. Data was crawled from various multilingual government websites, sourced ... -
Autshumato English-Xitsonga Manually Translated Parallel Corpora
(North-West University; Centre for Text Technology (CTexT), 2014-12-12) ~Resource Catalogue Aligned English-Xitsonga parallel corpus. The data is given as two seperate UTF-8 text files; with each segment on a newline. -
Autshumato English-Xitsonga Parallel Corpora
(North-West University; Centre for Text Technology (CTexT), 2014-12-11) ~Resource Catalogue Aligned English-Xitsonga parallel corpus. The data is given as two seperate UTF-8 text files; with each segment on a newline. -
Autshumato isiZulu-English Translation Memory
(North-West University; Centre for Text Technology (CTexT), 2013-06-19) ~Resource Catalogue Translation memory from IsiZulu to English (EN-GB), in the government domain for use in the Autshumato ITE application. -
Autshumato ITE
(North-West University; Centre for Text Technology (CTexT), 2013-06-19) ~Resource Catalogue Integrated Translation Environment. Combines multiple translation tools into one environment. -
Autshumato Monolingual isiNdebele Corpus
(North-West University; Centre for Text Technology (CTexT), 2021-01-31)Monolingual corpus for isiNdebele. The data is given as a single UTF-8 text file, with each segment on a newline. -
Autshumato Monolingual Tshivenḓa Corpus
(North-West University; Centre for Text Technology (CTexT), 2020-09-30)Monolingual corpus for Tshivenḓa. The data is given as a single UTF-8 text file, with each segment on a newline. -
Autshumato Multilingual Word and Phrase Translations
(North-West University; Centre for Text Technology (CTexT), 2016-01-20) ~Resource Catalogue Word and phrase lists aligned from English to the other official South African languages. -
Autshumato PDF Text Extractor
(North-West University; Centre for Text Technology (CTexT), 2013-06-20) ~Resource Catalogue Utility application for extracting text out of a PDF document. The pages can also be extracted as images. -
Autshumato Sesotho sa Leboa-English Translation Memory
(North-West University; Centre for Text Technology (CTexT), 2013-06-19) ~Resource Catalogue Translation memory from Sesotho sa Leboa to English (EN-GB), in the government domain for use in the Autshumato ITE application. -
Autshumato Setswana Monolingual Corpora
(North-West University; Centre for Text Technology (CTexT), 2016-10-28) ~Resource Catalogue Setswana monolingual corpus as a deliverable of the Autshumato project. The data is given as a UTF-8 text file; with each sentence on a new line. NOTE: ...