Autshumato English-Setswana Parallel Corpora

Date

2016-10-28

Authors

Cindy McKellar

Publisher

North-West University
Centre for Text Technology (CTexT)

Description

Aligned English-Setswana parallel corpus. This set contains data that was translated by professional translators, data that was sourced as translated file pairs from translators, and data obtained from government websites and documents. The data is provided as six separate UTF-8 text files, with each aligned sentence pair on a new line.
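
The record does not say how the six files are named or grouped. The following is a minimal sketch for loading one aligned pair, assuming the files come as English/Setswana pairs in which line N of one file corresponds to line N of the other. The function name and the file names in the usage note are hypothetical and are not taken from the corpus.

```python
from itertools import zip_longest
from typing import Iterator, Tuple


def read_aligned_pairs(english_path: str, setswana_path: str) -> Iterator[Tuple[str, str]]:
    """Yield (english, setswana) sentence pairs from two line-aligned UTF-8 files.

    Assumption: line N of the English file is the translation of line N of
    the Setswana file. The corpus record does not confirm this layout.
    """
    with open(english_path, encoding="utf-8") as en_file, \
         open(setswana_path, encoding="utf-8") as tn_file:
        # zip_longest lets us detect files of unequal length instead of
        # silently dropping the unmatched tail, as plain zip would.
        pairs = zip_longest(en_file, tn_file)
        for line_no, (en_line, tn_line) in enumerate(pairs, start=1):
            if en_line is None or tn_line is None:
                raise ValueError(
                    f"Files differ in length; first unmatched line is {line_no}"
                )
            # Strip only line terminators so in-sentence spacing is preserved.
            yield en_line.rstrip("\r\n"), tn_line.rstrip("\r\n")
```

With placeholder file names, a call such as `list(read_aligned_pairs("english.txt", "setswana.txt"))` would return the sentence pairs in file order. If the files instead hold both languages on one line, the splitting logic would need to change accordingly.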

Verification status

Level 0