Autshumato English-Setswana Parallel Corpora

McKellar, Cindy

Please do not copy the URL from the browser for citation. The correct URL is 'https://hdl.handle.net/20.500.12185/578'

Autshumato English-Setswana Parallel Corpora

Files

Autshumato.BilingualCorpus(English-Setswana).v2.0.zip (18.06 MB)

Deposit Licenses

license.txt (3.23 KB)

Date

2022-09-30

Authors

McKellar, Cindy

Publisher

CTexT® (Centre for Text Technology, North-West University)

Description

Aligned parallel corpora for the language pair English-Setswana. The data is given as two separate UTF-8 text files, with each aligned segment on a newline. The data was specifically selected and formatted for use in the training of machine translation systems. Further clean-up and processing might be required depending on the task the data is reused for.

Keywords

Autshumato, English, Setswana

License

Creative Commons Attribution 4.0 International

URI

https://hdl.handle.net/20.500.12185/578

Collections

Resource Index

Verification status

Level 0

Full item page

Autshumato English-Setswana Parallel Corpora

Files

Deposit Licenses

Date

Authors

Journal Title

Journal ISSN

Volume Title

Publisher

Abstract

Description

Keywords

Citation

License

URI

Collections

Verification status