Autshumato English-Tshivenḓa Parallel Corpora

McKellar, Cindy

Please do not copy the URL from the browser for citation. The correct URL is 'https://hdl.handle.net/20.500.12185/682'

Autshumato English-Tshivenḓa Parallel Corpora

Files

lcontent.SADILAR.BilingualCorpus(EN-VE).3.0.0.CAM.2023-12-12.en.zip (9.74 MB)

Date

2023-12-12

Authors

McKellar, Cindy

Publisher

North-West University; Centre for Text Technology (CTexT)

Description

Aligned parallel corpora for the following language pair: English-Tshivenḓa. Data was crawled from various multilingual government websites, sourced from translated material and created by translating English sentences into Tshivenḓa. The data is given as two separate UTF-8 text files, with each aligned segment on a newline.