Repository logoRepository logo
 

Autshumato English-isiNdebele Parallel Corpora

dc.contact.emailsunny.gent@nwu.ac.zaen_ZA
dc.contact.nameSunny Genten_ZA
dc.contributor.authorMcKellar, Cindy
dc.contributor.otherPuttkammer, Martin
dc.contributor.otherGaustad, Tanja
dc.contributor.othervan Heerden, Jacques
dc.contributor.otherGent, Sunny
dc.date.accessioned2022-12-15T06:35:04Z
dc.date.available2022-12-15T06:35:04Z
dc.date.issued2021-01-31
dc.descriptionAligned parallel corpora for the following language pair: English-isiNdebele. Data was crawled from various multilingual government websites, sourced from translated material and created by translating English sentences into isiNdebele.en_ZA
dc.formatTxten_ZA
dc.format.extentSegments: 128,382 English Words: 2,067,749 isiNdebele Words: 1,490,423en_ZA
dc.format.size10.2 Mben_ZA
dc.identifier.urihttps://hdl.handle.net/20.500.12185/572
dc.languagesEnglishen_ZA
dc.languagesisiNdebeleen_ZA
dc.media.categoryMultilingual text corpora: Aligneden_ZA
dc.projectAutshumatoen_ZA
dc.publisherNorth-West University; Centre for Text Technology (CTexT)en_ZA
dc.rights.licenseCreative Commons Attribution 4.0 Internationalen_ZA
dc.subjectAutshumatoen_ZA
dc.subjectParallel Corporaen_ZA
dc.subjectisiNdebeleen_ZA
dc.titleAutshumato English-isiNdebele Parallel Corporaen_ZA
dc.versionVersion: 1.0 (Final)en_ZA
local.urlhttp://humanities.nwu.ac.za/ctexten_ZA

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Corpus.DACB5.FinalDrop-Bilingual(EN-NR).1.0.0.CAM.2021-01-31.nr.zip
Size:
10.29 MB
Format:
ZIP is an archive file format that supports lossless data compression. A ZIP file may contain one or more files or directories that may have been compressed.
Description:
Final Drop-Bilingual

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
3.23 KB
Format:
Item-specific license agreed upon to submission
Description:

Collections