Repository logoRepository logo
 

Autshumato English-isiZulu Parallel Corpora

dc.contact.emailsunny.gent@nwu.ac.za
dc.contact.nameSunny Gent
dc.contributor.authorD.P. Snyman
dc.contributor.authorCindy McKellar
dc.contributor.authorHandré Groenewald
dc.date.accessioned2018-02-05T20:25:01Z
dc.date.accessioned2018-03-05T17:49:18Z
dc.date.available2018-02-05T20:25:01Z
dc.date.available2018-03-05T17:49:18Z
dc.date.issued2013-06-19
dc.descriptionParallel corpora aligned on sentence level through a combination of automatic and manual alignment techniques. The parallel corpora were obtained from the SA government domain. NOTE: There is a newer version for English-isiZulu Parallel Corpora. See https://hdl.handle.net/20.500.12185/575
dc.format.extent2 Mb
dc.format.mediumUTF8
dc.format.mediumAligned
dc.format.mediumSentence segmented
dc.format.sizeTEXT: 35 490 sentences (tokens)
dc.identifier.islrn101-618-922-810-4
dc.identifier.urihttps://hdl.handle.net/20.500.12185/399
dc.language.isoeng
dc.language.isozul
dc.languagesEnglish
dc.languagesisiZulu
dc.media.categoryMultilingual text corpora: Aligned
dc.media.typeText
dc.projectAutshumato
dc.publisherNorth-West University
dc.publisherCentre for Text Technology (CTexT)
dc.rights.licenseCreative Commons Attribution-NonCommercial-ShareAlike 2.5 South Africa: http://creativecommons.org/licenses/by-nc-sa/2.5/za/
dc.sourceGovernment Documents
dc.stratum100% government domain
dc.titleAutshumato English-isiZulu Parallel Corpora
dc.typeData
dc.version1
local.collection.primaryResource Catalogue
local.collection.secondaryResource Index

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
en-zu.release.zip
Size:
1.97 MB
Format:
ZIP is an archive file format that supports lossless data compression. A ZIP file may contain one or more files or directories that may have been compressed.