Autshumato English-isiZulu Parallel Corpora
License agreement
By downloading this resource I accept and agree to the terms of use and the associated license conditions under which the resource is distributed.
Download
MD5: 21d1c4279c62cc10e2154f49e5eee6ee
License agreement
By downloading this resource I accept and agree to the terms of use and the associated license conditions under which the resource is distributed.
Collections
- Resource Index [412]
Author(s)
McKellar, Cindy
Metadata
Show full item recordDescription
Aligned parallel corpora for the language pair English-isiZulu. The data is given as two separate UTF-8 text files, with each aligned segment on a newline. The data was specifically selected and formatted for use in the training of machine translation systems. Further clean-up and processing might be required depending on the task the data is reused for.
Contact person
Sunny GentContact person's e-mail address
sunny.gent@nwu.ac.zaPublisher(s)
CTexT® (Centre for Text Technology, North-West University)