Creative Commons Attribution 2.5 South Africa License: http://creativecommons.org/licenses/by/2.5/za/legalcode

Wikus Pienaar
Wildrich Fourie
Cindy McKellar

2018-02-05
2018-03-05
2018-02-05
2018-03-05
2014-12-12

McKellar, C.A. 2014. An English-Xitsonga SMT system for the government domain. (In: Proceedings of the 2014 PRASA, RobMech and AfLaT International Joint Symposium, Cape Town, South Africa).

https://hdl.handle.net/20.500.12185/418

Xitsonga monolingual corpus, a deliverable of the Autshumato project. The data is provided as a UTF-8 text file, with one sentence per line.
NOTE: There is a newer version for an English-Xitsonga Monolingual Corpus. See https://hdl.handle.net/20.500.12185/570

951.65 kB
Text
UTF8
tso
Autshumato Xitsonga Monolingual Corpora
Data
555-156-087-666-0

58 398 Xitsonga segments.
Monolingual Lines: 58,398
Monolingual Words (excludes punctuation and numbers): 537,552
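
The record describes the corpus as a UTF-8 text file with one sentence per line, and reports a line count and a word count that excludes punctuation and numbers. The following is a minimal Python sketch of how such counts could be recomputed from the file. The function name corpus_stats and the word pattern are assumptions made for illustration; the record does not say how the original counts were produced, so the totals from this sketch may differ from the published figures.

```python
import re

# Assumption: a "word" is a run of letters, optionally joined by apostrophes
# or hyphens. Digits and punctuation are skipped. The tokenisation behind the
# published counts is not documented in the record.
WORD_RE = re.compile(r"[^\W\d_]+(?:['\-][^\W\d_]+)*")


def corpus_stats(path):
    """Return (line_count, word_count) for a one-sentence-per-line text file."""
    line_count = 0
    word_count = 0
    with open(path, encoding="utf-8") as handle:
        for line in handle:
            sentence = line.strip()
            if not sentence:
                # Blank lines are not counted as segments.
                continue
            line_count += 1
            word_count += len(WORD_RE.findall(sentence))
    return line_count, word_count
```

Calling corpus_stats with the path to the downloaded text file (the actual filename is not given in the record) returns the two counts as a tuple, which can then be compared against the figures listed above.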