Title: Autshumato English-Sesotho sa Leboa Parallel Corpora
Authors: D.P. Snyman; Cindy McKellar; Handré Groenewald
License: Creative Commons Attribution-NonCommercial-ShareAlike 2.5 South Africa: http://creativecommons.org/licenses/by-nc-sa/2.5/za/
Dates: 2018-02-05; 2018-03-05; 2018-02-05; 2018-03-05; 2013-06-19
Handle: https://hdl.handle.net/20.500.12185/402
Identifier: 954-612-592-883-6
Description: Parallel corpora aligned at sentence level through a combination of automatic and manual alignment techniques. The parallel corpora were obtained from the SA government domain.
Type: Data
Language: eng
Size: 2.6 Mb
Encoding: UTF-8
Annotation: Aligned; Sentence segmented
Extent: TEXT: 44 981 sentences (tokens)