Creative Commons Attribution 2.5 South Africa License: http://creativecommons.org/licenses/by/2.5/za/legalcodeCindy McKellarRoald EiselenWikus Pienaar2018-02-052018-03-052018-02-052018-03-052016-10-28https://hdl.handle.net/20.500.12185/413Setswana monolingual corpus as a deliverable of the Autshumato project. The data is given as a UTF-8 text file; with each sentence on a new line. NOTE: There is a newer version for English-Setswana Monolingual Corpus. See https://hdl.handle.net/20.500.12185/5841.52 Mb (zipped)TextUTF8tsnAutshumato Setswana Monolingual CorporaData818-990-855-312-3Monolingual Lines: 38 205. Monolingual Words (excludes punctuation and numbers): 879 248