Creative Commons Attribution 4.0 InternationalMcKellar, CindyPuttkammer, MartinGaustad, Tanjavan Heerden, JacquesGent, Sunny2022-12-152022-12-152020-09-30https://hdl.handle.net/20.500.12185/571Monolingual corpus for Tshivenḓa. The data is given as a single UTF-8 text file, with each segment on a newline.TxtTshivenḓa segments: 78,952 Tshivenḓa words: 1,791,997AutshumatoTshivenḓaMonolingualAutshumato Monolingual Tshivenḓa Corpus3.55 Mb