Title: Autshumato Monolingual Tshivenḓa Corpus
Authors: McKellar, Cindy; Puttkammer, Martin; Gaustad, Tanja; Gent, Sunny; van Heerden, Jacques
License: Creative Commons Attribution 4.0 International
Dates: 2024-03-27; 2024-03-27; 2023-12-12
URI: https://hdl.handle.net/20.500.12185/681
Description: Monolingual corpus for Tshivenḓa. The data is given as a single UTF-8 text file, with each segment on a new line.
Format: Txt
Extent: 141,426 Tshivenḓa segments and 2,870,916 Tshivenḓa words
Project: Autshumato V
Language: Tshivenḓa
Type: Monolingual text corpora
Size: 5.83 MB
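
Because the corpus is described as a single UTF-8 text file with one segment per line, it can be read line by line. The following is a minimal sketch in Python, not part of the original record. The file name in the usage note is hypothetical, and counting words by splitting on whitespace is an assumption, so the totals it produces may differ from the published segment and word counts.

```python
from typing import Iterable, Tuple


def count_segments_and_words(lines: Iterable[str]) -> Tuple[int, int]:
    """Count non-empty segments and whitespace-delimited words.

    Assumption: one segment per line, and a "word" is any run of
    non-whitespace characters. The corpus authors may have used a
    different tokenisation.
    """
    segments = 0
    words = 0
    for line in lines:
        text = line.strip()
        if not text:
            # Skip blank lines so they are not counted as segments.
            continue
        segments += 1
        words += len(text.split())
    return segments, words


def count_corpus_file(path: str) -> Tuple[int, int]:
    """Open a UTF-8 text file and count its segments and words."""
    with open(path, encoding="utf-8") as handle:
        return count_segments_and_words(handle)
```

For example, `count_corpus_file("corpus.txt")` would return a `(segments, words)` pair for a local copy of the file, where `corpus.txt` is a placeholder for whatever name the downloaded file has.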