Show simple item record

Autshumato Monolingual Tshivenḓa Corpus
Monolingual corpus for Tshivenḓa. The data is given as a single UTF-8 text file, with each segment on a newline.
Sunny Gent
sunny.gent@nwu.ac.za
North-West University; Centre for Text Technology (CTexT)
Creative Commons Attribution 4.0 International
Tshivenda
McKellar, Cindy
Puttkammer, Martin; Gaustad, Tanja; Gent, Sunny; van Heerden, Jacques
Autshumato V; Tshivenḓa; Monolingual text corpora
https://hdl.handle.net/20.500.12185/681
Text
Multilingual text corpora
141,426 Tshivenḓa Segments & 2,870,916 Tshivenḓa Words
3.0 (Final)
5.83Mb
Autshumato
2024-03-27T08:27:10Z
2024-03-27T08:27:10Z
2023-12-12


Files in this item

Thumbnail

This item appears in the following Collection(s)

  • Resource Catalogue [349]
    A collection of language resources available for download from the RMA of SADiLaR. The collection mostly consists of resources developed with funding from the Department of Arts and Culture.

Show simple item record