Please do not copy the URL from the browser for citation. The correct URL is 'https://hdl.handle.net/20.500.12185/583'
Autshumato Monolingual Sesotho Corpus
Loading...
Deposit Licenses
Date
2022-09-30
Authors
McKellar, Cindy
Journal Title
Journal ISSN
Volume Title
Publisher
CTexT® (Centre for Text Technology, North-West University)
Abstract
Description
Monolingual corpus for Sesotho. The data is given as a single UTF-8 text file, with each segment on a newline. The data was specifically selected and formatted for use in the training of machine translation systems. Further clean-up and processing might be required depending on the task the data is reused for.
Keywords
Citation
License
Creative Commons Attribution 4.0 International
Collections
Verification status
Level 0