NCHLT Sesotho Text Corpora

Martin Puttkammer; Martin Schlemmer; Wikus Pienaar; Ruan Bekker

Please do not copy the URL from the browser for citation. The correct URL is 'https://hdl.handle.net/20.500.12185/336'

NCHLT Sesotho Text Corpora

Files

corpora.nchlt.st.zip (9.92 MB)

Date

2014-05-30

Authors

Martin Puttkammer

Martin Schlemmer

Wikus Pienaar

Ruan Bekker

Publisher

North-West University
Centre for Text Technology (CTexT)

Description

Collection of source text documents, genre classified text documents, raw corpus, clean corpus, lexicon, frequency list and named-entity lists developed during the NCHLT Text project.

Citation

Eiselen, E.R. & Puttkammer, M.J. 2014. Developing text resources for ten South African languages. (In Proceedings of the 9th International Conference on Language Resources and Evaluation, Reykjavik, Iceland. p. 3698-3703)

License

Creative Commons Attribution 2.5 South Africa License

URI

https://hdl.handle.net/20.500.12185/336

Collections

Resource Catalogue
Resource Index

Verification status

Level 0

Full item page

NCHLT Sesotho Text Corpora

Files

Date

Authors

Journal Title

Journal ISSN

Volume Title

Publisher

Abstract

Description

Keywords

Citation

License

URI

Collections

Verification status