Repository logoRepository logo
 

NCHLT Tshivenda Text Corpora

Loading...
Thumbnail Image

Date

2014-05-30

Authors

Martin Puttkammer
Martin Schlemmer
Wikus Pienaar
Ruan Bekker

Journal Title

Journal ISSN

Volume Title

Publisher

North-West University
Centre for Text Technology (CTexT)

Abstract

Description

Collection of source text documents, genre classified text documents, raw corpus, clean corpus, lexicon, frequency list and named-entity lists developed during the NCHLT Text project.

Keywords

Citation

Eiselen, E.R. & Puttkammer, M.J. 2014. Developing text resources for ten South African languages. (In Proceedings of the 9th International Conference on Language Resources and Evaluation, Reykjavik, Iceland. p. 3698-3703)

Verification status

Level 0