Please do not copy the URL from the browser for citation. The correct URL is 'https://hdl.handle.net/20.500.12185/354'
NCHLT Tshivenda Lemmatiser
Loading...
Date
2014-05-30
Authors
 Martin Puttkammer 
 Martin Schlemmer 
 Ruan Bekker 
Journal Title
Journal ISSN
Volume Title
Publisher
North-West University
Centre for Text Technology (CTexT)
Centre for Text Technology (CTexT)
Abstract
Description
Lemmatiser developed during the NCHLT Text project.
Available in the Readme.txt - Input format: Text data (encoding: UTF8 without BOM), one lowercase token per line. Output format: "Token tab Lemma".
Available in the Readme.txt - Input format: Text data (encoding: UTF8 without BOM), one lowercase token per line. Output format: "Token tab Lemma".
Keywords
Citation
Eiselen, E.R. & Puttkammer, M.J. 2014. Developing text resources for ten South African languages. (In Proceedings of the 9th International Conference on Language Resources and Evaluation, Reykjavik, Iceland. p. 3698-3703)
Collections
Verification status
Level 0


