Creative Commons Attribution 2.5 South Africa License: http://creativecommons.org/licenses/by/2.5/za/legalcodeS.L. TshikotaM.E. TakalaniA. NyoniRoald Eiselen2018-02-052018-03-052018-02-052018-03-052016-04-29Eiselen, R. 2016. Government domain named entity recognition for South African languages. Proceedings of the 10th Language Resource and Evaluation Conference, Portorož, Slovenia.https://hdl.handle.net/20.500.12185/355Named entity annotated data from the NCHLT Text Resource Development: Phase II Project, annotated with PERSON, LOCATION, ORGANISATION and MISCELLANEOUS tags.25.1 Mb (zipped)TextUTF8venNCHLT Tshivenda Named Entity Annotated CorpusData041-024-166-911-319,403 annotated token count (estimated 240,000 total tokens)