Creative Commons Attribution 2.5 South Africa License: http://creativecommons.org/licenses/by/2.5/za/legalcodeD.J. PrinslooRoald Eiselen2018-02-052018-03-052018-02-052018-03-052016-04-29Eiselen, R. 2016. Government domain named entity recognition for South African languages. Proceedings of the 10th Language Resource and Evaluation Conference, Portorož, Slovenia.https://hdl.handle.net/20.500.12185/328Named entity annotated data from the NCHLT Text Resource Development: Phase II Project, annotated with PERSON, LOCATION, ORGANISATION and MISCELLANEOUS tags.21.0 Mb (zipped)TextUTF8nsoNCHLT Sepedi Named Entity Annotated CorpusData157-459-902-133-524,058 annotated tokens (estimated 200,000 total tokens)