Creative Commons Attribution 2.5 South Africa License: http://creativecommons.org/licenses/by/2.5/za/legalcodeM. SetakaRoald Eiselen2018-02-052018-03-052018-02-052018-03-052016-04-29Eiselen, R. 2016. Government domain named entity recognition for South African languages. Proceedings of the 10th Language Resource and Evaluation Conference, Portorož, Slovenia.https://hdl.handle.net/20.500.12185/334Named entity annotated data from the NCHLT Text Resource Development: Phase II Project, annotated with PERSON, LOCATION, ORGANISATION and MISCELLANEOUS tags.29.0 Mb (zipped)TextUTF8sotNCHLT Sesotho Named Entity Annotated CorpusData523-686-016-526-723,737 annotated tokens (estimated 270,000 total tokens)