Creative Commons Attribution 2.5 South Africa License: http://creativecommons.org/licenses/by/2.5/za/legalcode
R.S. Pretorius
Roald Eiselen
2018-02-05
2018-03-05
2018-02-05
2018-03-05
2016-04-29
Eiselen, R. 2016. South African language resources: phrase chunkers. Proceedings of the 10th Language Resource and Evaluation Conference, Portorož, Slovenia.
https://hdl.handle.net/20.500.12185/342
Phrase chunk annotated data for the NCHLT Text Resource Development: Phase II Project. The phrase chunk annotated data is a subset of the 50,000 tokens annotated during the NCHLT text resource development project and consists of a minimum of 15,000 tokens annotated as one of the six phrase types described in the protocol.
1.65 Mb (zipped)
Text
UTF8
tsn
NCHLT Setswana Phrase Chunk Annotated Corpus
Data
534-964-876-358-1
15,774 Phrase chunk token count