Creative Commons Attribution 2.5 South Africa License: http://creativecommons.org/licenses/by/2.5/za/legalcodeB.B. MalangwaneM.N. KekanaS.S. SedibeB.C. NdhlovuRoald Eiselen2018-02-052018-03-052018-02-052018-03-052016-04-29Eiselen, R. 2016. South African language resources: phrase chunkers. Proceedings of the 10th Language Resource and Evaluation Conference, Portorož, Slovenia.https://hdl.handle.net/20.500.12185/347Phrase chunk annotated data for the NCHLT Text Resource Development: Phase II Project. The phrase chunk annotated data is a subset of the 50,000 tokens annotated during the NCHLT text resource development project and consists of a minimum of 15,000 tokens annotated as one of the six phrase types described in the protocol.1.87 Mb (zipped)TextUTF8sswNCHLT Siswati Phrase Chunk Annotated CorpusData115-580-004-414-816,008 Phrase chunk token count