NCHLT Sepedi Phrase Chunk Annotated Corpus

Date

2016-04-29

Authors

D.J. Prinsloo
Roald Eiselen

Publisher

North-West University
Centre for Text Technology (CTexT)

Description

Phrase chunk annotated data for the NCHLT Text Resource Development: Phase II Project. The data is a subset of the 50,000 tokens annotated during the NCHLT text resource development project and consists of at least 15,000 tokens, each annotated as one of the six phrase types described in the protocol.

Citation

Eiselen, R. 2016. South African language resources: phrase chunkers. Proceedings of the 10th Language Resources and Evaluation Conference, Portorož, Slovenia.

Verification status

Level 0