NCHLT Tshivenda Phrase Chunk Annotated Corpus

S.L. Tshikota; M.E. Takalani; A. Nyoni; Roald Eiselen

Please do not copy the URL from the browser for citation. The correct URL is 'https://hdl.handle.net/20.500.12185/356'

Files

nchlt_tshivenda_phrase_chunk_annotated_corpus.zip (1.68 MB)

Date

2016-04-29

Authors

S.L. Tshikota

M.E. Takalani

A. Nyoni

Roald Eiselen

Publisher

North-West University
Centre for Text Technology (CTexT)

Description

Phrase chunk annotated data for the NCHLT Text Resource Development: Phase II Project. The data is a subset of the 50,000 tokens annotated during the NCHLT text resource development project and consists of at least 15,000 tokens, each annotated with one of the six phrase types described in the protocol.

Citation

Eiselen, R. 2016. South African language resources: phrase chunkers. Proceedings of the 10th Language Resource and Evaluation Conference, Portorož, Slovenia.

License

Creative Commons Attribution 2.5 South Africa License

URI

https://hdl.handle.net/20.500.12185/356

Collections

Resource Catalogue
Resource Index

Verification status

Level 0
