NCHLT Siswati Phrase Chunk Annotated Corpus

B.B. Malangwane; M.N. Kekana; S.S. Sedibe; B.C. Ndhlovu; Roald Eiselen

Please do not copy the URL from the browser for citation. The correct URL is 'https://hdl.handle.net/20.500.12185/347'

NCHLT Siswati Phrase Chunk Annotated Corpus

Files

nchlt_siswati_phrase_chunk_annotated_corpus.zip (1.87 MB)

Date

2016-04-29

Authors

B.B. Malangwane

M.N. Kekana

S.S. Sedibe

B.C. Ndhlovu

Roald Eiselen

Publisher

North-West University
Centre for Text Technology (CTexT)

Description

Phrase chunk annotated data for the NCHLT Text Resource Development: Phase II Project. The phrase chunk annotated data is a subset of the 50,000 tokens annotated during the NCHLT text resource development project and consists of a minimum of 15,000 tokens annotated as one of the six phrase types described in the protocol.

Citation

Eiselen, R. 2016. South African language resources: phrase chunkers. Proceedings of the 10th Language Resource and Evaluation Conference, Portorož, Slovenia.

License

Creative Commons Attribution 2.5 South Africa License

URI

https://hdl.handle.net/20.500.12185/347

Collections

Resource Catalogue
Resource Index

Verification status

Level 0

Full item page

NCHLT Siswati Phrase Chunk Annotated Corpus

Files

Date

Authors

Journal Title

Journal ISSN

Volume Title

Publisher

Abstract

Description

Keywords

Citation

License

URI

Collections

Verification status