CTexTools 2

Eiselen, Roald; Puttkammer, Martin; Hocking, Justin; Kruger, Albertus

Please do not copy the URL from the browser for citation. The correct URL is 'https://hdl.handle.net/20.500.12185/480'

Files

Install_CTexTools2.zip (252.61 MB)

Date

2018-05-24

Authors

Eiselen, Roald

Puttkammer, Martin

Hocking, Justin

Kruger, Albertus

Publisher

North-West University, Centre for Text Technology (CTexT)
South African Department of Arts and Culture

Description

CTexTools is a corpus query and manipulation tool, primarily for the official South African languages. The tool supports the creation of frequency lists and word lists, collocation searches, and statistical analysis of corpus data. It also provides automatic processing of corpus data on five levels: sentence separation, tokenisation, part-of-speech tagging, named entity recognition, and phrase chunking.

Output classes for named entity recognition (also available in the Help file):

B-/I-ORG: Organisation
B-/I-PER: Person
B-/I-LOC: Location
B-/I-MISC: Miscellaneous
OUT: Outside
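
The B-/I- prefixes on the named entity classes presumably follow the common begin/inside convention, where B- marks the first token of an entity, I- marks a continuation token, and OUT marks a token outside any entity. The following is a minimal Python sketch of how such tags could be grouped back into whole entities. It is not part of CTexTools: the function name decode_entities, the (token, tag) input format, and the sample sentence are all assumptions made for illustration, and the actual output file layout is described in the tool's Help file.

```python
from typing import List, Optional, Tuple


def decode_entities(tagged: List[Tuple[str, str]]) -> List[Tuple[str, str]]:
    """Group (token, tag) pairs into (entity_text, entity_type) pairs.

    Hypothetical helper: assumes tags of the form B-XXX / I-XXX / OUT.
    """
    entities: List[Tuple[str, str]] = []
    current_tokens: List[str] = []
    current_type: Optional[str] = None

    def flush() -> None:
        # Store the entity collected so far (if any) and reset the buffer.
        nonlocal current_tokens, current_type
        if current_tokens and current_type is not None:
            entities.append((" ".join(current_tokens), current_type))
        current_tokens, current_type = [], None

    for token, tag in tagged:
        if tag == "OUT":
            # A token outside any entity closes the open entity.
            flush()
            continue
        prefix, _, entity_type = tag.partition("-")
        # A B- tag, or a change of entity type, starts a new entity.
        if prefix == "B" or entity_type != current_type:
            flush()
            current_type = entity_type
        current_tokens.append(token)

    flush()  # Close an entity that runs to the end of the input.
    return entities


# Made-up sample sentence, tagged by hand for illustration only.
sample = [
    ("Thandi", "B-PER"),
    ("Nkosi", "I-PER"),
    ("visited", "OUT"),
    ("Cape", "B-LOC"),
    ("Town", "I-LOC"),
]
print(decode_entities(sample))
```

A stray I- tag with no preceding B- tag is treated here as the start of a new entity rather than as an error, which is a lenient design choice for noisy tagger output.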