Please do not copy the URL from the browser for citation. The correct URL is 'https://hdl.handle.net/20.500.12185/480'
CTexTools 2
Loading...
Deposit Licenses
Date
2018-05-24
Authors
Eiselen, Roald
Puttkammer, Martin
Hocking, Justin
Kruger, Albertus
Journal Title
Journal ISSN
Volume Title
Publisher
North-West University, Centre for Text Technology (CTexT)
South African Department of Arts and Culture
South African Department of Arts and Culture
Abstract
Description
CTexTools is a corpus query and manipulation tool primarily for the official South African languages. The tool supports the creation of frequency and word lists, collocation searches and statistical analysis of corpus data. Furthermore, the tool provides automatic processing of corpus data on five levels: sentence separation, tokenisation, part of speech tagging, named entity recognition, and phrase chunking.
Available in the Help file - Output classes for named entity recognition:
B-/I-ORG Organisation, B-/I-PER Person, B-/I-LOC Location, B-/I-MISC Miscellaneous, OUT Outside
Available in the Help file - Output classes for named entity recognition:
B-/I-ORG Organisation, B-/I-PER Person, B-/I-LOC Location, B-/I-MISC Miscellaneous, OUT Outside
Citation
Collections
Verification status
Level 0