CTexTools 2

Date

2018-05-24

Authors

Eiselen, Roald
Puttkammer, Martin
Hocking, Justin
Kruger, Albertus

Publisher

North-West University, Centre for Text Technology (CTexT)
South African Department of Arts and Culture

Description

CTexTools is a corpus query and manipulation tool, primarily for the official South African languages. The tool supports the creation of frequency and word lists, collocation searches, and statistical analysis of corpus data. It also provides automatic processing of corpus data on five levels: sentence separation, tokenisation, part-of-speech tagging, named entity recognition, and phrase chunking.

Output classes for named entity recognition (available in the Help file):
B-/I-ORG: Organisation
B-/I-PER: Person
B-/I-LOC: Location
B-/I-MISC: Miscellaneous
OUT: Outside
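
The B-/I- prefixes in the class list above resemble the common begin/inside tagging convention, in which B- marks the first token of a named entity and I- marks each following token of the same entity. The sketch below shows how output tagged this way could be grouped into whole entities. It is an illustration only: the function name `decode_entities`, the parallel token and tag lists, and the example sentence are assumptions, and the record does not specify the actual file format CTexTools writes.

```python
from typing import List, Optional, Tuple


def decode_entities(tokens: List[str], tags: List[str]) -> List[Tuple[str, str]]:
    """Group B-/I- tagged tokens into (entity_text, entity_class) pairs.

    Hypothetical helper: assumes one tag per token, with B- opening an
    entity, I- continuing it, and OUT marking tokens outside any entity.
    """
    entities: List[Tuple[str, str]] = []
    current_tokens: List[str] = []
    current_class: Optional[str] = None

    def flush() -> None:
        # Store the entity collected so far, if any, and reset the buffer.
        nonlocal current_tokens, current_class
        if current_tokens and current_class is not None:
            entities.append((" ".join(current_tokens), current_class))
        current_tokens, current_class = [], None

    for token, tag in zip(tokens, tags):
        if tag.startswith("B-"):
            # A B- tag always starts a new entity.
            flush()
            current_tokens, current_class = [token], tag[2:]
        elif tag.startswith("I-") and current_class == tag[2:]:
            # An I- tag of the same class extends the open entity.
            current_tokens.append(token)
        elif tag.startswith("I-"):
            # An I- tag with no matching open entity is treated as a new one.
            flush()
            current_tokens, current_class = [token], tag[2:]
        else:
            # OUT (or any other tag) closes the open entity.
            flush()

    flush()
    return entities


# Made-up example sentence and tags, not actual CTexTools output.
tokens = ["Nelson", "Mandela", "visited", "Cape", "Town", "."]
tags = ["B-PER", "I-PER", "OUT", "B-LOC", "I-LOC", "OUT"]
print(decode_entities(tokens, tags))
```

Treating a stray I- tag as the start of a new entity is a lenient design choice; a stricter reader could report it as a tagging error instead.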

Verification status

Level 0