Creative Commons Attribution-NoDerivatives 4.0 International: https://creativecommons.org/licenses/by-nd/4.0/Eiselen, RoaldPuttkammer, MartinHocking, JustinKruger, Albertus2018-05-242018-05-242018-05-24https://hdl.handle.net/20.500.12185/480CTexTools is a corpus query and manipulation tool primarily for the official South African languages. The tool supports the creation of frequency and word lists, collocation searches and statistical analysis of corpus data. Furthermore, the tool provides automatic processing of corpus data on five levels: sentence separation, tokenisation, part of speech tagging, named entity recognition, and phrase chunking. \n\n Available in the Help file - Output classes for named entity recognition: \n B-/I-ORG Organisation, B-/I-PER Person, B-/I-LOC Location, B-/I-MISC Miscellaneous, OUT OutsideafrText analysis toolsSouth African languagesPart of speech taggingNamed entity recognitionCorpus processingPhrase chunkingTokenisationSentence separationCTexTools 2Tools258 Mb