Repository logoRepository logo
 

CTexTools 2

dc.contact.emailMartin.Puttkammer@nwu.ac.zaen_ZA
dc.contact.nameMartin Puttkammeren_ZA
dc.contributor.authorEiselen, Roald
dc.contributor.authorPuttkammer, Martin
dc.contributor.authorHocking, Justin
dc.contributor.authorKruger, Albertus
dc.date.accessioned2018-05-24T13:39:01Z
dc.date.available2018-05-24T13:39:01Z
dc.date.issued2018-05-24
dc.descriptionCTexTools is a corpus query and manipulation tool primarily for the official South African languages. The tool supports the creation of frequency and word lists, collocation searches and statistical analysis of corpus data. Furthermore, the tool provides automatic processing of corpus data on five levels: sentence separation, tokenisation, part of speech tagging, named entity recognition, and phrase chunking. \n\n Available in the Help file - Output classes for named entity recognition: \n B-/I-ORG Organisation, B-/I-PER Person, B-/I-LOC Location, B-/I-MISC Miscellaneous, OUT Outsideen_ZA
dc.format.size258 Mben_ZA
dc.identifier.urihttps://hdl.handle.net/20.500.12185/480
dc.language.isoafr
dc.language.isoeng
dc.language.isonbl
dc.language.isoxho
dc.language.isozul
dc.language.isosot
dc.language.isonso
dc.language.isotsn
dc.language.isossw
dc.language.isoven
dc.language.isotso
dc.languagesAfrikaansen_ZA
dc.languagesEnglishen_ZA
dc.languagesisiNdebeleen_ZA
dc.languagesisiXhosaen_ZA
dc.languagesisiZuluen_ZA
dc.languagesSesotho sa Leboa (Sepedi)en_ZA
dc.languagesSetswanaen_ZA
dc.languagesSesothoen_ZA
dc.languagesSiswatien_ZA
dc.languagesTshivendaen_ZA
dc.languagesXitsongaen_ZA
dc.media.typeTexten_ZA
dc.projectNCHLT Text IIIen_ZA
dc.publisherNorth-West University, Centre for Text Technology (CTexT)en_ZA
dc.publisherSouth African Department of Arts and Cultureen_ZA
dc.rights.licenseCreative Commons Attribution-NoDerivatives 4.0 International: https://creativecommons.org/licenses/by-nd/4.0/en_ZA
dc.software.requirementsJava Runtime Environment 8
dc.subjectText analysis toolsen_ZA
dc.subjectSouth African languagesen_ZA
dc.subjectPart of speech taggingen_ZA
dc.subjectNamed entity recognitionen_ZA
dc.subjectCorpus processingen_ZA
dc.subjectPhrase chunking
dc.subjectTokenisation
dc.subjectSentence separation
dc.titleCTexTools 2en_ZA
dc.typeToolsen_ZA
dc.version2.1en_ZA
local.collection.primaryResource Catalogue
local.collection.secondaryResource Index
local.urlhttps://humanities.nwu.ac.za/ctexten_ZA

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Install_CTexTools2.zip
Size:
252.61 MB
Format:
ZIP is an archive file format that supports lossless data compression. A ZIP file may contain one or more files or directories that may have been compressed.
Description:

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
1.71 KB
Format:
Item-specific license agreed upon to submission
Description: