CTexTools 2
Please do not copy the URL from the browser for citation. The correct URL is 'https://hdl.handle.net/20.500.12185/480'
dc.contact.email | Martin.Puttkammer@nwu.ac.za | en_ZA |
dc.contact.name | Martin Puttkammer | en_ZA |
dc.contributor.author | Eiselen, Roald | |
dc.contributor.author | Puttkammer, Martin | |
dc.contributor.author | Hocking, Justin | |
dc.contributor.author | Kruger, Albertus | |
dc.date.accessioned | 2018-05-24T13:39:01Z | |
dc.date.available | 2018-05-24T13:39:01Z | |
dc.date.issued | 2018-05-24 | |
dc.description | CTexTools is a corpus query and manipulation tool, primarily for the official South African languages. The tool supports the creation of frequency and word lists, collocation searches, and statistical analysis of corpus data. It also provides automatic processing of corpus data on five levels: sentence separation, tokenisation, part-of-speech tagging, named entity recognition, and phrase chunking. Output classes for named entity recognition (also listed in the Help file): B-/I-ORG (Organisation), B-/I-PER (Person), B-/I-LOC (Location), B-/I-MISC (Miscellaneous), OUT (Outside). | en_ZA |
dc.format.size | 258 MB | en_ZA |
dc.identifier.uri | https://hdl.handle.net/20.500.12185/480 | |
dc.language.iso | afr | |
dc.language.iso | eng | |
dc.language.iso | nbl | |
dc.language.iso | xho | |
dc.language.iso | zul | |
dc.language.iso | sot | |
dc.language.iso | nso | |
dc.language.iso | tsn | |
dc.language.iso | ssw | |
dc.language.iso | ven | |
dc.language.iso | tso | |
dc.languages | Afrikaans | en_ZA |
dc.languages | English | en_ZA |
dc.languages | isiNdebele | en_ZA |
dc.languages | isiXhosa | en_ZA |
dc.languages | isiZulu | en_ZA |
dc.languages | Sesotho sa Leboa (Sepedi) | en_ZA |
dc.languages | Setswana | en_ZA |
dc.languages | Sesotho | en_ZA |
dc.languages | Siswati | en_ZA |
dc.languages | Tshivenda | en_ZA |
dc.languages | Xitsonga | en_ZA |
dc.media.type | Text | en_ZA |
dc.project | NCHLT Text III | en_ZA |
dc.publisher | North-West University, Centre for Text Technology (CTexT) | en_ZA |
dc.publisher | South African Department of Arts and Culture | en_ZA |
dc.rights.license | Creative Commons Attribution-NoDerivatives 4.0 International: https://creativecommons.org/licenses/by-nd/4.0/ | en_ZA |
dc.software.requirements | Java Runtime Environment 8 | |
dc.subject | Text analysis tools | en_ZA |
dc.subject | South African languages | en_ZA |
dc.subject | Part of speech tagging | en_ZA |
dc.subject | Named entity recognition | en_ZA |
dc.subject | Corpus processing | en_ZA |
dc.subject | Phrase chunking | |
dc.subject | Tokenisation | |
dc.subject | Sentence separation | |
dc.title | CTexTools 2 | en_ZA |
dc.type | Tools | en_ZA |
dc.version | 2.1 | en_ZA |
local.collection.primary | Resource Catalogue | |
local.collection.secondary | Resource Index | |
local.url | https://humanities.nwu.ac.za/ctext | en_ZA |
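The named entity output classes listed in the description follow a begin/inside labelling pattern (B- marks the first token of an entity, I- a continuation, OUT a token outside any entity). As a minimal sketch of how such labels could be grouped into whole entities, the Python below assumes the tagger output has already been read into (token, tag) pairs. That input shape, the function name `group_entities`, and the sample sentence are illustrative assumptions, not the documented CTexTools output format or API.

```python
from typing import List, Optional, Tuple


def group_entities(tagged: List[Tuple[str, str]]) -> List[Tuple[str, str]]:
    """Merge B-/I- tagged tokens into (entity_text, entity_class) pairs.

    Hypothetical helper: the (token, tag) input shape is an assumption,
    not the documented CTexTools output format.
    """
    entities: List[Tuple[str, str]] = []
    current_tokens: List[str] = []
    current_class: Optional[str] = None

    def flush() -> None:
        # Close the entity that is currently open, if any.
        nonlocal current_tokens, current_class
        if current_tokens and current_class is not None:
            entities.append((" ".join(current_tokens), current_class))
        current_tokens, current_class = [], None

    for token, tag in tagged:
        if tag.startswith("B-"):
            # A B- tag always starts a new entity.
            flush()
            current_tokens, current_class = [token], tag[2:]
        elif tag.startswith("I-") and current_class == tag[2:]:
            # An I- tag of the same class continues the open entity.
            current_tokens.append(token)
        else:
            # OUT, or a stray I- tag that continues nothing: close the
            # open entity and skip this token.
            flush()
    flush()
    return entities


# Made-up sample data for illustration only.
sample = [
    ("Martin", "B-PER"), ("Puttkammer", "I-PER"),
    ("works", "OUT"), ("at", "OUT"),
    ("North-West", "B-ORG"), ("University", "I-ORG"),
    ("in", "OUT"), ("Potchefstroom", "B-LOC"),
]
print(group_entities(sample))
```

Stray I- tags are dropped here for simplicity; a stricter reader might instead treat them as the start of a new entity.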
Files
Original bundle
- Name: Install_CTexTools2.zip
- Size: 252.61 MB
- Format: ZIP is an archive file format that supports lossless data compression. A ZIP file may contain one or more files or directories that may have been compressed.
License bundle
- Name: license.txt
- Size: 1.71 KB
- Format: Item-specific license agreed to upon submission