NCHLT Tagger
License agreement
By downloading this resource I accept and agree to the terms of use and the associated license conditions under which the resource is distributed.
Download
MD5: de10b62048ef294621e22a12c0b647e4
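The MD5 value above can be used to check that the download arrived intact. The following is a minimal sketch, assuming the checksum applies to the downloaded archive as a whole. The function names and the example path are hypothetical and are not part of the NCHLT Tagger distribution.

```python
import hashlib

# Checksum as published on this page.
EXPECTED_MD5 = "de10b62048ef294621e22a12c0b647e4"


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hexadecimal MD5 digest of the file at `path`."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        # Read in chunks so large archives do not need to fit in memory.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_expected(path, expected=EXPECTED_MD5):
    """True if the file's MD5 digest equals the expected value."""
    return md5_of_file(path) == expected.lower()
```

For example, `matches_expected("downloaded_archive.zip")` (a placeholder file name) would return `True` only if the file's digest equals the published value.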
Collections
- Resource Catalogue [350]
- Resource Index [412]
Author(s)
Roald Eiselen
Metadata
Description
A graphical user interface and command line tool to automatically annotate running text with one or more linguistic tags:
* Part of Speech
* Named entity type
* Phrase chunks
Available for the following languages:
Afrikaans
English
isiNdebele
isiXhosa
isiZulu
Sesotho sa Leboa (Sepedi)
Setswana
Sesotho (Southern Sotho)
Siswati
Tshivenda
Xitsonga
Available in the Readme.txt:
Input format: UTF-8 text file containing running text.
Output file format: a tab-delimited text file containing each token followed by its assigned class.
Output classes for named entity recognition:
* B-/I-ORG Organisation
* B-/I-PER Person
* B-/I-LOC Location
* B-/I-MISC Miscellaneous
* OUT Outside
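The output format described above could be read back with a few lines of Python. This is a minimal sketch, not code shipped with the tool. It assumes one token per line, a single tab between token and class, and tags spelled as `B-PER`, `I-PER`, `OUT` and so on. The function names and the sample lines are invented for illustration.

```python
def read_tagged(lines):
    """Parse 'token<TAB>class' lines into a list of (token, tag) pairs."""
    pairs = []
    for line in lines:
        line = line.rstrip("\r\n")
        if not line.strip():
            continue  # assumption: blank lines carry no token
        token, _, tag = line.partition("\t")
        pairs.append((token, tag.strip()))
    return pairs


def group_entities(pairs):
    """Merge consecutive B-/I- tags of one type into (text, type) spans."""
    entities = []
    tokens, etype = [], None

    def flush():
        if tokens:
            entities.append((" ".join(tokens), etype))

    for token, tag in pairs:
        prefix, _, label = tag.partition("-")
        if prefix == "B" and label:
            # A B- tag always opens a new entity.
            flush()
            tokens, etype = [token], label
        elif prefix == "I" and label:
            if label == etype:
                tokens.append(token)
            else:
                # Stray I- tag: treated leniently as the start of an entity.
                flush()
                tokens, etype = [token], label
        else:
            # OUT (or anything unrecognised) closes any open entity.
            flush()
            tokens, etype = [], None
    flush()
    return entities


# Invented sample in the assumed output format.
sample = [
    "Nelson\tB-PER",
    "Mandela\tI-PER",
    "visited\tOUT",
    "Pretoria\tB-LOC",
    ".\tOUT",
]
entities = group_entities(read_tagged(sample))
```

With a real output file, the same functions could be applied to `open(path, encoding="utf-8")` in place of the `sample` list. Part-of-speech and phrase-chunk output would need only `read_tagged`, since grouping by B-/I- prefixes is specific to the named entity classes listed above.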
Contact person
Martin Puttkammer
Contact person's e-mail address
Martin.Puttkammer@nwu.ac.za
Publisher(s)
North-West University
Centre for Text Technology (CTexT)