Repository logoRepository logo
 

NCHLT Tagger

dc.contact.emailMartin.Puttkammer@nwu.ac.za
dc.contact.nameMartin Puttkammer
dc.contributor.authorRoald Eiselen
dc.date.accessioned2018-02-05T20:22:39Z
dc.date.accessioned2018-03-05T17:47:47Z
dc.date.available2018-02-05T20:22:39Z
dc.date.available2018-03-05T17:47:47Z
dc.date.issued2016-04-29
dc.descriptionA graphical user interface and command line tool to automatically annotate running text with one or more linguistic tags:\n* Part of Speech\n* Named entity type\n* Phrase chunks\n\nAvailable for the following languages:\nAfrikaans\nEnglish\nisiNdebele\nisiXhosa\nisiZulu\nSesotho sa Leboa (Sepedi)\nSetswana\n Sesotho (Southern Sotho)\nSiswati\nTshivenda\nXitsonga\n\n Available in the Readme.txt - Input format: Utf8 text file containing running text. Output file format: The output file is a tab-delimited text file containing each token followed by its the assigned class. Output classes for named entity recognition: B-/I-ORG Organisation, B-/I-PER Person, B-/I-LOC Location, B-/I-MISC Miscellaneous, OUT Outside
dc.format.extent83.99 MB (zipped)
dc.identifier.urihttps://hdl.handle.net/20.500.12185/351
dc.language.isoafr
dc.language.isonbl
dc.language.isoxho
dc.language.isozul
dc.language.isosot
dc.language.isonso
dc.language.isotsn
dc.language.isossw
dc.language.isoven
dc.language.isotso
dc.languagesAfrikaans
dc.languagesisiNdebele
dc.languagesisiXhosa
dc.languagesisiZulu
dc.languagesSesotho sa Leboa (Sepedi)
dc.languagesSetswana
dc.languagesSesotho
dc.languagesSiswati
dc.languagesTshivenda
dc.languagesXitsonga
dc.media.categoryPOS tagger
dc.media.typeText
dc.projectNCHLT Text II
dc.publisherNorth-West University
dc.publisherCentre for Text Technology (CTexT)
dc.rights.licenseCreative Commons Attribution 2.5 South Africa License: http://creativecommons.org/licenses/by/2.5/za/legalcode
dc.software.requirementsMicrosoft .Net 4.5
dc.titleNCHLT Tagger
dc.typeTools
dc.version1
local.collection.primaryResource Catalogue
local.collection.secondaryResource Index

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
nchlt_tagger.zip
Size:
84.12 MB
Format:
ZIP is an archive file format that supports lossless data compression. A ZIP file may contain one or more files or directories that may have been compressed.