Repository logoRepository logo
 

NCHLT Part of Speech Taggers

dc.contact.emailMartin.Puttkammer@nwu.ac.za
dc.contact.email
dc.contact.nameMartin Puttkammer
dc.contributor.authorMartin Puttkammer
dc.contributor.authorMartin Schlemmer
dc.date.accessioned2018-02-05T20:25:58Z
dc.date.accessioned2018-03-05T17:46:35Z
dc.date.available2018-02-05T20:25:58Z
dc.date.available2018-03-05T17:46:35Z
dc.date.issued2014-05-30
dc.descriptionPart of speech taggers developed during the NCHLT Text project. Available for the following languages: Afrikaans, English, isiNdebele, isiXhosa, isiZulu, Sesotho sa Leboa (Sepedi), Setswana, Sesotho (Southern Sotho), Siswati, Tshivenda, and Xitsonga.
dc.identifier.citationEiselen, E.R. & Puttkammer, M.J. 2014. Developing text resources for ten South African languages. (In Proceedings of the 9th International Conference on Language Resources and Evaluation, Reykjavik, Iceland. p. 3698-3703)
dc.identifier.urihttps://hdl.handle.net/20.500.12185/323
dc.language.isoafr
dc.language.isonbl
dc.language.isoxho
dc.language.isozul
dc.language.isosot
dc.language.isonso
dc.language.isotsn
dc.language.isossw
dc.language.isoven
dc.language.isotso
dc.languagesAfrikaans
dc.languagesisiNdebele
dc.languagesisiXhosa
dc.languagesisiZulu
dc.languagesSesotho sa Leboa (Sepedi)
dc.languagesSetswana
dc.languagesSesotho
dc.languagesSiswati
dc.languagesTshivenda
dc.languagesXitsonga
dc.media.categoryPOS tagger
dc.media.typeText
dc.projectNCHLT Text
dc.publisherNorth-West University
dc.publisherCentre for Text Technology (CTexT)
dc.rights.licenseCreative Commons Attribution 2.5 South Africa License: http://creativecommons.org/licenses/by/2.5/za/legalcode
dc.software.requirementshunpos (http://code.google.com/p/hunpos/) required to build own models (optional) and for use in Linux.
dc.subjectpart of speech
dc.subjectpos tagger
dc.subjectAfrikaans
dc.subjectisiNdebele
dc.subjectisiXhosa
dc.subjectisiZulu
dc.subjectSiswati
dc.subjectSetswana
dc.subjectSesotho sa Leboa
dc.subjectTshivenda
dc.subjectXitsonga
dc.subjectSesoth
dc.titleNCHLT Part of Speech Taggers
dc.typeModules
dc.version1
local.collection.primaryResource Catalogue
local.collection.secondaryResource Index

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
pos-taggers.nchlt.zip
Size:
20.49 MB
Format:
ZIP is an archive file format that supports lossless data compression. A ZIP file may contain one or more files or directories that may have been compressed.