Show simple item record

Tagger Parameter file for RF-Tagger (Schmid and Laws 2005)
The tagger parameter file is trained on an excerpt of the Pretoria Sepedi Corpus (D. Prinsloo, University of Pretoria): Here, about 5000 tokens were manually tagged and used for training the RF-Tagger (Helmut Schmid and Florian Laws: Estimation of Conditional Probabilities with Decision Trees and an Application to Fine-Grained POS Tagging, COLING 2008, Manchester, Great Britain). The tagger is freely available for academic purposes (see http://www.cis.uni-muenchen.de/~schmid/tools/RFTagger/). Methods and validation results can be found in: G. Faaß, U. Heid, E. Taljard, and D.J. Prinsloo. Part-of-Speech tagging in Northern Sotho: disambiguating polysemous function words. In Proceedings of the 1st Workshop on Language Technologies for African Languages - AfLaT 2009 at EACL, pages 38-45, Athens, Greece, 2009.
Gertrud Faass
gertrud.faass@uni-hildesheim.de
Institute for Information Science and Natural Language Processing, University of Hildesheim, Germany
by-nc-sa
Sesotho sa Leboa (Sepedi)
tagger parameter file; statistical tagging; RF-Tagger
https://hdl.handle.net/20.500.12185/483
Text
Data
Statistical language model
1
UTF8
unknown
Resource Index
nso
2019-02-01T05:49:35Z
2019-02-01T05:49:35Z
2018-02-21


Files in this item

FilesSizeFormatView

There are no files associated with this item.

This item appears in the following Collection(s)

  • Resource Index [411]
    A collection of language resource metadata mostly collected during the NHN funded technology audit of 2009, as well as the SADiLaR technology audit of 2018. Not all resources in this collection are available for download.

Show simple item record