DSI LogoSADiLaR Logo
Clarin-ZA Logo
View Item 
  •   SADiLaR
  • Language Resource Management Agency
  • Resource Index
  • View Item
  •   SADiLaR
  • Language Resource Management Agency
  • Resource Index
  • View Item
    • Login
    JavaScript is disabled for your browser. Some features of this site may not work without it.

    Search form

    Browse

    All of SADiLaR

    Communities & CollectionsTitleProjectMedia type

    This Collection

    TitleProjectMedia type

    Tagger Parameter file for RF-Tagger (Schmid and Laws 2005)

    Thumbnail
    URI
    https://hdl.handle.net/20.500.12185/483
    Collections
    • Resource Index [409]
    Metadata
    Show full item record
    Description
    The tagger parameter file is trained on an excerpt of the Pretoria Sepedi Corpus (D. Prinsloo, University of Pretoria): Here, about 5000 tokens were manually tagged and used for training the RF-Tagger (Helmut Schmid and Florian Laws: Estimation of Conditional Probabilities with Decision Trees and an Application to Fine-Grained POS Tagging, COLING 2008, Manchester, Great Britain). The tagger is freely available for academic purposes (see http://www.cis.uni-muenchen.de/~schmid/tools/RFTagger/). Methods and validation results can be found in: G. Faaß, U. Heid, E. Taljard, and D.J. Prinsloo. Part-of-Speech tagging in Northern Sotho: disambiguating polysemous function words. In Proceedings of the 1st Workshop on Language Technologies for African Languages - AfLaT 2009 at EACL, pages 38-45, Athens, Greece, 2009.
    Contact person
    Gertrud Faass
    Contact person's e-mail address
    gertrud.faass@uni-hildesheim.de
    Publisher(s)
    Institute for Information Science and Natural Language Processing, University of Hildesheim, Germany
    License
     

    Copyright © 2018  SADiLaR. All Rights Reserved.
    Contact Us | Send Feedback
     

     


    Copyright © 2018  SADiLaR. All Rights Reserved.
    Contact Us | Send Feedback