DSI LogoSADiLaR Logo
Clarin-ZA Logo
View Item 
  •   SADiLaR
  • Language Resource Management Agency
  • Resource Index
  • View Item
  •   SADiLaR
  • Language Resource Management Agency
  • Resource Index
  • View Item
    • Login
    JavaScript is disabled for your browser. Some features of this site may not work without it.

    Search form

    Browse

    All of SADiLaR

    Communities & CollectionsTitleProjectMedia type

    This Collection

    TitleProjectMedia type

    Habakuk

    Thumbnail
    URI
    https://hdl.handle.net/20.500.12185/147
    Collections
    • Resource Index [409]
    Metadata
    Show full item record
    Description
    Habakuk is a rule-based hyphenator for Afrikaans, which can be implemented in any NLP system. It takes as input a string, and produces as output an analysed string, without any tags. For example, the string "hondehokdak" ('dog house roof') will be analysed as "hon-de-hok-dak", where the hyphen indicates syllable boundaries on an orthographic level. Habakuk consists of circa 1,010 rules implemented in Perl. These rules are based on the 2002 edition of Afrikaanse Woordelys en Spelreëls.
    Contact person
    Martin Puttkammer
    Contact person's e-mail address
    Martin.Puttkammer@nwu.ac.za
    Publisher(s)
    North-West University
    Centre for Text Technology (CTexT)

    Copyright © 2018  SADiLaR. All Rights Reserved.
    Contact Us | Send Feedback
     

     


    Copyright © 2018  SADiLaR. All Rights Reserved.
    Contact Us | Send Feedback