DSI LogoSADiLaR Logo
Clarin-ZA Logo
View Item 
  •   SADiLaR
  • Language Resource Management Agency
  • Resource Index
  • View Item
  •   SADiLaR
  • Language Resource Management Agency
  • Resource Index
  • View Item
    • Login
    JavaScript is disabled for your browser. Some features of this site may not work without it.

    Search form

    Browse

    All of SADiLaR

    Communities & CollectionsTitleProjectMedia type

    This Collection

    TitleProjectMedia type

    Calomo

    Thumbnail
    URI
    https://hdl.handle.net/20.500.12185/144
    Collections
    • Resource Index [409]
    Author(s)
    Menno van Zaanen
    Metadata
    Show full item record
    Description
    Calomo is a hyphenator for Afrikaans, which can be implemented in any NLP system. It takes as input a string, and produces as output an analysed string, without any tags. For example, the string "hondehokdak" ('dog house roof') will be analysed as "hon-de-hok-dak", where the hyphen indicates syllable boundaries on an orthographic level. Calomo is a C5 classifier, trained on data consisting of circa 40,000 words. The resulting decision tree and cases can be converted to C code by means of a script written by M.M. van Zaanen. This C code can then be implemented in any other system.
    Contact person
    Martin Puttkammer
    Contact person's e-mail address
    Martin.Puttkammer@nwu.ac.za
    Publisher(s)
    North-West University
    Centre for Text Technology (CTexT)

    Copyright © 2018  SADiLaR. All Rights Reserved.
    Contact Us | Send Feedback
     

     


    Copyright © 2018  SADiLaR. All Rights Reserved.
    Contact Us | Send Feedback