Habakuk

Date

2015-01-30

Publisher

North-West University
Centre for Text Technology (CTexT)

Description

Habakuk is a rule-based hyphenator for Afrikaans that can be integrated into any NLP system. It takes a string as input and returns an analysed string in which hyphens mark the syllable boundaries; no tags are added. For example, the string "hondehokdak" ('dog house roof') is analysed as "hon-de-hok-dak", where each hyphen indicates a syllable boundary at the orthographic level. Habakuk consists of approximately 1,010 rules implemented in Perl. These rules are based on the 2002 edition of the Afrikaanse Woordelys en Spelreëls.
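
To illustrate the string-in, hyphenated-string-out behaviour described above, here is a minimal sketch of a rule-based hyphenator. It is written in Python for brevity rather than Perl, and it is not Habakuk's code: the function name `hyphenate` and the two rewrite rules are assumptions chosen only so that the example word from the description works. The actual rule set is far larger and handles digraphs, diacritics, and exceptions that this toy ignores.

```python
import re

# Hypothetical character classes; real Afrikaans orthography also has
# diacritics (e.g. ë, ê) and digraphs that this sketch does not handle.
VOWEL = "[aeiou]"
CONS = "[bcdfghjklmnpqrstvwxyz]"

# Ordered rewrite rules (illustrative assumptions, not Habakuk's rules).
# Each rule inserts a hyphen after the captured group when the lookahead matches.
RULES = [
    # VC-CV: split between two consonants that sit between vowels.
    (re.compile(f"({VOWEL}{CONS})(?={CONS}{VOWEL})", re.IGNORECASE), r"\1-"),
    # V-CV: a single consonant between vowels goes with the following vowel.
    (re.compile(f"({VOWEL})(?={CONS}{VOWEL})", re.IGNORECASE), r"\1-"),
]


def hyphenate(word: str) -> str:
    """Apply each rule in order and return the word with hyphens inserted."""
    for pattern, replacement in RULES:
        word = pattern.sub(replacement, word)
    return word


if __name__ == "__main__":
    print(hyphenate("hondehokdak"))  # hon-de-hok-dak
```

Applying the rules in a fixed order keeps the behaviour predictable: the first rule handles consonant pairs, and the second rule handles the remaining single consonants between vowels.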

Verification status

Level 0