Habakuk
Metadata
Description
Habakuk is a rule-based hyphenator for Afrikaans that can be integrated into any NLP system. It takes a string as input and outputs the analysed string, without any tags. For example, the string "hondehokdak" ('dog house roof') is analysed as "hon-de-hok-dak", where each hyphen marks a syllable boundary at the orthographic level. Habakuk consists of approximately 1,010 rules implemented in Perl. These rules are based on the 2002 edition of the Afrikaanse Woordelys en Spelreëls.
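To illustrate the input and output format described above, here is a minimal sketch of a rule-based hyphenator. It is written in Python for illustration only and does not reproduce Habakuk's Perl code or any of its rules. The function name `toy_hyphenate`, the vowel list, and the two splitting rules are all hypothetical simplifications that will mis-hyphenate many real Afrikaans words.

```python
import re

# Assumption: a rough vowel inventory, not taken from Habakuk or the Woordelys.
VOWEL_RUN = re.compile(r"[aeiouyêëéèôöûüîï]+", re.IGNORECASE)


def toy_hyphenate(word: str) -> str:
    """Insert hyphens at guessed orthographic syllable boundaries.

    Two hypothetical toy rules, applied between consecutive vowel runs:
      1. One consonant between vowels goes with the following vowel (V-CV).
      2. A longer consonant cluster is split after its first consonant (VC-CV).
    """
    # Each maximal run of vowels is treated as one syllable nucleus.
    nuclei = [m.span() for m in VOWEL_RUN.finditer(word)]

    breaks = []
    for (_, prev_end), (next_start, _) in zip(nuclei, nuclei[1:]):
        cluster_len = next_start - prev_end
        # Rule 1 breaks before the single consonant; rule 2 after the first one.
        breaks.append(prev_end if cluster_len == 1 else prev_end + 1)

    # Cut the word at the collected break positions and join with hyphens.
    pieces = []
    start = 0
    for pos in breaks:
        pieces.append(word[start:pos])
        start = pos
    pieces.append(word[start:])
    return "-".join(pieces)


if __name__ == "__main__":
    print(toy_hyphenate("hondehokdak"))  # hon-de-hok-dak
```

These two rules happen to reproduce the example from the description, but a production hyphenator such as Habakuk needs a far larger rule set to handle digraphs, diacritics, compounds, and exceptions.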
Contact person
Martin Puttkammer
Contact person's e-mail address
Martin.Puttkammer@nwu.ac.za
Publisher(s)
North-West University
Centre for Text Technology (CTexT)