Title | Habakuk |
Description | Habakuk is a rule-based hyphenator for Afrikaans, which can be implemented in any NLP system. It takes as input a string, and produces as output an analysed string, without any tags. For example, the string "hondehokdak" ('dog house roof') will be analysed as "hon-de-hok-dak", where the hyphen indicates syllable boundaries on an orthographic level. Habakuk consists of circa 1,010 rules implemented in Perl. These rules are based on the 2002 edition of Afrikaanse Woordelys en Spelreëls. |
Contact name | Martin Puttkammer |
Contact email | Martin.Puttkammer@nwu.ac.za |
Publisher(s) | North-West University; Centre for Text Technology (CTexT) |
Language(s) | Afrikaans |
URI | https://hdl.handle.net/20.500.12185/147 |
Media type | Text |
Type | Modules |
Media category | Compound Analyser |
Version | N/A |
Primary collection | Resource Index |
ISO639 code | afr |
Submit date | 2018-02-05T07:33:07Z; 2018-03-05T14:58:07Z |
Date available | 2018-02-05T07:33:07Z; 2018-03-05T14:58:07Z |
Date created | 2015-01-30 |