Show simple item record

Habakuk
Habakuk is a rule-based hyphenator for Afrikaans, which can be implemented in any NLP system. It takes as input a string, and produces as output an analysed string, without any tags. For example, the string "hondehokdak" ('dog house roof') will be analysed as "hon-de-hok-dak", where the hyphen indicates syllable boundaries on an orthographic level. Habakuk consists of circa 1,010 rules implemented in Perl. These rules are based on the 2002 edition of Afrikaanse Woordelys en Spelreëls.
Martin Puttkammer
Martin.Puttkammer@nwu.ac.za
North-West University; Centre for Text Technology (CTexT)
Afrikaans
https://hdl.handle.net/20.500.12185/147
Text
Modules
Compound Analyser
N/A
Resource Index
afr
2018-02-05T07:33:07Z; 2018-03-05T14:58:07Z
2018-02-05T07:33:07Z; 2018-03-05T14:58:07Z
2015-01-30


Files in this item

FilesSizeFormatView

There are no files associated with this item.

This item appears in the following Collection(s)

  • Resource Index [411]
    A collection of language resource metadata mostly collected during the NHN funded technology audit of 2009, as well as the SADiLaR technology audit of 2018. Not all resources in this collection are available for download.

Show simple item record