Show simple item record

Hyphenator 1.0.
Rule-based hyphenator which can be implemented in any NLP system. It takes as input a string, and produces as output an analysed string, without any tags. For example, the string "hondehokdak" ('dog house roof') will be analysed as "hon-de-hok-dak", where the hyphen indicates syllable boundaries on an orthographic level. Rules implemented in Perl.
Martin Puttkammer
North-West University; Centre for Text Technology (CTexT)
isiXhosa; isiZulu; Sesotho sa Leboa (Sepedi); Setswana
Monolingual text corpora: Unannotated
Resource Index
xho; zul; nso; tsn
2018-02-05T07:33:08Z; 2018-03-05T14:58:08Z
2018-02-05T07:33:08Z; 2018-03-05T14:58:08Z

Files in this item


There are no files associated with this item.

This item appears in the following Collection(s)

  • Resource Index [410]
    A collection of language resource metadata mostly collected during the NHN funded technology audit of 2009, as well as the SADiLaR technology audit of 2018. Not all resources in this collection are available for download.

Show simple item record