Title | Calomo |
Description | Calomo is a hyphenator for Afrikaans, which can be implemented in any NLP system. It takes as input a string, and produces as output an analysed string, without any tags. For example, the string "hondehokdak" ('dog house roof') will be analysed as "hon-de-hok-dak", where the hyphen indicates syllable boundaries on an orthographic level. Calomo is a C5 classifier, trained on data consisting of circa 40,000 words. The resulting decision tree and cases can be converted to C code by means of a script written by M.M. van Zaanen. This C code can then be implemented in any other system. |
Contact name | Martin Puttkammer |
Contact email | Martin.Puttkammer@nwu.ac.za |
Publisher(s) | North-West University; Centre for Text Technology (CTexT) |
Language(s) | Afrikaans |
Author(s) | Menno van Zaanen |
URI | https://hdl.handle.net/20.500.12185/144 |
Media type | Text |
Type | Modules |
Media category | Hyphenator |
Primary collection | Resource Index |
ISO639 code | afr |
Submit date | 2018-02-05T07:33:07Z; 2018-03-05T14:58:06Z |
Date available | 2018-02-05T07:33:07Z; 2018-03-05T14:58:06Z |
Date created | 2015-01-30 |