Calomo
Please do not copy the URL from the browser for citation. The correct URL is 'https://hdl.handle.net/20.500.12185/144'
dc.contact.email | Martin.Puttkammer@nwu.ac.za | |
dc.contact.name | Martin Puttkammer | |
dc.contributor.author | Menno van Zaanen | |
dc.date.accessioned | 2018-02-05T07:33:07Z | |
dc.date.accessioned | 2018-03-05T14:58:06Z | |
dc.date.available | 2018-02-05T07:33:07Z | |
dc.date.available | 2018-03-05T14:58:06Z | |
dc.date.issued | 2015-01-30 | |
dc.description | Calomo is a hyphenator for Afrikaans, which can be implemented in any NLP system. It takes as input a string, and produces as output an analysed string, without any tags. For example, the string "hondehokdak" ('dog house roof') will be analysed as "hon-de-hok-dak", where the hyphen indicates syllable boundaries on an orthographic level. Calomo is a C5 classifier, trained on data consisting of circa 40,000 words. The resulting decision tree and cases can be converted to C code by means of a script written by M.M. van Zaanen. This C code can then be implemented in any other system. | |
dc.identifier.uri | https://hdl.handle.net/20.500.12185/144 | |
dc.language.iso | afr | |
dc.languages | Afrikaans | |
dc.media.category | Hyphenator | |
dc.media.type | Text | |
dc.publisher | North-West University | |
dc.publisher | Centre for Text Technology (CTexT) | |
dc.title | Calomo | |
dc.type | Modules | |
local.collection.primary | Resource Index |