Repository logoRepository logo
 

CKarma

dc.contact.emailMartin.Puttkammer@nwu.ac.za
dc.contact.nameMartin Puttkammer
dc.date.accessioned2018-02-05T07:33:07Z
dc.date.accessioned2018-03-05T14:58:06Z
dc.date.available2018-02-05T07:33:07Z
dc.date.available2018-03-05T14:58:06Z
dc.date.issued2015-01-30
dc.descriptionCKarma is a compound analyser for Afrikaans, to be used for the detection of word boundaries within compounds. It takes as input a string, and produces as output an analysed string, without any tags. For example, the string "hondehokdak" ('dog house roof') will be analysed as "hond _ e + hok + dak", where the plus sign indicates the beginning of an independent constituent, and the underscore the beginning of a dependent constituent (i.e. a valence morpheme). CKarma is a C5 classifier, trained on data consisting of circa 47,000 compound and 7,000 non-compounds. The resulting decision tree and cases can be converted to C code by means of a script written by MM van Zaanen. This C code can then be implemented in any other system.
dc.identifier.urihttps://hdl.handle.net/20.500.12185/145
dc.language.isoafr
dc.languagesAfrikaans
dc.media.categoryCompound Analyser
dc.media.typeText
dc.publisherNorth-West University
dc.publisherCentre for Text Technology (CTexT)
dc.titleCKarma
dc.typeModules
dc.versionN/A
local.collection.primaryResource Index

Files

Collections