Show simple item record

Ragel was developed by using traditional methods for stemming/lemmatisation (i.e. affix stripping), and consists of language-specific rules for identifying word-forms in the lexicon of the spelling checker. However, Ragel cannot be considered either a "pure" lemmatiser or a "pure" stemmer in the true sense of the word, since it was developed specifically for purposes of spelling checking. In this sense, both derived and inflected word-forms that are not in the lexicon of the spelling checker are analysed by Ragel, only until a word in the lexicon is found, whether that word is a lemma or not (e.g. "ontbossing" 'deforestation' will be analysed as "ontbos" 'deforest' and not necessarily as "bos" 'forest'). Moreover, Ragel does not cover the full complexity of Afrikaans morphology, but only a subset that was considered (a) highly productive; and (b) unproblematic in terms of morphonological changes.  Ragel has been implemented in Afrikaanse Speltoetser 3.0, Tesourus 1.0 en Woordafbreker.
Roald Eiselen
North-West University; Centre for Text Technology (CTexT)
Resource Index
2018-02-05T07:33:08Z; 2018-03-05T14:58:07Z
2018-02-05T07:33:08Z; 2018-03-05T14:58:07Z

Files in this item


There are no files associated with this item.

This item appears in the following Collection(s)

  • Resource Index [375]
    A collection of language resource metadata mostly collected during the NHN funded technology audit of 2009, as well as the SADiLaR technology audit of 2018. Not all resources in this collection are available for download.

Show simple item record