TurboAnnotate 1.0

Date

2013-07-01

Publisher

Centre for Text Technology (CTexT)

Description

TurboAnnotate is a user-friendly annotation environment (i.e. a tool) for bootstrapping linguistic data for machine-learning purposes, or for manually creating gold standards or other annotated lists. This first version of TurboAnnotate was developed with the specific task of hyphenation for South African languages in mind. In the annotation GUI, the annotator simply drags the mouse over the part of the word to be annotated; when the mouse button is released, the selection changes colour. The machine-learning component is the well-known Tilburg Memory-Based Learner (TiMBL; Daelemans et al., 2004). Van Huyssteen & Puttkammer (2007) report that TurboAnnotate could not only ensure higher accuracy in human annotations, but could also reduce the human effort required (at least in the case of Afrikaans). Work on TurboAnnotate continues.

Verification status

Level 0