Search
Now showing items 231-240 of 240
Afrikaans text unit identification data
(Centre for Text Technology, North-West University, 2006) ~ - Resource Catalogue
This dataset was developed during a masters degree and used in the development of a text unit identifier capable of tagging sentences, named-entities, ...
Setswana Test suite and Treebank
(North-West University, 2018-03-27) ~ - Resource Catalogue
The main aim of the PhD study "A computational syntactic analysis of Setswana"(AS Berg, May 2018) is the computational syntactic analysis of the Setswana ...
Afrikaans lexical blends dataset
(North-West University, 2023-12)
This a dataset of Afrikaans blend constructions that have been collected and analysed using the Levenshtein distance metric. This dataset serves as the ...
Afribooms Afrikaans Dependency Treebank
(North-West University; Centre for Text Technology (CTexT); Katholieke Universiteit Leuven (Belgium), 2015-02-10) ~ - Resource Catalogue
This is the annotated corpus developed for Afrikaans for the Afribooms project. The corpus includes annotations for lemma, part-of-speech (POS) and ...
NWU TransTips 1.0
(Centre for Text Technology (CTexT), 2013-07-01) ~ - Resource Index
TransTips is a PHP programming script that browses a web page for terms in the database. The translation of words contained in the database is linked ...
CTexT Alignment Interface Pro
(North-West University; Centre for Text Technology (CTexT), 2013-06-21) ~ - Resource Catalogue
Utility application for the manual alignment of source texts. Pro version allows for the editing of the segments.
CTexT Alignment Interface
(North-West University; Centre for Text Technology (CTexT), 2013-06-21) ~ - Resource Catalogue
Utility application for the manual alignment of source texts.
Afrikaans TnT-Tagger
(North-West University; Centre for Text Technology (CTexT), 2015-01-30) ~ - Resource Index
The Afrikaans TnT-tagger is a part of speech tagger that can be used to add part of speech tags to Afrikaans texts.The tagger is an Afrikaans version ...
TurboAnnotate1.0
(Centre for Text Technology (CTexT), 2013-07-01) ~ - Resource Index
TurboAnnotate is a user-friendly annotating environment (i.e. tool) for bootstrapping linguistic data for machine-learning purposes, or for manually ...
CTexTools
(North-West University; Centre for Text Technology (CTexT), 2014-05-30) ~ - Resource Catalogue
Corpus query and manipulation tool for performing tokenisation and sentencisation; extracting frequency list and word list; searching; and extracting ...