Search
Now showing items 231-238 of 238
Afrikaans text unit identification data
(Centre for Text Technology, North-West University, 2006) ~ - Resource Catalogue
This dataset was developed during a masters degree and used in the development of a text unit identifier capable of tagging sentences, named-entities, ...
Setswana Test suite and Treebank
(North-West University, 2018-03-27) ~ - Resource Catalogue
The main aim of the PhD study "A computational syntactic analysis of Setswana"(AS Berg, May 2018) is the computational syntactic analysis of the Setswana ...
CTexT Afrikaans fastText CBoW String Embeddings
(Centre for Text Technology (CTexT), 2022-01-10)
The CTexT Afrikaans fastText CBoW String Embeddings is a 300 dimensional Afrikaans embedding model based on the Contunious Bag of Words fastText ...
Afribooms Afrikaans Dependency Treebank
(North-West University; Centre for Text Technology (CTexT); Katholieke Universiteit Leuven (Belgium), 2015-02-10) ~ - Resource Catalogue
This is the annotated corpus developed for Afrikaans for the Afribooms project. The corpus includes annotations for lemma, part-of-speech (POS) and ...
CTexT Alignment Interface Pro
(North-West University; Centre for Text Technology (CTexT), 2013-06-21) ~ - Resource Catalogue
Utility application for the manual alignment of source texts. Pro version allows for the editing of the segments.
CTexT Alignment Interface
(North-West University; Centre for Text Technology (CTexT), 2013-06-21) ~ - Resource Catalogue
Utility application for the manual alignment of source texts.
CTexTools
(North-West University; Centre for Text Technology (CTexT), 2014-05-30) ~ - Resource Catalogue
Corpus query and manipulation tool for performing tokenisation and sentencisation; extracting frequency list and word list; searching; and extracting ...
African Wordnet version 1.0
(UNISA, 2022-09-20)
Developed using the expand model with Princeton WordNet 3.1 as basis.
Please see https://africanwordnet.wordpress.com/ for all details on the project. ...