Search
Now showing items 101-110 of 240
NCHLT Sesotho Text Corpora
(North-West University; Centre for Text Technology (CTexT), 2014-05-30) ~ - Resource Catalogue
Collection of source text documents, genre classified text documents, raw corpus, clean corpus, lexicon, frequency list and named-entity lists developed ...
Afrikaans Custom Dictionary for Government Domain
(North-West University; Centre for Text Technology (CTexT), 2013-02-22) ~ - Resource Catalogue
Word list developed as a custom dictionary for use in the spelling checkers as part of the spelling checker project for the Department of Arts and ...
WAT A-Q
(WAT (Afrikaans NLU), 2013-07-01) ~ - Resource Index
Die Elektroniese WAT A-Q is a comprehensive monolingual Afrikaans dictionary, distributed together with Woordkeusegids - 'n Kerntesourus van Afrikaans ...
NCHLT Afrikaans Named Entity Annotated Corpus
(North-West University; Centre for Text Technology (CTexT), 2016-04-29) ~ - Resource Catalogue
Named entity annotated data from the NCHLT Text Resource Development: Phase II Project, annotated with PERSON, LOCATION, ORGANISATION and MISCELLANEOUS tags.
Sepedi Custom Dictionary for Government Domain
(North-West University; Centre for Text Technology (CTexT), 2013-02-22) ~ - Resource Catalogue
Word list developed as a custom dictionary for use in the spelling checkers as part of the spelling checker project for the Department of Arts and ...
Translate.org.za isiZulu - isiXhosa Corpus 2012
(Translate.org.za, 2013-06-19) ~ - Resource Catalogue
isiZulu-isiXhosa translation memory.
Tshivenda Spelling Checker 1.0
(Centre for Text Technology (CTexT), 2013-07-01) ~ - Resource Index
Spelling checkers and hyphenators for South African languages compatible with Microsoft® Office 2000, XP, 2003, 2007, 2010 or Microsoft® Office 2013. ...
SAE Newspaper Text Corpus
(Stellenbosch University, 2015-01-27) ~ - Resource Index
Newspaper text in electronic format obtained from Avusa Media through a licensing agreement (renewed anually).
NCHLT isiNdebele Annotated Text Corpora
(North-West University; Centre for Text Technology (CTexT), 2014-05-30) ~ - Resource Catalogue
Lemmatised, part of speech tagged and morphologically analysed corpora developed during the NCHLT Text project.
Tshivenda Custom Dictionary for Government Domain
(North-West University; Centre for Text Technology (CTexT), 2013-02-22) ~ - Resource Catalogue
Word list developed as a custom dictionary for use in the spelling checkers as part of the spelling checker project for the Department of Arts and ...