NCHLT South African Language Identifier
(North-West University; Centre for Text Technology (CTexT), 2016-04-29) ~ - Resource Catalogue
A graphical user interface and command line tool to automatically classify a document, paragraph, sentence or phrase as one of the eleven official South ...
NCHLT Afrikaans Lemmatiser
(North-West University; Centre for Text Technology (CTexT), 2014-05-30) ~ - Resource Catalogue
Lemmatiser developed during the NCHLT Text project.
Available in the Readme.txt - Input format: Text data (encoding: UTF8 without BOM), one lowercase ...
Homophone Disambiguator 1.0.
(North-West University; Centre for Text Technology (CTexT), 2015-01-30) ~ - Resource Index
Developed to disambiguate Afrikaans homophones.
CGE's Sesotho Gender Terminology List
(Commission for Gender Equality (CGE), 2018)
CGE's Sesotho Gender Terminology List is a list of terms, either words or phrases, related to the promotion of gender equality. All 446 words or phrases ...
Afrikaans Pronunciation Dictionary
(Meraka Institute, CSIR; Stellenbosch University, 2015-01-28) ~ - Resource Index
Pronunciation dictionary compiled from Lwazi ASR dictionary and the "Taalkommissie Korpus". Dictionary to be used for the development of a large vocabulary ...
Autshumato TMS
(North-West University; Centre for Text Technology (CTexT), 2013-06-19) ~ - Resource Catalogue
Terminology Management System. Web application used by Terminologists and Administrators to capture, edit and export terminology.
Autshumato PDF Text Extractor
(North-West University; Centre for Text Technology (CTexT), 2013-06-20) ~ - Resource Catalogue
Utility application for extracting text out of a PDF document. The pages can also be extracted as images.
Sesotho Genre Classification Corpus
(Trifonius, 2013-06-19) ~ - Resource Catalogue
Contains training and testing data for Genre Classification for Sesotho.
Pharos 5-in-1 Dictionaries
(Pharos Dictionaries, 2013-07-01) ~ - Resource Index
Collection of three bilingual (Afr/Eng) dictionaries and two monolingual (Afr) dictionaries.
NCHLT Afrikaans Text Corpora
(North-West University; Centre for Text Technology (CTexT), 2014-05-30) ~ - Resource Catalogue
Collection of source text documents, genre classified text documents, raw corpus, clean corpus, lexicon, frequency list and named-entity lists developed ...