Browsing Resource Index by Title
Filter by:
Now showing items 80-99 of 411
-
Autshumato Sesotho sa Leboa-English Translation Memory
(North-West University; Centre for Text Technology (CTexT), 2013-06-19) ~Resource Catalogue Translation memory from Sesotho sa Leboa to English (EN-GB), in the government domain for use in the Autshumato ITE application. -
Autshumato Setswana Monolingual Corpora
(North-West University; Centre for Text Technology (CTexT), 2016-10-28) ~Resource Catalogue Setswana monolingual corpus as a deliverable of the Autshumato project. The data is given as a UTF-8 text file; with each sentence on a new line. -
Autshumato Text Anonymiser
(North-West University; Centre for Text Technology (CTexT), 2013-06-20) ~Resource Catalogue Anonymises text by classifying and replacing sensitive information such as person names, business names, place names, monetary values, phone numbers, ... -
Autshumato TMS
(North-West University; Centre for Text Technology (CTexT), 2013-06-19) ~Resource Catalogue Terminology Management System. Web application used by Terminologists and Administrators to capture, edit and export terminology. -
Autshumato TMX Integrator
(North-West University; Centre for Text Technology (CTexT), 2013-06-20) ~Resource Catalogue Utility to merge multiple translation memories over a network using Subversion -
Autshumato Xitsonga Frequency Word List
(North-West University; Centre for Text Technology (CTexT), 2014-12-12) ~Resource Catalogue A list of the most frequent Xitsonga words as deliverable of the Autshumato project. -
Autshumato Xitsonga Monolingual Corpora
(North-West University; Centre for Text Technology (CTexT), 2014-12-12) ~Resource Catalogue Xitsonga monolingual corpus as deliverable of the Autshumato project. The data is given as a UTF-8 text file; with each sentence on a newline. -
Bilingual English-isiXhosa corpus
(North-West University - Centre for Text Technology (CTexT), 2019-11-30) ~Resource Catalogue Aligned parallel corpora for the following language pair: English-isiXhosa. The data is given as two separate UTF-8 text files, with each segment on a ... -
Bilingual English-Siswati Corpus
(North-West University - Centre for Text Technology (CTexT), 2022-03-31)Aligned parallel corpora for the following language pair: English-SiSwati. The data is given as four separate UTF-8 text files, with each segment on a ... -
Boomerang v1.0
(, 2013-07-01) ~Resource Index Performs outbound call campaigns and executes Javascript-scripted IVR callflows especially for marketing companies companies with extensive client ... -
Bukantswe Sesotho-English Bilingual Dictionary
(North-West University, 2016-07-07) ~Resource Catalogue Bilingual English-Sesotho dictionary. This dataset represents a basic Sesotho dictionary compiled in the creation of a Sesotho language resource. The ... -
Calomo
(North-West University; Centre for Text Technology (CTexT), 2015-01-30) ~Resource Index Calomo is a hyphenator for Afrikaans, which can be implemented in any NLP system. It takes as input a string, and produces as output an analysed string, ... -
CGE's Sesotho Gender Terminology List
(Commission for Gender Equality (CGE), 2018)CGE's Sesotho Gender Terminology List is a list of terms, either words or phrases, related to the promotion of gender equality. All 446 words or phrases ... -
CKarma
(North-West University; Centre for Text Technology (CTexT), 2015-01-30) ~Resource Index CKarma is a compound analyser for Afrikaans, to be used for the detection of word boundaries within compounds. It takes as input a string, and produces ... -
Combination Tagger
(North-West University; Centre for Text Technology (CTexT), 2015-01-30) ~Resource Index The combination tagger framework uses MBT, SVM, MXPOST and TnT. Each tagger receives a weight by which it can vote for a tag. -
CompanyCall v1.0
(, 2013-07-01) ~Resource Index Routes calls based on spoken company name, south african names, ses ASR name re ??, which domain) directory assistance, 08606companycall.co.za. -
CorpusCatcher
(Translate.org.za, 2015-01-28) ~Resource Index Corpus Catcher is a tool that is designed to crawl the web to retrieve data for inclusion in a corpus. It makes use of seed documents/wordlists to ... -
CTexT Afrikaanse Grammatikatoetser 1.0
(Centre for Text Technology (CTexT), 2013-07-01) ~Resource Index The CTexT Afrikaanse Grammatikatoetser is the first Afrikaans grammar checker for Microsoft Office, and can identify a number of grammatical and style errors. -
CTexT Afrikaanse Tesourus 1.0
(Centre for Text Technology (CTexT), 2013-07-01) ~Resource Index The CTexT Afrikaanse Thesaurus is based on the Groot Afrikaans Tesourus of De Stadler. -
CTexT Alignment Interface
(North-West University; Centre for Text Technology (CTexT), 2013-06-21) ~Resource Catalogue Utility application for the manual alignment of source texts.