Pharos Speltoetser en Woordafbreker
(Pharos Dictionaries, 2013-07-01) ~ - Resource Index
Corrects typing and spelling errors and hyphenates words correctly.
Autshumato English-isiZulu Parallel Corpora
(CTexT® (Centre for Text Technology, North-West University), 2022-09-30)
Aligned parallel corpora for the language pair English-isiZulu. The data is given as two separate UTF-8 text files, with each aligned segment on a ...
Autshumato English-Xitsonga Parallel Corpora
(CTexT® (Centre for Text Technology, North-West University), 2022-09-30)
Aligned parallel corpora for the language pair English-Xitsonga. The data is given as two separate UTF-8 text files, with each aligned segment on a ...
UNISA Multilingual Corpus
(University of South Africa, 2018-02-28) ~ - Resource Index
The resource comprises a diverse selection of TEI P5 marked-up documents of institutional origin, written in Tswana, Afrikaans and English. This is ...
NCHLT Tagger
(North-West University; Centre for Text Technology (CTexT), 2016-04-29) ~ - Resource Catalogue
A graphical user interface and command line tool to automatically annotate running text with one or more linguistic tags:
* Part of Speech
* Named ...
NCHLT isiXhosa Phrase Chunk Annotated Corpus
(North-West University; Centre for Text Technology (CTexT), 2016-04-29) ~ - Resource Catalogue
Phrase chunk annotated data for the NCHLT Text Resource Development: Phase II Project. The phrase chunk annotated data is a subset of the 50,000 tokens ...
NCHLT isiZulu Phrase Chunk Annotated Corpus
(North-West University; Centre for Text Technology (CTexT), 2016-04-29) ~ - Resource Catalogue
Phrase chunk annotated data for the NCHLT Text Resource Development: Phase II Project. The phrase chunk annotated data is a subset of the 50,000 tokens ...
Autshumato Xitsonga Frequency Word List
(North-West University; Centre for Text Technology (CTexT), 2014-12-12) ~ - Resource Catalogue
A list of the most frequent Xitsonga words, produced as a deliverable of the Autshumato project.
Afrikaans linking element dataset
(North-West University, 2019) ~ - Resource Catalogue
(Afrikaans follows English)
This data set was compiled for a study in which the possible semantic content of Afrikaans linking elements was investigated. ...
NCHLT South African Language Identifier
(North-West University; Centre for Text Technology (CTexT), 2016-04-29) ~ - Resource Catalogue
A graphical user interface and command line tool to automatically classify a document, paragraph, sentence or phrase as one of the eleven official South ...