Search
Now showing items 31-40 of 60
NCHLT isiNdebele Named Entity Annotated Corpus
(North-West University; Centre for Text Technology (CTexT), 2016-04-29) ~ - Resource Catalogue
Named entity annotated data from the NCHLT Text Resource Development: Phase II Project, annotated with PERSON, LOCATION, ORGANISATION and MISCELLANEOUS tags.
NCHLT isiNdebele Morphological Decomposer
(North-West University; Centre for Text Technology (CTexT), 2014-05-30) ~ - Resource Catalogue
Morphological decomposer developed during the NCHLT Text project.
Multilingual Life Orientation Intermediate Phase Terminology List
(Terminology Coordination Section of the National Language Service, Department of Arts and Culture, 2017-03-03) ~ - Resource Index
1628 English source terms with their equivalents in the ten other official South African languages. The terms were excerpted from life orientation ...
Multilingual Parliamentary / Political Terminology List
(Terminology Coordination Section of the National Language Service, Department of Arts and Culture, 2017-03-03) ~ - Resource Index
502 English source terms with their equivalents in the ten other official South African languages. The project built on a 2003 initiative of the national ...
NCHLT isiNdebele word2vec-CBOW embeddings
(North-West University; Centre for Text Technology (CTexT), 2023-05-01)
Static word embeddings for the continuous bag of words (CBoW) flavour of the word2vec (w2v) architecture (Mikolov et al., 2013). The embedding provides ...
NCHLT Text Web Services
(SADiLaR; North-West University, 2018-03-01) ~ - Resource Index
A web service that provides access to seven core technologies in ten South African languages, including:
* Tokenisers
* Sentence separators
* ...
Open Spell (v1.0)
(Meraka Institute, CSIR; TEIR; ICSI at University of California (Berkeley), 2013-07-01) ~ - Resource Index
Open Spell is spelling game that provides spelling exercises (in the language education domain) to teach spelling skills to schoolchildren between the ...
LID classifier
(CSIR, 2018-03-02) ~ - Resource Index
An LID service that allows for token level classification in the 11 official languages of South Africa
Multilingual Election Terminology List
(Terminology Coordination Section of the National Language Service, Department of Arts and Culture, 2017-02-15) ~ - Resource Index
559 English source terms with their equivalents in the ten other official South African languages. The list was compiled in collaboration with subject ...
Autshumato TMX Integrator
(North-West University; Centre for Text Technology (CTexT), 2013-06-20) ~ - Resource Catalogue
Utility to merge multiple translation memories over a network using Subversion