Search
Now showing items 241-250 of 345
Calomo
(North-West University; Centre for Text Technology (CTexT), 2015-01-30) ~ - Resource Index
Calomo is a hyphenator for Afrikaans, which can be implemented in any NLP system. It takes as input a string, and produces as output an analysed string, ...
NCHLT Text Web Services
(SADiLaR; North-West University, 2018-03-01) ~ - Resource Index
A web service that provides access to seven core technologies in ten South African languages, including:
* Tokenisers
* Sentence separators
* ...
Open Spell (v1.0)
(Meraka Institute, CSIR; TEIR; ICSI at University of California (Berkeley), 2013-07-01) ~ - Resource Index
Open Spell is spelling game that provides spelling exercises (in the language education domain) to teach spelling skills to schoolchildren between the ...
Xitsonga Genre Classification Corpus
(Trifonius, 2013-06-19) ~ - Resource Catalogue
Contains training and testing data for Genre Classification for Xitsonga.
NCHLT isiXhosa fastText-CBoW embeddings
(North-West University; Centre for Text Technology (CTexT), 2023-05-01)
Static word and subword embeddings for the continuous bag of words (CBoW) flavour of the fastText architecture (Bojanowski et al., 2017). The embedding ...
NCHLT Tshivenḓa FLAIR-forward embeddings
(North-West University; Centre for Text Technology (CTexT), 2023-05-01)
Contextual word/string embeddings for the forward flavour of the FLAIR architecture (Akbik et al., 2018). The embedding provides real-valued vector ...
MobiLex and SA Trilingual Wine Industry Dictionary
(Stellenbosch University; Winetech; SAWIS, 2018-02-28) ~ - Resource Index
Trilingual LSP dictionary on cellphone and website
Trilingual LSP dictionary on website
CTexT Afrikaanse Grammatikatoetser 1.0
(Centre for Text Technology (CTexT), 2013-07-01) ~ - Resource Index
The CTexT Afrikaanse Grammatikatoetser is the first Afrikaans grammar checker for Microsoft Office, and can identify a number of grammatical and style errors.
Autshumato English-Xitsonga Manually Translated Parallel Corpora
(North-West University; Centre for Text Technology (CTexT), 2014-12-12) ~ - Resource Catalogue
Aligned English-Xitsonga parallel corpus. The data is given as two seperate UTF-8 text files; with each segment on a newline.
African Wordnet: Tshivenda 1.0
(UNISA, 2017-06-20) ~ - Resource Catalogue
Developed using the expand model with Princeton WordNet 2.0 as basis. Each wordnet contains synsets with at least the following fields:\nWord form (lemma; ...