Now showing items 31-40 of 345
NCHLT isiZulu fastText-Skipgram embeddings
(North-West University; Centre for Text Technology (CTexT), 2023-05-01)
Static word and subword embeddings for the Skipgram flavour of the fastText architecture (Bojanowski et al., 2017). The embedding provides real-valued ...
Homophone Disambiguator 1.0
(North-West University; Centre for Text Technology (CTexT), 2015-01-30) - Resource Index
Developed to disambiguate Afrikaans homophones.
CGE's Sesotho Gender Terminology List
(Commission for Gender Equality (CGE), 2018)
CGE's Sesotho Gender Terminology List is a list of terms, either words or phrases, related to the promotion of gender equality. All 446 words or phrases ...
NCHLT Tshivenḓa fastText-CBoW embeddings
(North-West University; Centre for Text Technology (CTexT), 2023-05-01)
Static word and subword embeddings for the continuous bag of words (CBoW) flavour of the fastText architecture (Bojanowski et al., 2017). The embedding ...
Afrikaans Pronunciation Dictionary
(Meraka Institute, CSIR; Stellenbosch University, 2015-01-28) - Resource Index
Pronunciation dictionary compiled from the Lwazi ASR dictionary and the "Taalkommissie Korpus". Dictionary to be used for the development of a large vocabulary ...
Autshumato TMS
(North-West University; Centre for Text Technology (CTexT), 2013-06-19) - Resource Catalogue
Terminology Management System. Web application used by terminologists and administrators to capture, edit, and export terminology.
Autshumato PDF Text Extractor
(North-West University; Centre for Text Technology (CTexT), 2013-06-20) - Resource Catalogue
Utility application for extracting text from a PDF document. Pages can also be extracted as images.
Sesotho Genre Classification Corpus
(Trifonius, 2013-06-19) - Resource Catalogue
Contains training and testing data for Genre Classification for Sesotho.
NCHLT Afrikaans RoBERTa language model
(North-West University; Centre for Text Technology (CTexT), 2023-05-01)
Contextual masked language model based on the RoBERTa architecture (Liu et al., 2019). The model is trained as a masked language model and not fine-tuned ...
Pharos 5-in-1 Dictionaries
(Pharos Dictionaries, 2013-07-01) - Resource Index
Collection of three bilingual (Afr/Eng) dictionaries and two monolingual (Afr) dictionaries.