Search
Now showing items 21-30 of 104
Multilingual Mathematics Terminology List (Grade R - 6)
(Terminology Coordination Section of the National Language Service, Department of Arts and Culture, 2017-03-03) ~ - Resource Index
984 English source terms with their equivalents in the ten other official South African languages. The list was compiled in collaboration with subject ...
CTexTools
(North-West University; Centre for Text Technology (CTexT), 2014-05-30) ~ - Resource Catalogue
Corpus query and manipulation tool for performing tokenisation and sentencisation; extracting frequency list and word list; searching; and extracting ...
COVID-19 Multilingual Terminology
(City of Tshwane; South African Centre for Digital Language Resources (SADiLaR); Department of Science and Innovation; Pan South African Language Board (PanSALB), 2021-07)
COVID-19 multilingual terminology list document in all the South African languages. The development of this terminology list was initiated by City of ...
South African Multilingual Proper Names (Multipron) Corpus
(Molo Afrika Speech Technologies, 2013-10-03) ~ - Resource Catalogue
Audio, orthographic and auditory verified broad phonemic transcriptions of proper names in four languages, produced by speakers of the same four languages.
Lara2
(North-West University; Centre for Text Technology (CTexT), 2014-05-30) ~ - Resource Catalogue
Tool for annotating texts with lemma, part of speech and morphological analysis information
Lwazi Sesotho ASR corpus
(Meraka Institute, CSIR, 2013-04-02) ~ - Resource Catalogue
Complete audio recordings and orthographic transcriptions used for Lwazi speech recognition systems.
NCHLT Sesotho Annotated Text Corpora
(North-West University; Centre for Text Technology (CTexT), 2014-05-30) ~ - Resource Catalogue
Lemmatised, part of speech tagged and morphologically analysed corpora developed during the NCHLT Text project.
High quality TTS data for four South African languages (af, st, tn, xh)
(Google; North-West University, 2017) ~ - Resource Catalogue
This data set contains multi-speaker TTS high quality transcribed audio data for four languages of South Africa: Afrikaans, Sesotho, Setswana and isiXhosa. ...
Lwazi II Sesotho TTS Corpus
(Meraka Institute, CSIR; North-West University, 2015-11-20) ~ - Resource Catalogue
Orthographic and phonemically aligned transcriptions.
NCHLT Sesotho GloVe embeddings
(North-West University; Centre for Text Technology (CTexT), 2023-05-01)
Static word embedding model based on the Global Vectors architecture (Pennington et al., 2014). The embeddings provide real-valued vector representations ...