Browsing Language Resource Management Agency by Title
Filter by:
Now showing items 91-110 of 529
-
Bambara Monolingual Children First Language Acquisition (Babbling & First Words)
(Ibrahima Abdoul Hayou CISSE, 2010)Dataset contains videos of children interacting with caregivers. Languages included: Bambara/Bamanakan/Dioula/Mande -
Bilingual English-isiXhosa corpus
(North-West University - Centre for Text Technology (CTexT), 2019-11-30) ~Resource Catalogue Aligned parallel corpora for the following language pair: English-isiXhosa. The data is given as two separate UTF-8 text files, with each segment on a ... -
Bilingual English-Siswati Corpus
(North-West University - Centre for Text Technology (CTexT), 2022-03-31)Aligned parallel corpora for the following language pair: English-SiSwati. The data is given as four separate UTF-8 text files, with each segment on a ... -
Boomerang v1.0
(, 2013-07-01) ~Resource Index Performs outbound call campaigns and executes Javascript-scripted IVR callflows especially for marketing companies companies with extensive client ... -
Bukantswe Sesotho-English Bilingual Dictionary
(North-West University, 2016-07-07) ~Resource Catalogue Bilingual English-Sesotho dictionary. This dataset represents a basic Sesotho dictionary compiled in the creation of a Sesotho language resource. The ... -
Calomo
(North-West University; Centre for Text Technology (CTexT), 2015-01-30) ~Resource Index Calomo is a hyphenator for Afrikaans, which can be implemented in any NLP system. It takes as input a string, and produces as output an analysed string, ... -
CGE's Afrikaans Gender Terminology List
(Commission for Gender Equality (CGE), 2021-04)CGE's Afrikaans Gender Terminology List is a list of terms, either words or phrases, related to the promotion of gender equality. All 436 words or phrases ... -
CGE's Sesotho Gender Terminology List
(Commission for Gender Equality (CGE), 2018)CGE's Sesotho Gender Terminology List is a list of terms, either words or phrases, related to the promotion of gender equality. All 446 words or phrases ... -
CKarma
(North-West University; Centre for Text Technology (CTexT), 2015-01-30) ~Resource Index CKarma is a compound analyser for Afrikaans, to be used for the detection of word boundaries within compounds. It takes as input a string, and produces ... -
Code-switching among bilingual Afrikaans-Dutch children
(North-West University, 2021)To describe the code-switching in the speech of pre-school, bilingual Afrikaans-Dutch children, novel language data of eight preschool children between ... -
Combination Tagger
(North-West University; Centre for Text Technology (CTexT), 2015-01-30) ~Resource Index The combination tagger framework uses MBT, SVM, MXPOST and TnT. Each tagger receives a weight by which it can vote for a tag. -
CompanyCall v1.0
(, 2013-07-01) ~Resource Index Routes calls based on spoken company name, south african names, ses ASR name re ??, which domain) directory assistance, 08606companycall.co.za. -
Core technologies for conjunctively written South African languages
(North-West University, Centre for Language Technology (CTexT), 2021-03-31)During this SADiLaR funded project, enriched corpora for the four official South African languages with a conjunctive orthography, i.e. isiNdebele ... -
Corpus of multilingual code-switched soap opera speech
(Stellenbosch University, 2020-02-28)The corpus comprises 26.9 hours of annotated multilingual speech that contains examples of code-switching in isiZulu, isiXhosa, Setswana, Sesotho and ... -
CorpusCatcher
(Translate.org.za, 2015-01-28) ~Resource Index Corpus Catcher is a tool that is designed to crawl the web to retrieve data for inclusion in a corpus. It makes use of seed documents/wordlists to ... -
COVID-19 Multilingual Terminology
(City of Tshwane; South African Centre for Digital Language Resources (SADiLaR); Department of Science and Innovation; Pan South African Language Board (PanSALB), 2021-07)COVID-19 multilingual terminology list document in all the South African languages. The development of this terminology list was initiated by City of ... -
CTexT Afrikaans fastText CBoW String Embeddings
(Centre for Text Technology (CTexT), 2022-01-10)The CTexT Afrikaans fastText CBoW String Embeddings is a 300 dimensional Afrikaans embedding model based on the Contunious Bag of Words fastText ... -
CTexT Afrikaans FLAIR Named Entity Recognition model
(Centre for Text Technology (CTexT), 2022-01-10)The CTexT Afrikaans FLAIR Named Entity Recognition model is a neural NER model based on the FLAIR framework (Akbik et al. 2019), and includes Afrikaans ... -
CTexT Afrikaans FLAIR Part of Speech tagger model
(Centre for Text Technology (CTexT), 2022-01-10)The CTexT Afrikaans FLAIR Part of Speech tagger model is a neural part of speech tagger model based on the FLAIR framework (Akbik et al. 2019), and ... -
CTexT Afrikaans FLAIR String Embeddings
(Centre for Text Technology (CTexT), 2022-01-10)The CTexT Afrikaans FLAIR String Embeddings are two Afrikaans embedding models based on the FLAIR architecture (Akbik et al. 2018, 2019) that provides ...