Browsing Resource Catalogue by Title
Filter by:
Now showing items 64-83 of 350
-
CTexT Afrikaans FLAIR String Embeddings
(Centre for Text Technology (CTexT), 2022-01-10)The CTexT Afrikaans FLAIR String Embeddings are two Afrikaans embedding models based on the FLAIR architecture (Akbik et al. 2018, 2019) that provides ... -
CTexT Afrikaans GloVe Word Embeddings
(Centre for Text Technology (CTexT), 2022-01-10)The CTexT Afrikaans GloVe Word Embeddings is a 300 dimensional Afrikaans embedding model based on the Global Vectors architecture (Pennington, 2014) ... -
CTexT Alignment Interface
(North-West University; Centre for Text Technology (CTexT), 2013-06-21) ~Resource Catalogue Utility application for the manual alignment of source texts. -
CTexT Alignment Interface Pro
(North-West University; Centre for Text Technology (CTexT), 2013-06-21) ~Resource Catalogue Utility application for the manual alignment of source texts. Pro version allows for the editing of the segments. -
CTexT fastText Skipgram String Embeddings
(Centre for Text Technology (CTexT), 2022-01-10)The CTexT Afrikaans fastText Skipgram String Embeddings is a 300 dimensional Afrikaans embedding model based on the Skipgram fastText architecture that ... -
CTexTools
(North-West University; Centre for Text Technology (CTexT), 2014-05-30) ~Resource Catalogue Corpus query and manipulation tool for performing tokenisation and sentencisation; extracting frequency list and word list; searching; and extracting ... -
CTexTools 2
(North-West University, Centre for Text Technology (CTexT); South African Department of Arts and Culture, 2018-05-24) ~Resource Catalogue CTexTools is a corpus query and manipulation tool primarily for the official South African languages. The tool supports the creation of frequency and ... -
Denominal adjectives in Afrikaans dataset
(South African Centre for Digital Language Resources, 2020-05-15) ~Resource Catalogue This dataset contain a collection of Afrikaans denominal adjectives that were extracted from the Virtual Institute for Afrikaans' corpus portal. The ... -
DictionaryMaker
(Meraka Institute, CSIR, 2013-07-15) ~Resource Catalogue The purpose of the DictionaryMaker system is to facilitate the creation of an electronic pronunciation dictionary in a target language, as originally ... -
Ex Machina: Using NLP and statistical learning models to model eyewitness statements and choosing behaviour
(Sadilar, 2019-05-07)This curated database includes data from various of empirical studies where eyewitness statements and descriptions were collected. The original studies, ... -
Generic Bilingual Academic Wordlist with Definitions
(ICELDA; SADiLaR, 2021)The academic wordlist has been developed to serve as a resource to students to assist them to better understand words used within the information they ... -
Generic Multilingual Academic Wordlists with Definitions
(SADiLaR; ICELDA, 2022)This multilingual generic academic wordlist has been developed to serve as a resource to students to assist with building a vocabulary and decoding ... -
High quality TTS data for four South African languages (af, st, tn, xh)
(Google; North-West University, 2017) ~Resource Catalogue This data set contains multi-speaker TTS high quality transcribed audio data for four languages of South Africa: Afrikaans, Sesotho, Setswana and isiXhosa. ... -
Human Language Technology Audit 2017/18
(CSIR, 2018-08-31)This document reports on all work conducted in the 2017/18 Audit of human language technology (HLT) resources available in South Africa project. The ... -
isiNdebele Custom Dictionary for Government Domain
(North-West University; Centre for Text Technology (CTexT), 2013-02-22) ~Resource Catalogue Word list developed as a custom dictionary for use in the spelling checkers as part of the spelling checker project for the Department of Arts and ... -
isiNdebele Genre Classification Corpus
(Trifonius, 2013-06-19) ~Resource Catalogue Contains training and testing data for Genre Classification for isiNdebele. -
isiXhosa Custom Dictionary for Government Domain
(North-West University; Centre for Text Technology (CTexT), 2013-02-22) ~Resource Catalogue Word list developed as a custom dictionary for use in the spelling checkers as part of the spelling checker project for the Department of Arts and ... -
isiXhosa Genre Classification Corpus
(Trifonius, 2013-06-19) ~Resource Catalogue Contains training and testing data for Genre Classification for isiXhosa. -
isiZulu Custom Dictionary for Government Domain
(North-West University; Centre for Text Technology (CTexT), 2013-02-22) ~Resource Catalogue Word list developed as a custom dictionary for use in the spelling checkers as part of the spelling checker project for the Department of Arts and ... -
isiZulu Genre Classification Corpus
(Trifonius, 2013-06-19) ~Resource Catalogue Contains training and testing data for Genre Classification for isiZulu.