Resource Index: Recent submissions
Now showing items 31-40 of 414
-
Monolingual Siswati Corpus
(North-West University - Centre for Text Technology (CTexT), 2022-03-31)Monolingual corpus for SiSwati. The data is given as a single UTF-8 text file, with each segment on a newline. The dataset contains existing data sourced ... -
Sesotho syllabification systems
(South African Centre for Digital Language Resources, 2022-02-03)This package contains two syllabification systems for Sesotho (rule-based and TeX-based). -
Sesotho syllable wordlist
(South African Centre for Digital Language Resources, 2022-02-03)This package contains a wordlist containing Sesotho words and their syllable information. -
Linguistically enriched corpora for conjunctively written South African languages
(North-West University, Centre for Language Technology (CTexT), 2021-09)This resource contains linguistically annotated data for four official South African languages with a conjunctive orthography from the Nguni family ... -
Description of N|uu
(Bonny Sands, 2015-10-06)Recordings of dictionary entries for a pan-dialectal dictionary of the N|uu language (Eastern and Western dialects) made by Bonny Sands, Johanna Brugman, ... -
Mburisano Covid-19 multilingual corpus
(CSIR Voice Computing, 2020-12-04)This corpus was created to aid development of the AwezaMed Covid-19 speech-to-speech mobile application. The project within which it was created, ... -
Denominal adjectives in Afrikaans dataset
(South African Centre for Digital Language Resources, 2020-05-15) ~Resource Catalogue This dataset contain a collection of Afrikaans denominal adjectives that were extracted from the Virtual Institute for Afrikaans' corpus portal. The ... -
Representations of epistemological certainty and ontological ambiguity in selected earlier works by Joseph Conrad
(North-West University, 2019-02-18) ~Resource Catalogue Representations of epistemological certainty and ontological ambiguity in selected earlier works by Joseph Conrad -
SPCS Speech Corpus
(Council for Scientific and Industrial Research; North-West University, 2015-11-25) ~Resource Catalogue Broadband speech corpus of approximately 10 hours and the corresponding transcriptions. The development process of the corpus involved the recording ... -
Speech transcription platform user interface
(Multilingual Speech Technologies, North-West University, 2017) ~Resource Index This is the user interface component of the Speech Transcription Platform developed by the Multilingual Speech Technologies group at North-West University ...