Search

Now showing items 11-20 of 47

NCHLT Sesotho Speech Corpus

Charl van Heerden; Etienne Barnard; Jaco Badenhorst; Marelie Davel; Alta de Waal (Meraka Institute, CSIR; North-West University, 2014-07-08) ~ Resource Catalogue

Orthographically transcribed broadband speech corpus of approximately 56 hours, including a test suite of 8 speakers.

South African Multilingual Proper Names (Multipron) Corpus

Etienne Barnard; Marelie Davel; Oluwapelumi Giwa; Nadia Barnard; Jean-Pierre Martens; Derik Thirion (Molo Afrika Speech Technologies, 2013-10-03) ~ Resource Catalogue

Audio, orthographic and auditory verified broad phonemic transcriptions of proper names in four languages, produced by speakers of the same four languages.

Lara2

Martin Puttkammer; Martin Schlemmer (North-West University; Centre for Text Technology (CTexT), 2014-05-30) ~ Resource Catalogue

Tool for annotating texts with lemma, part of speech and morphological analysis information

Lwazi Sesotho ASR corpus

Charl van Heerden; Etienne Barnard; Jaco Badenhorst; Marelie Davel (Meraka Institute, CSIR, 2013-04-02) ~ Resource Catalogue

Complete audio recordings and orthographic transcriptions used for Lwazi speech recognition systems.

NCHLT Sesotho Annotated Text Corpora

Martin Puttkammer; Martin Schlemmer; Ruan Bekker (North-West University; Centre for Text Technology (CTexT), 2014-05-30) ~ Resource Catalogue

Lemmatised, part of speech tagged and morphologically analysed corpora developed during the NCHLT Text project.

High quality TTS data for four South African languages (af, st, tn, xh)

Unknown author (Google; North-West University, 2017) ~ Resource Catalogue

This data set contains multi-speaker TTS high quality transcribed audio data for four languages of South Africa: Afrikaans, Sesotho, Setswana and isiXhosa. ...

Lwazi II Sesotho TTS Corpus

Daniel van Niekerk; Georg Schlünz (Meraka Institute, CSIR; North-West University, 2015-11-20) ~ Resource Catalogue

Orthographic and phonemically aligned transcriptions.

SADE Municipality Hotline IVR Prompts

Charl van Heerden; Marelie Davel (North-West University; Molo Afrika Speech Technologies; IntSyst Labs CC, 2015-09-07) ~ Resource Catalogue

Audio and corresponding transcriptions for the SADE Municipality Hotline IVR prompts in English, Sesotho and isiZulu. The English SADE municipality ...

Martin Schlemmer; Wikus Pienaar; Wildrich Fourie; Ismail Lavangee; Cindy McKellar; Gordon Matthews; Marissa Griesel (North-West University; Centre for Text Technology (CTexT), 2013-06-20) ~ Resource Catalogue

Anonymises text by classifying and replacing sensitive information such as person names, business names, place names, monetary values, phone numbers, ...

NCHLT Sesotho Text Corpora

Martin Puttkammer; Martin Schlemmer; Wikus Pienaar; Ruan Bekker (North-West University; Centre for Text Technology (CTexT), 2014-05-30) ~ Resource Catalogue

Collection of source text documents, genre classified text documents, raw corpus, clean corpus, lexicon, frequency list and named-entity lists developed ...

View previous page
1
2
3
4
5
View next page

Search

Filters

Filter options