High quality TTS data for four South African languages (af, st, tn, xh)
Title | High quality TTS data for four South African languages (af, st, tn, xh) |
Description | This data set contains multi-speaker TTS high quality transcribed audio data for four languages of South Africa: Afrikaans, Sesotho, Setswana and isiXhosa. The data set consists of wave files, and a TSV file transcribing the audio. In each folder, the file line_index.tsv contains a FileID, which in turn contains the UserID and the Transcription of audio in the file. The data set has had some quality checks, but there might still be errors. This data set was collected by as a collaboration between North-West University and Google. See LICENSE.txt file for license information. Copyright 2017 Google, Inc. |
Contact name | Daniel Povey |
Contact email | dpovey@gmail.com |
Publisher(s) | Google; North-West University |
License | Attribution-ShareAlike 4.0 International (CC BY-SA 4.0): https://creativecommons.org/licenses/by-sa/4.0/ |
Language(s) | Afrikaans; isiXhosa; Setswana; Sesotho |
Contributor | Google; North-West University |
Subject | TTS |
URI | https://hdl.handle.net/20.500.12185/527 |
Media type | Speech |
Media category | Multilingual Speech Corpus |
Format extent | Audio files |
Version | 1 |
Format size | 3.31GB |
Format medium | TSV; WAV |
Project | OpenSLR (Open Speech and Language Resources) |
Primary collection | Resource Catalogue |
Secondary collection | Resource Index |
ISO639 code | afr; xho; sot; tsn |
Submit date | 2020-01-14T09:53:43Z |
Date available | 2020-01-14T09:53:43Z |
Date created | 2017 |
Files in this item
This item appears in the following Collection(s)
-
Resource Catalogue [350]
A collection of language resources available for download from the RMA of SADiLaR. The collection mostly consists of resources developed with funding from the Department of Arts and Culture. -
Resource Index [412]
A collection of language resource metadata mostly collected during the NHN funded technology audit of 2009, as well as the SADiLaR technology audit of 2018. Not all resources in this collection are available for download.