High quality TTS data for four South African languages (af, st, tn, xh)
Please do not copy the URL from the browser for citation. The correct URL is 'https://hdl.handle.net/20.500.12185/527'
dc.contact.email | dpovey@gmail.com | en_ZA |
dc.contact.name | Daniel Povey | en_ZA |
dc.contributor.other | ||
dc.contributor.other | North-West University | |
dc.date.accessioned | 2020-01-14T09:53:43Z | |
dc.date.available | 2020-01-14T09:53:43Z | |
dc.date.issued | 2017 | |
dc.description | This data set contains multi-speaker TTS high quality transcribed audio data for four languages of South Africa: Afrikaans, Sesotho, Setswana and isiXhosa. The data set consists of wave files, and a TSV file transcribing the audio. In each folder, the file line_index.tsv contains a FileID, which in turn contains the UserID and the Transcription of audio in the file. The data set has had some quality checks, but there might still be errors. This data set was collected by as a collaboration between North-West University and Google. See LICENSE.txt file for license information. Copyright 2017 Google, Inc. | en_ZA |
dc.format | tar.gz | en_ZA |
dc.format.extent | Audio files | en_ZA |
dc.format.medium | TSV | en_ZA |
dc.format.medium | WAV | en_ZA |
dc.format.size | 3.31GB | en_ZA |
dc.identifier.other | SLR32 | |
dc.identifier.uri | https://hdl.handle.net/20.500.12185/527 | |
dc.language.iso | afr | |
dc.language.iso | xho | |
dc.language.iso | sot | |
dc.language.iso | tsn | |
dc.languages | Afrikaans | en_ZA |
dc.languages | isiXhosa | en_ZA |
dc.languages | Setswana | en_ZA |
dc.languages | Sesotho | en_ZA |
dc.media.category | Multilingual Speech Corpus | en_ZA |
dc.media.type | Speech | en_ZA |
dc.project | OpenSLR (Open Speech and Language Resources) | en_ZA |
dc.publisher | en_ZA | |
dc.publisher | North-West University | en_ZA |
dc.rights.license | Attribution-ShareAlike 4.0 International (CC BY-SA 4.0): https://creativecommons.org/licenses/by-sa/4.0/ | en_ZA |
dc.subject | TTS | en_ZA |
dc.title | High quality TTS data for four South African languages (af, st, tn, xh) | en_ZA |
dc.version | 1 | en_ZA |
local.collection.primary | Resource Catalogue | |
local.collection.secondary | Resource Index | |
local.url | http://openslr.org/32/ | en_ZA |
Files
Original bundle
1 - 4 of 4
Loading...
- Name:
- af_za.tar.gz
- Size:
- 906.78 MB
- Format:
- Hypothetically, if a tarball were an official media type and following conventions, its MIME type would be application/tar (file extension .tar) and its compressed version would be application/tar+gzip (file extensions .tar.gz and .tgz).
- Description:
- Audio files and transcriptions for Afrikaans
Loading...
- Name:
- st_za.tar.gz
- Size:
- 690.87 MB
- Format:
- Hypothetically, if a tarball were an official media type and following conventions, its MIME type would be application/tar (file extension .tar) and its compressed version would be application/tar+gzip (file extensions .tar.gz and .tgz).
- Description:
- Audio files and transcriptions for Sesotho
Loading...
- Name:
- tn_za.tar.gz
- Size:
- 695.62 MB
- Format:
- Hypothetically, if a tarball were an official media type and following conventions, its MIME type would be application/tar (file extension .tar) and its compressed version would be application/tar+gzip (file extensions .tar.gz and .tgz).
- Description:
- Audio files and transcriptions for Setswana
Loading...
- Name:
- xh_za.tar.gz
- Size:
- 865.46 MB
- Format:
- Hypothetically, if a tarball were an official media type and following conventions, its MIME type would be application/tar (file extension .tar) and its compressed version would be application/tar+gzip (file extensions .tar.gz and .tgz).
- Description:
- Audio files and transcriptions for isiXhosa
License bundle
1 - 1 of 1
Loading...
- Name:
- license.txt
- Size:
- 3.23 KB
- Format:
- Item-specific license agreed upon to submission
- Description: