
High quality TTS data for four South African languages (af, st, tn, xh)

dc.contact.email: dpovey@gmail.com [en_ZA]
dc.contact.name: Daniel Povey [en_ZA]
dc.contributor.other: Google
dc.contributor.other: North-West University
dc.date.accessioned: 2020-01-14T09:53:43Z
dc.date.available: 2020-01-14T09:53:43Z
dc.date.issued: 2017
dc.description: This data set contains high-quality, multi-speaker transcribed audio data for text-to-speech (TTS) in four languages of South Africa: Afrikaans, Sesotho, Setswana and isiXhosa. It consists of WAV files and a TSV file transcribing the audio. In each folder, the file line_index.tsv lists, for each recording, a FileID (which in turn contains the UserID) and the transcription of the audio in that file. The data set has had some quality checks, but there might still be errors. It was collected as a collaboration between North-West University and Google. See the LICENSE.txt file for license information. Copyright 2017 Google, Inc. [en_ZA]
dc.format: tar.gz [en_ZA]
dc.format.extent: Audio files [en_ZA]
dc.format.medium: TSV [en_ZA]
dc.format.medium: WAV [en_ZA]
dc.format.size: 3.31GB [en_ZA]
dc.identifier.other: SLR32
dc.identifier.uri: https://hdl.handle.net/20.500.12185/527
dc.language.iso: afr
dc.language.iso: xho
dc.language.iso: sot
dc.language.iso: tsn
dc.languages: Afrikaans [en_ZA]
dc.languages: isiXhosa [en_ZA]
dc.languages: Setswana [en_ZA]
dc.languages: Sesotho [en_ZA]
dc.media.category: Multilingual Speech Corpus [en_ZA]
dc.media.type: Speech [en_ZA]
dc.project: OpenSLR (Open Speech and Language Resources) [en_ZA]
dc.publisher: Google [en_ZA]
dc.publisher: North-West University [en_ZA]
dc.rights.license: Attribution-ShareAlike 4.0 International (CC BY-SA 4.0): https://creativecommons.org/licenses/by-sa/4.0/ [en_ZA]
dc.subject: TTS [en_ZA]
dc.title: High quality TTS data for four South African languages (af, st, tn, xh) [en_ZA]
dc.version: 1 [en_ZA]
local.collection.primary: Resource Catalogue
local.collection.secondary: Resource Index
local.url: http://openslr.org/32/ [en_ZA]
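
The description says each folder has a line_index.tsv whose FileID contains the UserID and is paired with a transcription. A minimal Python sketch of reading such a file follows. It rests on assumptions the record does not confirm: that the first column is the FileID, that the last column is the transcription, and that a FileID has the shape `<language>_<user>_<utterance>`. The sample rows and the helper names are hypothetical.

```python
import csv
import io
from typing import Iterator, NamedTuple


class Utterance(NamedTuple):
    file_id: str
    user_id: str
    transcription: str


def user_id_from_file_id(file_id: str) -> str:
    # Assumption: FileIDs look like "<language>_<user>_<utterance>".
    # The record only states that the FileID contains the UserID,
    # not where, so adjust this helper to the real naming scheme.
    parts = file_id.split("_")
    return parts[1] if len(parts) >= 3 else ""


def read_line_index(handle) -> Iterator[Utterance]:
    # QUOTE_NONE keeps quote characters in transcriptions untouched.
    reader = csv.reader(handle, delimiter="\t", quoting=csv.QUOTE_NONE)
    for row in reader:
        if len(row) < 2:
            continue  # skip blank or malformed lines
        file_id = row[0].strip()
        # Assumption: the transcription is the last column.
        transcription = row[-1].strip()
        yield Utterance(file_id, user_id_from_file_id(file_id), transcription)


# Hypothetical sample rows, not taken from the data set.
sample = io.StringIO(
    "afr_0184_0001\thypothetical sentence one\n"
    "afr_0184_0002\thypothetical sentence two\n"
)
rows = list(read_line_index(sample))
```

With a real file, `read_line_index(open(path, encoding="utf-8", newline=""))` would take the place of the in-memory sample.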

Files

Original bundle

Name: af_za.tar.gz
Size: 906.78 MB
Format: Hypothetically, if a tarball were an official media type and following conventions, its MIME type would be application/tar (file extension .tar) and its compressed version would be application/tar+gzip (file extensions .tar.gz and .tgz).
Description: Audio files and transcriptions for Afrikaans

Name: st_za.tar.gz
Size: 690.87 MB
Format: gzip-compressed tarball (.tar.gz)
Description: Audio files and transcriptions for Sesotho

Name: tn_za.tar.gz
Size: 695.62 MB
Format: gzip-compressed tarball (.tar.gz)
Description: Audio files and transcriptions for Setswana

Name: xh_za.tar.gz
Size: 865.46 MB
Format: gzip-compressed tarball (.tar.gz)
Description: Audio files and transcriptions for isiXhosa
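
Each bundle is a gzip-compressed tarball holding audio files and transcriptions, so a quick way to check a download is to count its members by file extension without unpacking it. The sketch below uses Python's standard tarfile module. The function name and the demo archive layout (`demo/line_index.tsv`, `demo/wavs/...`) are hypothetical, since the record does not describe the directory structure inside the archives.

```python
import io
import tarfile
from collections import Counter


def count_by_extension(archive: tarfile.TarFile) -> Counter:
    """Count regular files in an open tar archive by lowercase extension."""
    counts = Counter()
    for member in archive:
        if not member.isfile():
            continue  # ignore directories, links and other special entries
        name = member.name.rsplit("/", 1)[-1]
        ext = name.rsplit(".", 1)[-1].lower() if "." in name else ""
        counts[ext] += 1
    return counts


def build_demo_archive() -> bytes:
    # Builds a tiny in-memory .tar.gz with a made-up layout for the demo.
    buffer = io.BytesIO()
    with tarfile.open(fileobj=buffer, mode="w:gz") as archive:
        for name in ("demo/line_index.tsv", "demo/wavs/a.wav", "demo/wavs/b.wav"):
            payload = b"placeholder"
            info = tarfile.TarInfo(name)
            info.size = len(payload)
            archive.addfile(info, io.BytesIO(payload))
    return buffer.getvalue()


with tarfile.open(fileobj=io.BytesIO(build_demo_archive()), mode="r:gz") as archive:
    demo_counts = count_by_extension(archive)
```

For a downloaded bundle, opening it with `tarfile.open("af_za.tar.gz", mode="r:gz")` and passing the result to `count_by_extension` would report how many WAV and TSV files it holds.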

License bundle

Name: license.txt
Size: 3.23 KB
Format: Item-specific license agreed upon to submission