Repository logoRepository logo
 

High quality TTS data for four South African languages (af, st, tn, xh)

Loading...
Thumbnail Image

Deposit Licenses

Date

2017

Authors

Journal Title

Journal ISSN

Volume Title

Publisher

Google
North-West University

Abstract

Description

This data set contains multi-speaker TTS high quality transcribed audio data for four languages of South Africa: Afrikaans, Sesotho, Setswana and isiXhosa. The data set consists of wave files, and a TSV file transcribing the audio. In each folder, the file line_index.tsv contains a FileID, which in turn contains the UserID and the Transcription of audio in the file. The data set has had some quality checks, but there might still be errors. This data set was collected by as a collaboration between North-West University and Google. See LICENSE.txt file for license information. Copyright 2017 Google, Inc.

Keywords

Citation

Verification status

Level 0