Repository logoRepository logo
 

SPCS Speech Corpus

Loading...
Thumbnail Image

Deposit Licenses

Date

2015-11-25

Authors

Modipa, T. I.
Davel, M. H.
De Wet, F.

Journal Title

Journal ISSN

Volume Title

Publisher

Council for Scientific and Industrial Research
North-West University

Abstract

Description

Broadband speech corpus of approximately 10 hours and the corresponding transcriptions. The development process of the corpus involved the recording and transcribing of radio broadcasts. The transcriptions were used to generate the Sepedi code-switched prompts to re-record speech from multiple speakers. The following sub-directories are found in this directory: Audio: Audio files for all the recorded code-switched speech Transcriptions: The corresponding orthographic transcriptions Metadata: Information about the speakers and the transcriptions Documentation: The directory structure and the Sepedi prompt list

Citation

T. I. Modipa, M. H. Davel, F. De Wet, "Implications of Sepedi/English code switching for ASR systems", Pattern Recognition Association of South Africa, pp. 112-117, 2015

Verification status

Level 0