Please do not copy the URL from the browser for citation. The correct URL is 'https://hdl.handle.net/20.500.12185/530'
SPCS Speech Corpus
Date
2015-11-25
Authors
Modipa, T. I.
Davel, M. H.
De Wet, F.
Publisher
Council for Scientific and Industrial Research
North-West University
Description
Broadband speech corpus of approximately 10 hours and the corresponding transcriptions.
The corpus was developed by first recording and transcribing radio broadcasts. The transcriptions were then used to generate Sepedi code-switched prompts, which multiple speakers read aloud to produce the re-recorded speech.
The corpus directory contains the following sub-directories:
Audio: Audio files for all the recorded code-switched speech
Transcriptions: The corresponding orthographic transcriptions
Metadata: Information about the speakers and the transcriptions
Documentation: The directory structure and the Sepedi prompt list
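After downloading the corpus, it can be useful to confirm that the four sub-directories listed above are present. The following is a minimal sketch only: the function name missing_subdirs is hypothetical, and it assumes the on-disk directory names match the labels in the listing exactly (spelling and capitalisation), which this record does not confirm.

```python
from pathlib import Path

# Sub-directory names as given in the description above. Their exact
# on-disk spelling and capitalisation are an assumption.
EXPECTED_SUBDIRS = ("Audio", "Transcriptions", "Metadata", "Documentation")


def missing_subdirs(corpus_root):
    """Return the expected sub-directories that are absent under corpus_root."""
    root = Path(corpus_root)
    return [name for name in EXPECTED_SUBDIRS if not (root / name).is_dir()]
```

Calling missing_subdirs with the path to the unpacked corpus returns an empty list when all four sub-directories are found, and otherwise the names that are missing, in the order listed above.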
Citation
T. I. Modipa, M. H. Davel, F. De Wet, "Implications of Sepedi/English code switching for ASR systems", Pattern Recognition Association of South Africa, pp. 112-117, 2015
Verification status
Level 0