SPCS Speech Corpus

Modipa, T. I.; Davel, M. H.; De Wet, F.

Please do not copy the URL from the browser for citation. The correct URL is 'https://hdl.handle.net/20.500.12185/530'

SPCS Speech Corpus

Files

spcs.tar.gz (867.05 MB)

Deposit Licenses

license.txt (3.23 KB)

Date

2015-11-25

Authors

Modipa, T. I.

Davel, M. H.

De Wet, F.

Publisher

Council for Scientific and Industrial Research
North-West University

Description

Broadband speech corpus of approximately 10 hours and the corresponding transcriptions. The development process of the corpus involved the recording and transcribing of radio broadcasts. The transcriptions were used to generate the Sepedi code-switched prompts to re-record speech from multiple speakers. The following sub-directories are found in this directory: Audio: Audio files for all the recorded code-switched speech Transcriptions: The corresponding orthographic transcriptions Metadata: Information about the speakers and the transcriptions Documentation: The directory structure and the Sepedi prompt list

Keywords

Sepedi, Sesotho sa Leboa, code-switching, orthographic transcription, English

Citation

T. I. Modipa, M. H. Davel, F. De Wet, "Implications of Sepedi/English code switching for ASR systems", Pattern Recognition Association of South Africa, pp. 112-117, 2015

License

Creative Commons Attribution 2.5 South Africa license

URI

https://hdl.handle.net/20.500.12185/530

Collections

Resource Index
Resource Catalogue

Verification status

Level 0

Full item page

SPCS Speech Corpus

Files

Deposit Licenses

Date

Authors

Journal Title

Journal ISSN

Volume Title

Publisher

Abstract

Description

Keywords

Citation

License

URI

Collections

Verification status