Creative Commons Attribution 2.5 South Africa License: http://creativecommons.org/licenses/by/2.5/za/legalcodeDaniel van NiekerkEtienne BarnardOluwapelumi GiwaAzeez Sosimi2018-02-052018-03-052018-02-052018-03-052015-02-06https://hdl.handle.net/20.500.12185/431This speech corpus consisting of 16 female speakers and 17 male speakers was recorded in Lagos, Nigeria for the purpose of speech recognition research. Each speaker recorded about 130 utterances read from short texts selected for phonetic coverage. Recordings were done using a microphone connected to a laptop computer in a quiet office environment.268 Mb (zipped)UTF8UTF-8 encoded Unicode textRIFF-WAVE 16-bit PCM samples at 16kHz sampling rateyorLagos-NWU Yoruba Speech CorpusData573-526-122-515-8Number of speakers: 33, Number of utterances: 4316, Audio length: 165 mins. (including non-speech segments) Per speaker: approx. 130 utterances amounting to approx. 5 minutes of audio