AwezaMed automatic speech recognition (ASR) test data

Creative Commons Attribution 3.0 Unported (CC BY 3.0): https://creativecommons.org/licenses/by/3.0/legalcodeBandehorst, JacoVan Niekerk, NinaCalteaux, Karen2025-02-122025-02-122020-12https://hdl.handle.net/20.500.12185/688The corpus contains orthographically transcribed broadband speech in four official languages of South Africa: Afrikaans, English, isiXhosa and isiZulu. Respondents read a number of ASR prompts (10 or 20) in a real-world environment. Dataset includes 1 hour of test dataASRTest dataSpeech corporaAwezaMedAwezaMed automatic speech recognition (ASR) test data