Repository logoRepository logo
 

AwezaMed automatic speech recognition (ASR) test data

Loading...
Thumbnail Image

Deposit Licenses

Date

2020-12

Authors

Bandehorst, Jaco

Journal Title

Journal ISSN

Volume Title

Publisher

Voice Computing (VC) Research Group at the CSIR Nextgen Enterprises and Institutions (NGEI)

Abstract

Description

The corpus contains orthographically transcribed broadband speech in four official languages of South Africa: Afrikaans, English, isiXhosa and isiZulu. Respondents read a number of ASR prompts (10 or 20) in a real-world environment. Dataset includes 1 hour of test data

Citation

Collections

Verification status

Level 0