AwezaMed automatic speech recognition (ASR) test data
Title | AwezaMed automatic speech recognition (ASR) test data |
Description | The corpus contains orthographically transcribed broadband speech in four official languages of South Africa: Afrikaans, English, isiXhosa and isiZulu. Respondents read a number of ASR prompts (10 or 20) in a real-world environment. Dataset includes 1 hour of test data |
Contact name | Karen Calteaux |
Contact email | KCalteaux@csir.co.za |
Publisher(s) | Voice Computing (VC) Research Group at the CSIR Nextgen Enterprises and Institutions (NGEI) |
License | Creative Commons Attribution 3.0 Unported (CC BY 3.0): https://creativecommons.org/licenses/by/3.0/legalcode |
Language(s) | Afrikaans; English; isiXhosa; isiZulu |
Author(s) | Bandehorst, Jaco |
Contributor | Van Niekerk, Nina; Calteaux, Karen |
Subject | ASR; Test data; Speech corpora; AwezaMed |
URI | https://hdl.handle.net/20.500.12185/688 |
Media category | Annotated Speech Corpus |
Submit date | 2025-02-12T11:12:33Z |
Date available | 2025-02-12T11:12:33Z |
Date created | 2020-12 |
Verification status | Level 0 |
Files in this item
This item appears in the following Collection(s)
-
Resource Index [414]
A collection of language resource metadata mostly collected during the NHN funded technology audit of 2009, as well as the SADiLaR technology audit of 2018. Not all resources in this collection are available for download.