Mburisano Covid-19 multilingual corpus
License agreement
By downloading this resource I accept and agree to the terms of use and the associated license conditions under which the resource is distributed.
Download
MD5: 3f955be17ee38838ae8fd21ae2fd8990
License agreement
By downloading this resource I accept and agree to the terms of use and the associated license conditions under which the resource is distributed.
MD5: 50427aa2a043b5ca8c4bf0a46af93268
License agreement
By downloading this resource I accept and agree to the terms of use and the associated license conditions under which the resource is distributed.
Collections
- Resource Index [412]
Author(s)
Marais, Laurette
Metadata
Show full item recordDescription
This corpus was created to aid development of the AwezaMed Covid-19 speech-to-speech mobile application. The project within which it was created, Mburisano, was funded by the Department of Sport, Arts and Culture (DSAC).
A selection of English sentences was generated in consultation with medical domain experts, and these sentences were manually translated into all official South African languages. The sentences formed the basis of the rapid development of Grammatical Framework (GF) application grammars for all the languages, to aid spoken communication about Covid-19 with a particular focus on screening and triage. The corpus is presented as a limited domain, manually translated parallel corpus in all 11 official South African languages.
The AwezaMed Covid-19 application can be found [here](https://play.google.com/store/apps/details?id=za.co.aweza.covid19&gl=ZA).
Contact person
Laurette MaraisContact person's e-mail address
laurette.p@gmail.comPublisher(s)
CSIR Voice Computing