
Mburisano Covid-19 multilingual corpus


Date

2020-12-04

Authors

Marais, Laurette

Publisher

CSIR Voice Computing

Description

This corpus was created to aid development of the AwezaMed Covid-19 speech-to-speech mobile application. The project within which it was created, Mburisano, was funded by the Department of Sport, Arts and Culture (DSAC). A selection of English sentences was generated in consultation with medical domain experts, and these sentences were manually translated into the other official South African languages. The sentences formed the basis for the rapid development of Grammatical Framework (GF) application grammars for all the languages, which are intended to aid spoken communication about Covid-19, with a particular focus on screening and triage. The corpus is presented as a limited-domain, manually translated parallel corpus in all 11 official South African languages. The AwezaMed Covid-19 application is available on [Google Play](https://play.google.com/store/apps/details?id=za.co.aweza.covid19&gl=ZA).

Verification status

Level 0