Show simple item record

RSG corpus
The resource consists of radio news bulletins broadcast between 2001 and 2004. The collection contains around 330 bulletins, corresponding to approximately 27 hours of audio data.
Febe de Wet
fdw@sun.ac.za
Stellenbosch University; CSIR
Afrikaans
automatic speech recognition; under-resourced languages; Afrikaans; speech resources
https://hdl.handle.net/20.500.12185/492
Speech
Data
Speech corpora
5 GB
0
27 hours
The data comprises a collection of audio files. Each audio file corresponds to an utterance from a news bulletin. Transcriptions of the audio are included in the data set in TextGrid format. All 28 speakers are adults (18 male, 10 female).
Monolingual : Annotated : Unaligned
Resource Index
afr
2019-02-01T05:49:36Z
2019-02-01T05:49:36Z
2018-02-28


Files in this item

FilesSizeFormatView

There are no files associated with this item.

This item appears in the following Collection(s)

  • Resource Index [412]
    A collection of language resource metadata mostly collected during the NHN funded technology audit of 2009, as well as the SADiLaR technology audit of 2018. Not all resources in this collection are available for download.

Show simple item record