Repository logoRepository logo
 

South African Broadcast News (SABN) Corpus

dc.contact.emailfdw@sun.ac.za
dc.contact.nameFebe de Wet
dc.databaseMonolingual : Annotated : Unaligned
dc.date.accessioned2019-02-01T05:49:35Z
dc.date.available2019-02-01T05:49:35Z
dc.date.issued2018-02-27
dc.descriptionThe corpus consists of approximately 20 hours of audio recordings from one of the country's main radio news channels, SAFM. Bulletins were broadcast between 1996 and 2006 and are a mix of news-reader speech, interviews, and crossings to reporters
dc.format.size20 hours
dc.identifier.urihttps://hdl.handle.net/20.500.12185/484
dc.language.isoeng
dc.languagesEnglish
dc.media.categorySpeech corpora
dc.media.typeSpeech
dc.publisherStellenbosch University
dc.publisherCSIR
dc.stratumThe data comprises a collection of audio files. Each audio file corresponds to a news bulletin. Transcriptions of the audio are included in the data set in TextGrid format. All the 27 speakers are adults (8 male, 19 female).
dc.subjectbroadcast news transcription
dc.subjectSouth African English
dc.subjectaccents of English
dc.subjectunder-resourced languages
dc.titleSouth African Broadcast News (SABN) Corpus
dc.typeData
dc.version0
local.collection.primaryResource Index

Files

Collections