NCHLT Afrikaans Speech Corpus
Please do not copy the URL from the browser for citation. The correct URL is 'https://hdl.handle.net/20.500.12185/280'
dc.contact.email | KCalteaux@csir.co.za | |
dc.contact.name | Karen Calteaux | |
dc.contributor.author | Charl van Heerden | |
dc.contributor.author | Etienne Barnard | |
dc.contributor.author | Jaco Badenhorst | |
dc.contributor.author | Marelie Davel | |
dc.contributor.author | Alta de Waal | |
dc.contributor.other | Willem Basson | |
dc.contributor.other | Nic de Vries | |
dc.contributor.other | Febe de Wet | |
dc.contributor.other | Thipe Modipa | |
dc.contributor.other | Gehard van Huyssteen | |
dc.database | Monolingual Speech Corpora: Annotated | |
dc.date.accessioned | 2018-02-06T08:51:42Z | |
dc.date.accessioned | 2018-03-05T17:34:09Z | |
dc.date.available | 2018-02-06T08:51:42Z | |
dc.date.available | 2018-03-05T17:34:09Z | |
dc.date.issued | 2014-07-08 | |
dc.description | Orthographically transcribed broadband speech corpus of approximately 56 hours, including a test suite of 8 speakers. | |
dc.format.extent | 4.6 Gb | |
dc.format.medium | UTF8 | |
dc.format.medium | 16 kHz | |
dc.format.medium | 16 bit | |
dc.format.size | 2.348611111 | |
dc.identifier.citation | N.J. de Vries, M.H. Davel, J. Badenhorst, W.D. Basson, F. de Wet, E. Barnard and A. de Waal, "A smartphone-based ASR data collection tool for under-resourced languages", Speech Communication, Volume 56, January 2014, pp 119–131. | |
dc.identifier.islrn | 644-175-105-852-3 | |
dc.identifier.uri | https://hdl.handle.net/20.500.12185/280 | |
dc.language.iso | afr | |
dc.languages | Afrikaans | |
dc.media.category | Monolingual speech corpora: Annotated | |
dc.media.type | Speech | |
dc.project | NCHLT Speech | |
dc.publisher | Meraka Institute, CSIR | |
dc.publisher | North-West University | |
dc.rights.license | Creative Commons Attribution 3.0 Unported License (CC BY 3.0): http://creativecommons.org/licenses/by/3.0/legalcode | |
dc.source | Audio recordings smartphone-collected in non-studio environment | |
dc.source | Text prompts from various sources, predominantly from .gov.za (web) | |
dc.stratum | 210 speakers (98 female/112 male). Prompted speech (3-5 word utterances read from a smartphone screen) | |
dc.title | NCHLT Afrikaans Speech Corpus | |
dc.type | Data | |
dc.version | 1 | |
local.collection.primary | Resource Catalogue | |
local.collection.secondary | Resource Index |
Files
Original bundle
1 - 1 of 1
Loading...
- Name:
- nchlt.speech.corpus.afr.zip
- Size:
- 4.51 GB
- Format:
- ZIP is an archive file format that supports lossless data compression. A ZIP file may contain one or more files or directories that may have been compressed.