Resource Index: Recent submissions
Now showing items 101-110 of 414
-
NCHLT Tshivenda Speech Corpus
(Meraka Institute, CSIR; North-West University, 2014-07-08) ~Resource Catalogue Orthographically transcribed broadband speech corpus of approximately 56 hours, including a test suite of 8 speakers. -
NCHLT isiZulu Speech Corpus
(Meraka Institute, CSIR; North-West University, 2014-07-08) ~Resource Catalogue Orthographically transcribed broadband speech corpus of approximately 56 hours, including a test suite of 8 speakers. -
NCHLT English Speech Corpus
(Meraka Institute, CSIR; North-West University, 2014-07-08) ~Resource Catalogue Orthographically transcribed broadband speech corpus of approximately 56 hours, including a test suite of 8 speakers. -
NCHLT Setswana Speech Corpus
(Meraka Institute, CSIR; North-West University, 2014-07-08) ~Resource Catalogue Orthographically transcribed broadband speech corpus of approximately 56 hours, including a test suite of 8 speakers. -
NCHLT Afrikaans Speech Corpus
(Meraka Institute, CSIR; North-West University, 2014-07-08) ~Resource Catalogue Orthographically transcribed broadband speech corpus of approximately 56 hours, including a test suite of 8 speakers. -
NCHLT isiXhosa Speech Corpus
(Meraka Institute, CSIR; North-West University, 2014-07-08) ~Resource Catalogue Orthographically transcribed broadband speech corpus of approximately 56 hours, including a test suite of 8 speakers. -
NCHLT Sesotho Speech Corpus
(Meraka Institute, CSIR; North-West University, 2014-07-08) ~Resource Catalogue Orthographically transcribed broadband speech corpus of approximately 56 hours, including a test suite of 8 speakers. -
NCHLT isiZulu Lemmatiser
(North-West University; Centre for Text Technology (CTexT), 2014-05-30) ~Resource Catalogue Lemmatiser developed during the NCHLT Text project. \n\n Available in the Readme.txt - Input format: Text data (encoding: UTF8 without BOM), one ... -
NCHLT Afrikaans Lemmatiser
(North-West University; Centre for Text Technology (CTexT), 2014-05-30) ~Resource Catalogue Lemmatiser developed during the NCHLT Text project. \n\n Available in the Readme.txt - Input format: Text data (encoding: UTF8 without BOM), one lowercase ... -
NCHLT Part of Speech Taggers
(North-West University; Centre for Text Technology (CTexT), 2014-05-30) ~Resource Catalogue Part of speech taggers developed during the NCHLT Text project. Available for the following languages: Afrikaans, English, isiNdebele, isiXhosa, isiZulu, ...