Search

Now showing items 11-18 of 18

NCHLT Setswana Text Corpora

Martin Puttkammer; Martin Schlemmer; Wikus Pienaar; Ruan Bekker (North-West University; Centre for Text Technology (CTexT), 2014-05-30) ~ Resource Catalogue

Collection of source text documents, genre classified text documents, raw corpus, clean corpus, lexicon, frequency list and named-entity lists developed ...

Setswana Custom Dictionary for Government Domain

Martin Puttkammer; Nico Oosthuizen; Wikus Pienaar (North-West University; Centre for Text Technology (CTexT), 2013-02-22) ~ Resource Catalogue

Word list developed as a custom dictionary for use in the spelling checkers as part of the spelling checker project for the Department of Arts and ...

NCHLT Setswana Speech Corpus

Charl van Heerden; Etienne Barnard; Jaco Badenhorst; Marelie Davel; Alta de Waal (Meraka Institute, CSIR; North-West University, 2014-07-08) ~ Resource Catalogue

Orthographically transcribed broadband speech corpus of approximately 56 hours, including a test suite of 8 speakers.

Autshumato Multilingual Word and Phrase Translations

Martin Puttkammer; Justin Hocking (North-West University; Centre for Text Technology (CTexT), 2016-01-20) ~ Resource Catalogue

Word and phrase lists aligned from English to the other official South African languages.

NCHLT Setswana Phrase Chunk Annotated Corpus

R.S. Pretorius; Roald Eiselen (North-West University; Centre for Text Technology (CTexT), 2016-04-29) ~ Resource Catalogue

Phrase chunk annotated data for the NCHLT Text Resource Development: Phase II Project. The phrase chunk annotated data is a subset of the 50,000 tokens ...

Lwazi II Sotho Pronunciation Dictionaries

Marelie Davel; Thipe Modipa (Meraka Institute, CSIR; North-West University, 2015-11-20) ~ Resource Catalogue

Pronunciation dictionaries for Sepedi, Sesotho and Setswana with and without affricates, as well as the maps that were used to split the affricates into ...

Setswana Test suite and Treebank

Berg, Ansu (North-West University, 2018-03-27) ~ Resource Catalogue

The main aim of the PhD study "A computational syntactic analysis of Setswana"(AS Berg, May 2018) is the computational syntactic analysis of the Setswana ...

Autshumato Setswana Monolingual Corpora

Cindy McKellar (North-West University; Centre for Text Technology (CTexT), 2016-10-28) ~ Resource Catalogue

Setswana monolingual corpus as a deliverable of the Autshumato project. The data is given as a UTF-8 text file; with each sentence on a new line. NOTE: ...

View previous page
1
2