
Now showing items 1-10 of 49

NCHLT Optical Character Recognition for South African Languages

Martin Puttkammer; Justin Hocking; Roald Eiselen (North-West University; Centre for Text Technology (CTexT), 2017-02-23) ~ Resource Catalogue

An OCR system is an application that enables one to convert scanned paper documents into editable and searchable texts. The engine analyses the structure ...

Lwazi isiZulu TTS corpus

Daniel van Niekerk; Etienne Barnard; Marelie Davel; Aby Louw; Alta de Waal (Meraka Institute, CSIR, 2013-03-27) ~ Resource Catalogue

Orthographic and phonemically aligned transcriptions

African Speech Technology isiZulu Text Corpus

CatchWord Language and Speech Technologies (Pty) Ltd (North-West University; Stellenbosch University; University of Transkei; University of Free State (Qwa-Qwa campus); Rhodes University; University of KwaZulu-Natal; University of Western Cape, 2015-01-07) ~ Resource Catalogue

Monolingual text corpus developed during the African Speech Technology project.

NCHLT Tagger

Roald Eiselen (North-West University; Centre for Text Technology (CTexT), 2016-04-29) ~ Resource Catalogue

A graphical user interface and command line tool to automatically annotate running text with one or more linguistic tags:
* Part of Speech
* Named ...

NCHLT isiZulu Phrase Chunk Annotated Corpus

M. Setaka; Roald Eiselen (North-West University; Centre for Text Technology (CTexT), 2016-04-29) ~ Resource Catalogue

Phrase chunk annotated data for the NCHLT Text Resource Development: Phase II Project. The phrase chunk annotated data is a subset of the 50,000 tokens ...

NCHLT South African Language Identifier

Martin Puttkammer; Justin Hocking; Roald Eiselen (North-West University; Centre for Text Technology (CTexT), 2016-04-29) ~ Resource Catalogue

A graphical user interface and command line tool to automatically classify a document, paragraph, sentence or phrase as one of the eleven official South ...

Autshumato TMS

Martin Schlemmer; Wildrich Fourie; Werner Liebenberg; Ismail Lavangee (North-West University; Centre for Text Technology (CTexT), 2013-06-19) ~ Resource Catalogue

Terminology Management System. Web application used by Terminologists and Administrators to capture, edit and export terminology.

Autshumato PDF Text Extractor

Wildrich Fourie (North-West University; Centre for Text Technology (CTexT), 2013-06-20) ~ Resource Catalogue

Utility application for extracting text out of a PDF document. The pages can also be extracted as images.

NCHLT-inlang Pronunciation Dictionaries

Marelie Davel (Meraka Institute, CSIR; North-West University, 2014-07-04) ~ Resource Catalogue

Broad phonemic transcriptions for 15,000 generic words in each of 11 languages. Each dictionary has an associated rule set for generating pronunciations ...

Autshumato English-isiZulu Translation Memory

Cindy McKellar; Marissa Griesel; Handré Groenewald (North-West University; Centre for Text Technology (CTexT), 2013-06-19) ~ Resource Catalogue

Translation memory from English (EN-GB) to isiZulu, in the government domain for use in the Autshumato ITE application.
