Search

Now showing items 1-10 of 36

Lwazi isiNdebele TTS corpus

Daniel van Niekerk; Etienne Barnard; Marelie Davel; Aby Louw; Alta de Waal (Meraka Institute, CSIR, 2013-03-27) ~ Resource Catalogue

Orthographic and phonemically aligned transcriptions

NCHLT Optical Character Recognition for South African Languages

Martin Puttkammer; Justin Hocking; Roald Eiselen (North-West University; Centre for Text Technology (CTexT), 2017-02-23) ~ Resource Catalogue

An OCR system is an application that enables one to convert scanned paper documents into editable and searchable texts. The engine analyses the structure ...

Lwazi II isiNdebele TTS Corpus

Daniel van Niekerk; Georg Schlünz (Meraka Institute, CSIR; North-West University, 2015-11-20) ~ Resource Catalogue

Orthographic and phonemically aligned transcriptions.

NCHLT Tagger

Roald Eiselen (North-West University; Centre for Text Technology (CTexT), 2016-04-29) ~ Resource Catalogue

A graphical user interface and command line tool to automatically annotate running text with one or more linguistic tags:\n* Part of Speech\n* Named ...

NCHLT South African Language Identifier

Martin Puttkammer; Justin Hocking; Roald Eiselen (North-West University; Centre for Text Technology (CTexT), 2016-04-29) ~ Resource Catalogue

A graphical user interface and command line tool to automatically classify a document, paragraph, sentence or phrase as one of the eleven official South ...

Autshumato TMS

Martin Schlemmer; Wildrich Fourie; Werner Liebenberg; Ismail Lavangee (North-West University; Centre for Text Technology (CTexT), 2013-06-19) ~ Resource Catalogue

Terminology Management System. Web application used by Terminologists and Administrators to capture, edit and export terminology.

Autshumato PDF Text Extractor

Wildrich Fourie (North-West University; Centre for Text Technology (CTexT), 2013-06-20) ~ Resource Catalogue

Utility application for extracting text out of a PDF document. The pages can also be extracted as images.

NCHLT-inlang Pronunciation Dictionaries

Marelie Davel (Meraka Institute, CSIR; North-West University, 2014-07-04) ~ Resource Catalogue

Broad phonemic transcriptions for 15,000 generic words in each of 11 languages. Each dictionary has an associated rule set for generating pronunciations ...

Lara2

Martin Puttkammer; Martin Schlemmer (North-West University; Centre for Text Technology (CTexT), 2014-05-30) ~ Resource Catalogue

Tool for annotating texts with lemma, part of speech and morphological analysis information

NCHLT isiNdebele Lemmatiser

Martin Puttkammer; Martin Schlemmer; Ruan Bekker (North-West University; Centre for Text Technology (CTexT), 2014-05-30) ~ Resource Catalogue

Lemmatiser developed during the NCHLT Text project. \n\n Available in the Readme.txt - Input format: Text data (encoding: UTF8 without BOM), one ...

View previous page
1
2
3
4
View next page