Search

Now showing items 1-10 of 58

African Speech Technology Coloured-Afrikaans Speech Corpus

CatchWord Language and Speech Technologies (Pty) Ltd (North-West University; Stellenbosch University; University of Transkei; University of Free State (Qwa-Qwa campus); Rhodes University; University of KwaZulu-Natal; University of Western Cape, 2014-12-11) ~ Resource Catalogue

African Speech Technology speech and transcription data for the Coloured-Afrikaans database. The "speech" directory contains Afrikaans speech as spoken ...

NCHLT Optical Character Recognition for South African Languages

Martin Puttkammer; Justin Hocking; Roald Eiselen (North-West University; Centre for Text Technology (CTexT), 2017-02-23) ~ Resource Catalogue

An OCR system is an application that enables one to convert scanned paper documents into editable and searchable texts. The engine analyses the structure ...

AuCoPro Splitting Dataset

Gerhard van Huyssteen; Menno van Zaanen (North-West University; Centre for Text Technology (CTexT); Tilburg Centre for Cognition and Communication, 2015-01-07) ~ Resource Catalogue

The AuCoPro Splitting dataset contains compounds annotated with their compound boundaries and linking morphemes for Afrikaans and Dutch.

NCHLT Tagger

Roald Eiselen (North-West University; Centre for Text Technology (CTexT), 2016-04-29) ~ Resource Catalogue

A graphical user interface and command line tool to automatically annotate running text with one or more linguistic tags:\n* Part of Speech\n* Named ...

Afrikaans linking element dataset

Trollip, EB (North-West University, 2019) ~ Resource Catalogue

(Afrikaans follows English) This data set was compiled for a study in which the possible semantic content of Afrikaans linking elements was investigated. ...

NCHLT South African Language Identifier

Martin Puttkammer; Justin Hocking; Roald Eiselen (North-West University; Centre for Text Technology (CTexT), 2016-04-29) ~ Resource Catalogue

A graphical user interface and command line tool to automatically classify a document, paragraph, sentence or phrase as one of the eleven official South ...

NCHLT Afrikaans Lemmatiser

Martin Puttkammer; Martin Schlemmer; Ruan Bekker (North-West University; Centre for Text Technology (CTexT), 2014-05-30) ~ Resource Catalogue

Lemmatiser developed during the NCHLT Text project. \n\n Available in the Readme.txt - Input format: Text data (encoding: UTF8 without BOM), one lowercase ...

African Speech Technology Black-Afrikaans Speech Corpus

African Speech Technology speech and transcription data for the Black-Afrikaans database. The "speech" directory contains Afrikaans speech as spoken ...

Autshumato TMS

Martin Schlemmer; Wildrich Fourie; Werner Liebenberg; Ismail Lavangee (North-West University; Centre for Text Technology (CTexT), 2013-06-19) ~ Resource Catalogue

Terminology Management System. Web application used by Terminologists and Administrators to capture, edit and export terminology.

Autshumato PDF Text Extractor

Wildrich Fourie (North-West University; Centre for Text Technology (CTexT), 2013-06-20) ~ Resource Catalogue

Utility application for extracting text out of a PDF document. The pages can also be extracted as images.

View previous page
1
2
3
4
. . .
6
View next page