DSAE Citations Database
(Dictionary Unit for South African Languages, 2015-01-28) ~ - Resource Index
Citations contained in A Dictionary of South African English on Historical Principles (1996) covering English usage in SA up to 1995, plus new material ...
Autshumato English-Setswana Parallel Corpora
(North-West University; Centre for Text Technology (CTexT), 2016-10-28) ~ - Resource Catalogue
Aligned English-Setswana parallel corpus. This set contains data that was translated by professional translators, data that was sourced as translated ...
Multilingual Natural Sciences & Technology Terminology List (Grade 4 - 6)
(Terminology Coordination Section of the National Language Service, Department of Arts and Culture, 2017-03-03) ~ - Resource Index
2756 English source terms with their equivalents in the ten other official South African languages. The list was populated from terms excerpted from ...
Autshumato Text Anonymiser
(North-West University; Centre for Text Technology (CTexT), 2013-06-20) ~ - Resource Catalogue
Anonymises text by classifying and replacing sensitive information such as person names, business names, place names, monetary values, phone numbers, ...
CTexT Multilingual Text Corpora
(North-West University; Centre for Text Technology (CTexT), 2015-02-03) ~ - Resource Index
Document-level aligned corpora for machine translation purposes.
UNISA English/Zulu Parallel Corpus
(University of South Africa, 2018-02-28) ~ - Resource Index
The resource comprises sentence-aligned and tokenized parallel text in English and Zulu. The text was extracted from the following sources: an adapted ...
Autshumato English-isiZulu Parallel Corpora
(North-West University; Centre for Text Technology (CTexT), 2013-06-19) ~ - Resource Catalogue
Parallel corpora aligned on sentence level through a combination of automatic and manual alignment techniques. The parallel corpora were obtained from ...
SAE Newspaper Text Corpus
(Stellenbosch University, 2015-01-27) ~ - Resource Index
Newspaper text in electronic format obtained from Avusa Media through a licensing agreement (renewed annually).
Generic Bilingual Academic Wordlist with Definitions
(ICELDA; SADiLaR, 2021)
The academic wordlist has been developed to serve as a resource to students to assist them to better understand words used within the information they ...
Bilingual English-isiXhosa corpus
(North-West University - Centre for Text Technology (CTexT), 2019-11-30) ~ - Resource Catalogue
Aligned parallel corpora for the following language pair: English-isiXhosa.
The data is given as two separate UTF-8 text files, with each segment on a ...