USAf National Language Resources Audit 2023
(South African Centre for Digital Language Resources, 2023-10)
This report documents the findings of a comprehensive language resources audit conducted by the South African Centre for Digital Language Resources ...
Generic Multilingual Academic Wordlists with Definitions
(SADiLaR; ICELDA, 2022)
This multilingual generic academic wordlist has been developed to serve as a resource to students to assist with building a vocabulary and decoding ...
isiNdebele Custom Dictionary for Government Domain
(North-West University; Centre for Text Technology (CTexT), 2013-02-22) - Resource Catalogue
Word list developed as a custom dictionary for use in the spelling checkers as part of the spelling checker project for the Department of Arts and ...
isiNdebele Genre Classification Corpus
(Trifonius, 2013-06-19) - Resource Catalogue
Contains training and testing data for Genre Classification for isiNdebele.
Multilingual Linguistic Terminology
(UNISA, 2022-09-20)
Multilingual Linguistic Terminology Project
Termbanks of linguistic terminology for South African languages
Version 1.0
https://linguistictermino ...
Morphologically annotated corpus for isiNdebele
(Centre for Text Technology (CTexT), 2024-01-31)
NCHLT corpus of morphologically annotated tokens in isiNdebele converted to the tags used during phases 1 and 2 of the SADiLaR-II project.
The data ...
Autshumato English-isiNdebele Parallel Corpora
(North-West University; Centre for Text Technology (CTexT), 2021-01-31)
Aligned parallel corpora for the following language pair: English-isiNdebele. Data was crawled from various multilingual government websites, sourced ...
CTexTools 2
(North-West University; Centre for Text Technology (CTexT); South African Department of Arts and Culture, 2018-05-24) - Resource Catalogue
CTexTools is a corpus query and manipulation tool primarily for the official South African languages. The tool supports the creation of frequency and ...
Autshumato Machine Translation Evaluation Set
(North-West University; Centre for Text Technology (CTexT); Department of Arts and Culture, South Africa, 2017-12-15) - Resource Catalogue
Comparable evaluation data for use in automatic machine translation evaluations. The evaluation set consists of 500 sentences translated separately by ...
African Wordnet version 1.0
(UNISA, 2022-09-20)
Developed using the expand model with Princeton WordNet 3.1 as basis.
Please see https://africanwordnet.wordpress.com/ for all details on the project. ...