Now showing items 311-320 of 345
isiNdebele Genre Classification Corpus
(Trifonius, 2013-06-19)
Contains training and testing data for genre classification in isiNdebele.
POS annotated corpus with 5 different text types for isiZulu
(Centre for Text Technology (CTexT), 2024-01-31)
This is a POS-annotated corpus covering 5 different text types for isiZulu.
The text types included are:
- CAPS gr12 (Academic) - https://www.educat ...
POS annotated corpus in 5 different genres for Sepedi
(Centre for Text Technology (CTexT), 2024-01-31)
This corpus contains POS-annotated data in 5 different genres for Sepedi.
The text types included are:
- CAPS gr12 (Academic) - https://www.educ ...
Multilingual Linguistic Terminology
(UNISA, 2022-09-20)
Multilingual Linguistic Terminology Project
Termbanks of linguistic terminology for South African languages
Version 1.0
https://linguistictermino ...
Ex Machina: Using NLP and statistical learning models to model eyewitness statements and choosing behaviour
(SADiLaR, 2019-05-07)
This curated database includes data from various empirical studies in which eyewitness statements and descriptions were collected. The original studies, ...
Morphologically annotated corpus for isiNdebele
(Centre for Text Technology (CTexT), 2024-01-31)
NCHLT corpus of morphologically annotated tokens in isiNdebele converted to the tags used during phases 1 and 2 of the SADiLaR-II project.
The data ...
Morphologically annotated corpus for isiXhosa
(Centre for Text Technology (CTexT), 2024-01-31)
NCHLT corpus of morphologically annotated tokens in isiXhosa converted to the tags used during phases 1 and 2 of the SADiLaR-II project.
The data is ...
Morphologically annotated corpus for isiZulu
(Centre for Text Technology (CTexT), 2024-01-31)
NCHLT corpus of morphologically annotated tokens in isiZulu converted to the tags used during phases 1 and 2 of the SADiLaR-II project.
The data is ...
Morphologically annotated corpus for Sesotho
(Centre for Text Technology (CTexT), 2024-01-31)
NCHLT corpus of morphologically annotated tokens in Sesotho converted to the tags used during phases 1 and 2 of the SADiLaR-II project.
The data is ...
Morphologically annotated corpus for Siswati
(Centre for Text Technology (CTexT), 2024-01-31)
NCHLT corpus of morphologically annotated tokens in Siswati converted to the tags used during phases 1 and 2 of the SADiLaR-II project.
The data is ...