Search
Now showing items 1-2 of 2
Monolingual Siswati Corpus
(North-West University - Centre for Text Technology (CTexT), 2022-03-31)
Monolingual corpus for SiSwati. The data is given as a single UTF-8 text file, with each segment on a newline. The dataset contains existing data sourced ...
Bilingual English-Siswati Corpus
(North-West University - Centre for Text Technology (CTexT), 2022-03-31)
Aligned parallel corpora for the following language pair: English-SiSwati. The data is given as four separate UTF-8 text files, with each segment on a ...