UNISA English/Zulu Parallel Corpus

Title	UNISA English/Zulu Parallel Corpus
Description	The resource comprises sentence-aligned and tokenized parallel text in English and Zulu. The text was extracted from the following sources: an adapted version of the English/Zulu Autshumato corpus, paragraph-translated Wikipedia texts, the Bible, the Book of Mormon, the Constitution of South Africa, the Universal Declaration of Human Rights, and a selection of translated sentences from the book "Beyond the He/Man" (1996).
Contact name	Gideon Kotzé
Contact email	kotzegj1@unisa.ac.za
Publisher(s)	University of South Africa
License	All rights reserved
Language(s)	English; isiZulu
Subject	parallel corpus; English; Zulu
URI	https://hdl.handle.net/20.500.12185/489
Media type	Text
Type	Data
Media category	Multilingual text
Format extent	15MB
Version	0.0.1
Format size	Token count: English = 1,490,368; Zulu = 1,009,820
Format medium	UTF-8
Stratum	yet to be determined
Primary collection	Resource Index
ISO639 code	eng; zul
Submit date	2019-02-01T05:49:36Z
Date available	2019-02-01T05:49:36Z
Date created	2018-02-28

Files in this item

There are no files associated with this item.

Resource Index [412]
A collection of language resource metadata, collected mostly during the NHN-funded technology audit of 2009 and the SADiLaR technology audit of 2018. Not all resources in this collection are available for download.