Show simple item record

UNISA Multilingual Corpus
The resource comprises a diverse selection of TEI P5 marked up documents from institutional origin, written in Tswana, Afrikaans and English. This is an ongoing project. Sources include the Unisa Press 140 Heritage Collection, a private collection from Unisa Press, as well as the Unisa Library.
Gideon Kotzé
kotzegj1@unisa.ac.za
University of South Africa
not applicable
Afrikaans; English; Setswana
multilingual corpus; TEI P5 markup; Tswana; Afrikaans; English
https://hdl.handle.net/20.500.12185/490
Text
Data
Multilingual text
2.5MB
0.0.1
332311 words (untokenized)
UTF8
not yet determined
Resource Index
afr; eng; tsn
2019-02-01T05:49:36Z
2019-02-01T05:49:36Z
2018-02-28


Files in this item

FilesSizeFormatView

There are no files associated with this item.

This item appears in the following Collection(s)

  • Resource Index [412]
    A collection of language resource metadata mostly collected during the NHN funded technology audit of 2009, as well as the SADiLaR technology audit of 2018. Not all resources in this collection are available for download.

Show simple item record