Browsing Resource Catalogue by Media type "Text"
Filter by:
Now showing items 1-20 of 239
-
Afribooms Afrikaans Dependency Treebank
(North-West University; Centre for Text Technology (CTexT); Katholieke Universiteit Leuven (Belgium), 2015-02-10) ~Resource Catalogue This is the annotated corpus developed for Afrikaans for the Afribooms project. The corpus includes annotations for lemma, part-of-speech (POS) and ... -
African Speech Technology Afrikaans Text Corpus
(North-West University; Stellenbosch University; University of Transkei; University of Free State (Qwa-Qwa campus); Rhodes University; University of KwaZulu-Natal; University of Western Cape, 2015-01-07) ~Resource Catalogue Monolingual text corpus developed during the African Speech Technology project. -
African Speech Technology English Text Corpus
(North-West University; Stellenbosch University; University of Transkei; University of Free State (Qwa-Qwa campus); Rhodes University; University of KwaZulu-Natal; University of Western Cape, 2015-01-07) ~Resource Catalogue Monolingual text corpus developed during the African Speech Technology project. -
African Speech Technology isiXhosa Text Corpus
(North-West University; Stellenbosch University; University of Transkei; University of Free State (Qwa-Qwa campus); Rhodes University; University of KwaZulu-Natal; University of Western Cape, 2015-01-07) ~Resource Catalogue Monolingual text corpus developed during the African Speech Technology project. -
African Speech Technology isiZulu Text Corpus
(North-West University; Stellenbosch University; University of Transkei; University of Free State (Qwa-Qwa campus); Rhodes University; University of KwaZulu-Natal; University of Western Cape, 2015-01-07) ~Resource Catalogue Monolingual text corpus developed during the African Speech Technology project. -
African Wordnet version 1.0
(UNISA, 2022-09-20)Developed using the expand model with Princeton WordNet 3.1 as basis. Please see https://africanwordnet.wordpress.com/ for all details on the project. ... -
African Wordnet: isiXhosa 1.0
(UNISA, 2017-06-20) ~Resource Catalogue Developed using the expand model with Princeton WordNet 2.0 as basis. Each wordnet contains synsets with at least the following fields:\nWord form (lemma; ... -
African Wordnet: isiZulu 1.0
(UNISA, 2017-06-20) ~Resource Catalogue Developed using the expand model with Princeton WordNet 2.0 as basis. Each wordnet contains synsets with at least the following fields:\nWord form (lemma; ... -
African Wordnet: Sesotho sa Leboa 1.0
(UNISA, 2017-06-20) ~Resource Catalogue Developed using the expand model with Princeton WordNet 2.0 as basis. Each wordnet contains synsets with at least the following fields:\nWord form (lemma; ... -
African Wordnet: Setswana 1.0
(UNISA, 2017-06-20) ~Resource Catalogue Developed using the expand model with Princeton WordNet 2.0 as basis.Each wordnet contains synsets with at least the following fields:\nWord form (lemma; ... -
African Wordnet: Tshivenda 1.0
(UNISA, 2017-06-20) ~Resource Catalogue Developed using the expand model with Princeton WordNet 2.0 as basis. Each wordnet contains synsets with at least the following fields:\nWord form (lemma; ... -
Afrikaans Custom Dictionary for Government Domain
(North-West University; Centre for Text Technology (CTexT), 2013-02-22) ~Resource Catalogue Word list developed as a custom dictionary for use in the spelling checkers as part of the spelling checker project for the Department of Arts and ... -
Afrikaans Genre Classification Corpus
(Trifonius, 2013-06-19) ~Resource Catalogue Contains training and testing data for Genre Classification for Afrikaans. -
Afrikaans linking element dataset
(North-West University, 2019) ~Resource Catalogue (Afrikaans follows English) This data set was compiled for a study in which the possible semantic content of Afrikaans linking elements was investigated. ... -
Afrikaans speaking children's first lexical items
(North-West University, 2018-05-17) ~Resource Catalogue Data collected for a master's study in Afrikaans linguistics. The data consist of the first lexical items of 21 Afrikaans speaking children. The lexical ... -
Afrikaans text unit identification data
(Centre for Text Technology, North-West University, 2006) ~Resource Catalogue This dataset was developed during a masters degree and used in the development of a text unit identifier capable of tagging sentences, named-entities, ... -
AuCoPro Semantics Dataset
(North-West University; Centre for Text Technology (CTexT); CLiPS Research Center, University of Antwerp, Belgium, 2015-01-07) ~Resource Catalogue The AuCoPro Semantics dataset serves for the automatic semantic analysis of compounds. It contains semantically annotated noun-noun compounds (NN) from ... -
AuCoPro Splitting Dataset
(North-West University; Centre for Text Technology (CTexT); Tilburg Centre for Cognition and Communication, 2015-01-07) ~Resource Catalogue The AuCoPro Splitting dataset contains compounds annotated with their compound boundaries and linking morphemes for Afrikaans and Dutch. -
Autshumato Afrikaans-English Translation Memory
(North-West University; Centre for Text Technology (CTexT), 2013-06-19) ~Resource Catalogue Translation memory from Afrikaans to English (EN-GB), in the government domain for use in the Autshumato ITE application. -
Autshumato English-Afrikaans Parallel Corpora
(North-West University; Centre for Text Technology (CTexT), 2013-06-19) ~Resource Catalogue Parallel corpora aligned on sentence level through a combination of automatic and manual alignment techniques. The parallel corpora were obtained from ...