CTexT Afrikaans GloVe Word Embeddings
Title | CTexT Afrikaans GloVe Word Embeddings |
Description | The CTexT Afrikaans GloVe Word Embeddings is a 300 dimensional Afrikaans embedding model based on the Global Vectors architecture (Pennington, 2014) that provides real-valued vector representations for Afrikaans text. The embedding model was trained on a corpus of 230 million words. |
Contact name | Roald Eiselen |
Contact email | Roald.Eiselen@nwu.ac.za |
Publisher(s) | Centre for Text Technology (CTexT) |
License | Creative Commons Attribution-Noncommercial 4.0 International (CC BY-NC 4.0): https://creativecommons.org/licenses/by-nc/4.0/ |
Language(s) | Afrikaans |
Author(s) | Eiselen, Roald |
Contributor | Eiselen, Roald |
Subject | GloVe; Global Vectors; Word embedding |
URI | https://hdl.handle.net/20.500.12185/553 |
Media category | Word embeddings |
Format extent | 230 million words |
Version | 0.1 |
Format size | 1.66 Gb |
Format medium | N/A |
Submit date | 2022-02-03T09:00:19Z |
Date available | 2022-02-03T09:00:19Z |
Date created | 2022-01-10 |
Files in this item
This item appears in the following Collection(s)
-
Resource Catalogue [350]
A collection of language resources available for download from the RMA of SADiLaR. The collection mostly consists of resources developed with funding from the Department of Arts and Culture.