Repository logoRepository logo
 

CTexT Afrikaans fastText CBoW String Embeddings

dc.contact.emailRoald.Eiselen@nwu.ac.zaen_ZA
dc.contact.nameRoald Eiselenen_ZA
dc.contributor.authorEiselen, Roald
dc.date.accessioned2022-02-02T06:56:56Z
dc.date.available2022-02-02T06:56:56Z
dc.date.issued2022-01-10
dc.descriptionThe CTexT Afrikaans fastText CBoW String Embeddings is a 300 dimensional Afrikaans embedding model based on the Contunious Bag of Words fastText architecture that provides real-valued vector representations for Afrikaans text. The embedding was trained on a corpus of 230 million words.en_ZA
dc.formatPickle & texten_ZA
dc.format.extent230 million wordsen_ZA
dc.format.mediumN/Aen_ZA
dc.format.size3.31 Gben_ZA
dc.identifier.urihttps://hdl.handle.net/20.500.12185/550
dc.languagesAfrikaansen_ZA
dc.media.categoryWord embeddingen_ZA
dc.media.typeTexten_ZA
dc.publisherCentre for Text Technology (CTexT)en_ZA
dc.rights.licenseCreative Commons Attribution-Noncommercial 4.0 International (CC BY-NC 4.0): https://creativecommons.org/licenses/by-nc/4.0/en_ZA
dc.subjectWord embeddingsen_ZA
dc.subjectString embeddingsen_ZA
dc.subjecten_ZA
dc.subjecten_ZA
dc.subjectfastTexten_ZA
dc.subjecten_ZA
dc.titleCTexT Afrikaans fastText CBoW String Embeddingsen_ZA
dc.version0.1en_ZA
local.urlhttps://humanities.nwu.ac.za/ctexten_ZA

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
fastText.af.model.cbow-300.bin.zip
Size:
3.31 GB
Format:
ZIP is an archive file format that supports lossless data compression. A ZIP file may contain one or more files or directories that may have been compressed.
Description:

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
3.23 KB
Format:
Item-specific license agreed upon to submission
Description: