CTexT fastText Skipgram String Embeddings
Please do not copy the URL from the browser for citation. The correct URL is 'https://hdl.handle.net/20.500.12185/554'
dc.contact.email | Roald.Eiselen@nwu.ac.za | en_ZA |
dc.contact.name | Roald Eiselen | en_ZA |
dc.contributor.author | Eiselen, Roald | |
dc.contributor.other | Eiselen, Roald | |
dc.date.accessioned | 2022-02-03T09:06:10Z | |
dc.date.available | 2022-02-03T09:06:10Z | |
dc.date.issued | 2022-01-10 | |
dc.description | The CTexT Afrikaans fastText Skipgram String Embeddings is a 300 dimensional Afrikaans embedding model based on the Skipgram fastText architecture that provides real-valued vector representations for Afrikaans text. The embedding was trained on a corpus of 230 million words. | en_ZA |
dc.format | Pickel | en_ZA |
dc.format.extent | 230 million words | en_ZA |
dc.format.medium | N/A | en_ZA |
dc.format.size | 3.31 Gb | en_ZA |
dc.identifier.uri | https://hdl.handle.net/20.500.12185/554 | |
dc.languages | Afrikaans | en_ZA |
dc.media.category | String embeddings | en_ZA |
dc.publisher | Centre for Text Technology (CTexT) | en_ZA |
dc.rights.license | Creative Commons Attribution-Noncommercial 4.0 International (CC BY-NC 4.0): https://creativecommons.org/licenses/by-nc/4.0/ | en_ZA |
dc.subject | String embeddings | en_ZA |
dc.subject | fastText | en_ZA |
dc.subject | Word embedding | en_ZA |
dc.title | CTexT fastText Skipgram String Embeddings | en_ZA |
dc.version | 0.1 | en_ZA |
Files
Original bundle
1 - 1 of 1
Loading...
- Name:
- fastText.af.model.skipgram-300.bin.zip
- Size:
- 3.31 GB
- Format:
- ZIP is an archive file format that supports lossless data compression. A ZIP file may contain one or more files or directories that may have been compressed.
License bundle
1 - 1 of 1
Loading...
- Name:
- license.txt
- Size:
- 3.23 KB
- Format:
- Item-specific license agreed upon to submission
- Description: