Title: NCHLT isiXhosa fastText-CBoW embeddings
Authors: Roald Eiselen; Rico Koen; Albertus Kruger; Jacques van Heerden
License: Creative Commons Attribution 4.0 International (CC-BY 4.0)
Dates: 2023-07-28; 2023-05-01; 2023-07-28; 2023-05-01; 2023-05-01
Identifier: https://hdl.handle.net/20.500.12185/594
Language: xh
Type: Modules
Size: 3.97GB (Zipped)

Description: Static word and subword embeddings for the continuous bag of words (CBoW) flavour of the fastText architecture (Bojanowski et al., 2017). The embeddings provide real-valued vector representations for isiXhosa text.

Training data:
Paragraphs: 718,751
Token count: 13,190,962
Vocab size: 172,170
Embedding dimensions: 600
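
The description mentions subword embeddings, so a small illustration of the idea may help. The sketch below shows how a fastText-style model can split a word into character n-grams and average their vectors to represent a word, including one that was never seen in training. It is a simplified sketch, not this resource's actual code: the function names `char_ngrams` and `word_vector`, the toy vector table, and the n-gram range of 3 to 6 (the range used in Bojanowski et al., 2017) are assumptions, since the record does not state the settings used for this model. Real fastText also hashes n-grams into buckets and includes the whole word's own vector, which this sketch omits.

```python
def char_ngrams(word, min_n=3, max_n=6):
    """Return the character n-grams of `word`, wrapped in '<' and '>'.

    min_n and max_n are assumed values; the record does not state them.
    """
    wrapped = f"<{word}>"
    grams = []
    for n in range(min_n, max_n + 1):
        for start in range(len(wrapped) - n + 1):
            grams.append(wrapped[start:start + n])
    return grams


def word_vector(word, ngram_vectors, dim, min_n=3, max_n=6):
    """Average the vectors of the n-grams found in `ngram_vectors`.

    `ngram_vectors` is a hypothetical dict mapping n-gram -> list of floats.
    Returns a zero vector if none of the word's n-grams are known.
    """
    known = [ngram_vectors[g] for g in char_ngrams(word, min_n, max_n)
             if g in ngram_vectors]
    if not known:
        return [0.0] * dim
    return [sum(vec[i] for vec in known) / len(known) for i in range(dim)]


# Toy 2-dimensional table; the real embeddings have 600 dimensions.
toy_table = {"<wh": [1.0, 0.0], "re>": [0.0, 1.0]}
print(char_ngrams("where", 3, 3))
print(word_vector("where", toy_table, dim=2, min_n=3, max_n=3))
```

Because the vector is built from n-grams rather than looked up as a whole word, inflected or compounded forms that share n-grams with known words still receive a representation.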