Department of Science, Technology and InnovationCLARIN in South Africa

Afri-XNLI+: Extending the XNLI Dataset with New African Language Translations

Loading...
Thumbnail Image

Date

2026

Authors

Kondoro, Alfred Malengo
Ibejih, Sharon
Amol, Cynthia
Keshinro, Damilare
Olusanya, Joy
Rudolf, Enyene
Popoola, Deborah
Akintola, Moronfoluwa
Mbici, Crispus Wanene
Imfurayase, Theogene

Journal Title

Journal ISSN

Volume Title

Publisher

Tonative (HiggingFace)

Abstract

Description

Afri-XNLI+ is a multilingual dataset consisting of translated premise-hypothesis sentence pairs from the XNLI development and test sets. The resource provides aligned translations for multiple African languages while preserving the original entailment, contradiction, and neutral labels. The dataset was created using a hybrid workflow combining machine translation assistance, human translation, and validation.

Citation

Kondoro, A. M., Ibejih, S., Amol, C., et al. (2026). XNLI Extension: Cross-lingual Sentence Pairs for African Languages. Tonative. Hugging Face. https://doi.org/10.57967/hf/7718

Verification status

Level 0