Department of Science, Technology and Innovation
CLARIN in South Africa

Afri-XNLI+: Extending the XNLI Dataset with New African Language Translations

dc.contact.email: alfr3do@hanyang.ac.kr
dc.contact.name: Alfred Malengo Kondoro
dc.contributor.author: Kondoro, Alfred Malengo
dc.contributor.author: Ibejih, Sharon
dc.contributor.author: Amol, Cynthia
dc.contributor.author: Keshinro, Damilare
dc.contributor.author: Olusanya, Joy
dc.contributor.author: Rudolf, Enyene
dc.contributor.author: Popoola, Deborah
dc.contributor.author: Akintola, Moronfoluwa
dc.contributor.author: Mbici, Crispus Wanene
dc.contributor.author: Imfurayase, Theogene
dc.contributor.author: Adam, Faisal
dc.contributor.author: Etokhe, Uyemhi
dc.contributor.author: Muhammad, Sunusi
dc.contributor.author: Alhassan, Munzali
dc.contributor.author: Amalagu, Chikezie
dc.contributor.author: Ebube, Nwaigbo
dc.contributor.author: Anikwenze, Chinenye
dc.contributor.author: Ulasi, Esther
dc.contributor.author: Mwende, Mary
dc.contributor.author: Nga'ang'a, Wambui
dc.contributor.author: Juma, Diana
dc.contributor.author: Ngai, Catherine
dc.contributor.author: Wangui, Gladys
dc.contributor.author: Mugo, Simon
dc.contributor.author: Ndau, Esther
dc.contributor.author: Kiiru, Consolata
dc.contributor.author: Githinji, Benedict
dc.contributor.author: Murage, Moureen
dc.contributor.author: Gachie, John
dc.contributor.author: Maxime, Shema
dc.contributor.author: Pretty, Leah
dc.contributor.author: Dufitumukiza, Chaloom
dc.contributor.author: Mbarushimana, Vedaste
dc.contributor.author: Janinne, Kanyambo
dc.contributor.author: Ishimwe, Kefa
dc.contributor.author: Tuyishimire, Regis
dc.contributor.author: Mukasekuru, Joselyne
dc.contributor.author: Odhiambo, Nelson
dc.contributor.author: Auma, Judith
dc.contributor.author: Muga, John William
dc.contributor.author: Obaje, Margaret
dc.contributor.author: Odhiambo, Johanes
dc.contributor.author: Mula, Geophrey
dc.contributor.author: God'spraise, Okechukwu
dc.contributor.author: Buliaminu, Odunayo
dc.contributor.author: Peter, Teresa
dc.contributor.author: Valerie, Mary
dc.contributor.author: Muthungu, Simon
dc.contributor.author: Kituyi, John
dc.contributor.author: Sintamei, Scovia
dc.contributor.author: Munira, Anne
dc.contributor.author: Maina, Ezekiel
dc.contributor.author: Odhiambo, Sonia
dc.contributor.author: Amouk, Judith
dc.contributor.author: Nyaboke, Britney
dc.contributor.author: Onkoba, Edwin
dc.contributor.author: Njenga, Mary
dc.contributor.author: Amouk, Priscilla
dc.contributor.author: Andati, Lexy
dc.contributor.author: Akoth, Dorothy
dc.contributor.author: Kinyaiya, Gladness
dc.contributor.author: Okech, Martin
dc.contributor.author: Omino, Peter
dc.contributor.author: Otiende, Verrah
dc.contributor.author: Tome, Meshack
dc.contributor.author: Mwereza, Ephraim
dc.contributor.author: Kituku, Edward
dc.contributor.author: Ikiki, Sally
dc.contributor.author: Wangatya, Collins
dc.contributor.author: Wasonga, Chlaris
dc.contributor.author: Kessy, Prisila
dc.contributor.author: Adebayo, Olajumoke
dc.contributor.author: Odeyale, Kehinde
dc.contributor.author: Praise, Adeyemi
dc.contributor.author: Abisoye, Esther
dc.contributor.author: Adelakun, Sulaimon
dc.contributor.author: Ige, Theophilus
dc.contributor.author: Oyewole, Abisola
dc.contributor.author: Bankole, Barakat
dc.contributor.author: Muthomi, Festus
dc.contributor.author: Anih, Mercy
dc.contributor.author: Lekan, Idris
dc.date.accessioned: 2026-03-06T19:20:06Z
dc.date.available: 2026-03-06T19:20:06Z
dc.date.issued: 2026
dc.description: Afri-XNLI+ is a multilingual dataset of translated premise-hypothesis sentence pairs drawn from the XNLI development and test sets. It provides aligned translations for multiple African languages while preserving the original entailment, contradiction, and neutral labels. The dataset was created with a hybrid workflow that combines machine translation assistance, human translation, and validation.
dc.format: parquet
dc.format.extent: Thousands of sentence pairs (aligned premise-hypothesis pairs); ongoing expansion
dc.format.size: 4.17 MB
dc.identifier.citation: Kondoro, A. M., Ibejih, S., Amol, C., et al. (2026). XNLI Extension: Cross-lingual Sentence Pairs for African Languages. Tonative. Hugging Face. https://doi.org/10.57967/hf/7718
dc.identifier.uri: https://hdl.handle.net/20.500.12185/695
dc.languages: English
dc.languages: Yoruba
dc.languages: Other
dc.languages.other: Hausa, Igbo, Kikuyu, Kinyarwanda, Luo, Nigerian Pidgin
dc.media.type: Text
dc.publisher: Tonative (Hugging Face)
dc.rights.license: Creative Commons Attribution 4.0 International (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/deed.en
dc.subject: African languages
dc.subject: Natural Language Inference
dc.subject: multilingual NLP
dc.subject: machine translation
dc.subject: language resources
dc.subject: cross-lingual evaluation
dc.subject: dataset
dc.title: Afri-XNLI+: Extending the XNLI Dataset with New African Language Translations
dc.version: 1.0

Files

Original bundle

Name: xnli-extension.zip
Size: 4.17 MB
Format: ZIP, an archive file format that supports lossless data compression. A ZIP file may contain one or more files or directories that may have been compressed.
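
The record lists the data format as parquet but distributes it as a single ZIP archive, so a first step for most users is to see which parquet files the archive holds. The following is a minimal sketch using only the Python standard library. The record does not document the archive's internal layout, so the function names, the local file path, and the assumption that members carry a `.parquet` extension are all illustrative.

```python
import zipfile
from pathlib import PurePosixPath


def list_parquet_members(archive_path):
    """Return the sorted names of .parquet members inside a ZIP archive.

    Assumption: data files use a ".parquet" extension (any letter case).
    """
    with zipfile.ZipFile(archive_path) as archive:
        return sorted(
            name
            for name in archive.namelist()
            if PurePosixPath(name).suffix.lower() == ".parquet"
        )


def extract_parquet_members(archive_path, target_dir):
    """Extract only the .parquet members and return the extracted paths."""
    names = list_parquet_members(archive_path)
    with zipfile.ZipFile(archive_path) as archive:
        return [archive.extract(name, path=target_dir) for name in names]


if __name__ == "__main__":
    # Hypothetical local path to the downloaded bundle.
    for member in list_parquet_members("xnli-extension.zip"):
        print(member)
```

Once extracted, the files can be opened with any parquet reader, for example `pandas.read_parquet` with the `pyarrow` engine installed. The column names are not stated in this record, so it is safer to inspect them after loading than to assume a premise/hypothesis/label schema.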

License bundle

Name: license.txt
Size: 3.22 KB
Format: Item-specific license agreed to upon submission