TFDS now supports the Croissant 🥐 format! Read the documentation to know more.

xtreme_xnli

Description:

This dataset contains machine translations of MNLI into each of the XNLI languages. The translation data is provided by XTREME. Note that this is different from the machine translated data provided by the original XNLI paper.

Homepage: https://www.nyu.edu/projects/bowman/xnli/
Source code: tfds.text.xtreme_xnli.XtremeXnli
Versions:
- 1.1.0 (default): No release notes.
Download size: 2.31 GiB
Dataset size: 1.59 GiB
Auto-cached (documentation): No
Splits:

Split	Examples
`'train'`	392,570

Feature structure:

FeaturesDict({
    'hypothesis': TranslationVariableLanguages({
        'language': Text(shape=(), dtype=string),
        'translation': Text(shape=(), dtype=string),
    }),
    'label': ClassLabel(shape=(), dtype=int64, num_classes=3),
    'premise': Translation({
        'ar': Text(shape=(), dtype=string),
        'bg': Text(shape=(), dtype=string),
        'de': Text(shape=(), dtype=string),
        'el': Text(shape=(), dtype=string),
        'en': Text(shape=(), dtype=string),
        'es': Text(shape=(), dtype=string),
        'fr': Text(shape=(), dtype=string),
        'hi': Text(shape=(), dtype=string),
        'ru': Text(shape=(), dtype=string),
        'sw': Text(shape=(), dtype=string),
        'th': Text(shape=(), dtype=string),
        'tr': Text(shape=(), dtype=string),
        'ur': Text(shape=(), dtype=string),
        'vi': Text(shape=(), dtype=string),
        'zh': Text(shape=(), dtype=string),
    }),
})

Feature documentation:

Feature	Class	Dtype
	FeaturesDict
hypothesis	TranslationVariableLanguages
hypothesis/language	Text	string
hypothesis/translation	Text	string
label	ClassLabel	int64
premise	Translation
premise/ar	Text	string
premise/bg	Text	string
premise/de	Text	string
premise/el	Text	string
premise/en	Text	string
premise/es	Text	string
premise/fr	Text	string
premise/hi	Text	string
premise/ru	Text	string
premise/sw	Text	string
premise/th	Text	string
premise/tr	Text	string
premise/ur	Text	string
premise/vi	Text	string
premise/zh	Text	string

Supervised keys (See as_supervised doc): None
Figure (tfds.show_examples): Not supported.
Examples (tfds.as_dataframe):

Citation:

@article{hu2020xtreme,
      author    = {Junjie Hu and Sebastian Ruder and Aditya Siddhant and Graham Neubig and Orhan Firat and Melvin Johnson},
      title     = {XTREME: A Massively Multilingual Multi-task Benchmark for Evaluating Cross-lingual Generalization},
      journal   = {CoRR},
      volume    = {abs/2003.11080},
      year      = {2020},
      archivePrefix = {arXiv},
      eprint    = {2003.11080}
}