hellaswag
The HellaSwag dataset is a benchmark for Commonsense NLI. It includes a context
and some endings which complete the context.
| Split                          | Examples |
|--------------------------------|----------|
| `'test'`                       | 10,003   |
| `'test_ind_activitynet'`       | 1,870    |
| `'test_ind_wikihow'`           | 3,132    |
| `'test_ood_activitynet'`       | 1,651    |
| `'test_ood_wikihow'`           | 3,350    |
| `'train'`                      | 39,905   |
| `'train_activitynet'`          | 14,740   |
| `'train_wikihow'`              | 25,165   |
| `'validation'`                 | 10,042   |
| `'validation_ind_activitynet'` | 1,809    |
| `'validation_ind_wikihow'`     | 3,192    |
| `'validation_ood_activitynet'` | 1,434    |
| `'validation_ood_wikihow'`     | 3,607    |
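The split counts above suggest that each top-level split is the union of its finer-grained splits. The page does not state this explicitly, so the check below is only a sketch under that assumption: it copies the counts from the table and compares each listed total with the sum of the splits that share its prefix. The helper names `sub_splits` and `sub_split_total` are illustrative and not part of any library.

```python
from typing import Dict, List

# Example counts copied from the split table above.
SPLIT_COUNTS: Dict[str, int] = {
    "test": 10_003,
    "test_ind_activitynet": 1_870,
    "test_ind_wikihow": 3_132,
    "test_ood_activitynet": 1_651,
    "test_ood_wikihow": 3_350,
    "train": 39_905,
    "train_activitynet": 14_740,
    "train_wikihow": 25_165,
    "validation": 10_042,
    "validation_ind_activitynet": 1_809,
    "validation_ind_wikihow": 3_192,
    "validation_ood_activitynet": 1_434,
    "validation_ood_wikihow": 3_607,
}


def sub_splits(parent: str) -> List[str]:
    """Return the split names that start with `parent` followed by an underscore."""
    return [name for name in SPLIT_COUNTS if name.startswith(parent + "_")]


def sub_split_total(parent: str) -> int:
    """Sum the example counts over all finer-grained splits of `parent`."""
    return sum(SPLIT_COUNTS[name] for name in sub_splits(parent))


for parent in ("train", "validation", "test"):
    listed = SPLIT_COUNTS[parent]
    summed = sub_split_total(parent)
    print(f"{parent}: listed={listed}, summed over sub-splits={summed}")
```

For all three top-level splits the summed count matches the listed count, which is consistent with (though not proof of) the finer-grained splits partitioning their parent.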
FeaturesDict({
'activity_label': Text(shape=(), dtype=string),
'context': Text(shape=(), dtype=string),
'endings': Sequence(Text(shape=(), dtype=string)),
'label': int32,
'source_id': Text(shape=(), dtype=string),
'split_type': Text(shape=(), dtype=string),
})
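To illustrate how an example with this feature structure might be consumed, the sketch below joins `context` with each entry of `endings`, picks the highest-scoring candidate, and compares it with `label`. It assumes `label` is the index of the correct ending within `endings`, which this page does not spell out. The record `TOY_EXAMPLE`, its field values, and the `toy_score` function are hypothetical stand-ins rather than real dataset content or a real model. Records loaded through `tfds.load("hellaswag")` would typically carry string fields as bytes and need decoding first.

```python
from typing import Callable, Iterable, List, Mapping


def candidates(example: Mapping) -> List[str]:
    """Join the context with each ending to form the full candidate texts."""
    return [f"{example['context']} {ending}" for ending in example["endings"]]


def predict(example: Mapping, score: Callable[[str], float]) -> int:
    """Return the index of the candidate text with the highest score."""
    texts = candidates(example)
    return max(range(len(texts)), key=lambda i: score(texts[i]))


def accuracy(examples: Iterable[Mapping], score: Callable[[str], float]) -> float:
    """Fraction of examples whose predicted index equals the `label` field."""
    examples = list(examples)
    correct = sum(predict(ex, score) == ex["label"] for ex in examples)
    return correct / len(examples)


# Hypothetical record with the six documented fields; all values are invented.
TOY_EXAMPLE = {
    "activity_label": "Washing dishes",
    "context": "A man fills the sink with soapy water.",
    "endings": [
        "He scrubs a plate with a sponge.",
        "He paints the ceiling blue.",
        "He rides a bicycle across the lawn.",
        "He folds a paper airplane.",
    ],
    "label": 0,  # assumed to index into `endings`
    "source_id": "toy~0",
    "split_type": "toy",
}


def toy_score(text: str) -> float:
    """Stand-in scorer: a real setup would use a language model's score."""
    return 1.0 if "sponge" in text else 0.0


print(predict(TOY_EXAMPLE, toy_score))
print(accuracy([TOY_EXAMPLE], toy_score))
```

Keeping the scorer as a plain callable means the same loop could wrap any model that assigns a number to a piece of text.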
| Feature        | Class          | Shape   | Dtype  | Description |
|----------------|----------------|---------|--------|-------------|
|                | FeaturesDict   |         |        |             |
| activity_label | Text           |         | string |             |
| context        | Text           |         | string |             |
| endings        | Sequence(Text) | (None,) | string |             |
| label          | Tensor         |         | int32  |             |
| source_id      | Text           |         | string |             |
| split_type     | Text           |         | string |             |
@inproceedings{zellers2019hellaswag,
title={HellaSwag: Can a Machine Really Finish Your Sentence?},
author={Zellers, Rowan and Holtzman, Ari and Bisk, Yonatan and Farhadi, Ali and Choi, Yejin},
booktitle ={Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics},
year={2019}
}
Last updated 2022-12-06 UTC.
- **Additional Documentation**: [Explore on Papers With Code](https://paperswithcode.com/dataset/hellaswag)
- **Homepage**: <https://rowanzellers.com/hellaswag/>
- **Source code**: [`tfds.text.Hellaswag`](https://github.com/tensorflow/datasets/tree/master/tensorflow_datasets/text/hellaswag.py)
- **Versions**:
  - `0.0.1`: No release notes.
  - `1.0.0`: Adding separate splits for in-domain and out-of-domain validation/test sets.
  - **`1.1.0`** (default): Another split dimension for source (wikihow vs activitynet)
- **Download size**: `68.18 MiB`
- **Dataset size**: `107.45 MiB`
- **Auto-cached** ([documentation](https://www.tensorflow.org/datasets/performances#auto-caching)): Yes
- **Supervised keys** (See [`as_supervised` doc](https://www.tensorflow.org/datasets/api_docs/python/tfds/load#args)): `None`
- **Figure** ([tfds.show_examples](https://www.tensorflow.org/datasets/api_docs/python/tfds/visualization/show_examples)): Not supported.