natural_instructions
Stay organized with collections
Save and categorize content based on your preferences.
Description
A compilation of 1600+ tasks phrased as natural instructions. The original task
collection can be found at: https://github.com/allenai/natural-instructions No
preprocessing or changes were made to this original version.
Note that users of this task collection should consult the underlying licenses
of the contained datasets, and cite them accordingly.
Split |
Examples |
'train' |
5,040,134 |
FeaturesDict({
'definition': Text(shape=(), dtype=string),
'id': Text(shape=(), dtype=string),
'input': Text(shape=(), dtype=string),
'output': Text(shape=(), dtype=string),
'source': Text(shape=(), dtype=string),
'task_name': Text(shape=(), dtype=string),
})
Feature |
Class |
Shape |
Dtype |
Description |
|
FeaturesDict |
|
|
|
definition |
Text |
|
string |
|
id |
Text |
|
string |
|
input |
Text |
|
string |
|
output |
Text |
|
string |
|
source |
Text |
|
string |
|
task_name |
Text |
|
string |
|
@article{wang2022benchmarking,
title={Benchmarking generalization via in-context instructions on 1,600+ language tasks},
author={Wang, Yizhong and Mishra, Swaroop and Alipoormolabashi, Pegah and Kordi, Yeganeh and Mirzaei, Amirreza and Arunkumar, Anjana and Ashok, Arjun and Dhanasekaran, Arut Selvan and Naik, Atharva and Stap, David and others},
journal={arXiv preprint arXiv:2204.07705},
year={2022}
}
Except as otherwise noted, the content of this page is licensed under the Creative Commons Attribution 4.0 License, and code samples are licensed under the Apache 2.0 License. For details, see the Google Developers Site Policies. Java is a registered trademark of Oracle and/or its affiliates.
Last updated 2022-12-22 UTC.
[null,null,["Last updated 2022-12-22 UTC."],[],[],null,["# natural_instructions\n\n\u003cbr /\u003e\n\n- **Description**:\n\nDescription\n===========\n\nA compilation of 1600+ tasks phrased as natural instructions. The original task\ncollection can be found at: \u003chttps://github.com/allenai/natural-instructions\u003e No\npreprocessing or changes were made to this original version.\n\nNote that users of this task collection should consult the underlying licenses\nof the contained datasets, and cite them accordingly.\n\n- **Homepage** :\n \u003chttps://github.com/allenai/natural-instructions\u003e\n\n- **Source code** :\n [`tfds.datasets.natural_instructions.Builder`](https://github.com/tensorflow/datasets/tree/master/tensorflow_datasets/datasets/natural_instructions/natural_instructions_dataset_builder.py)\n\n- **Versions**:\n\n - `1.0.0`: Initial release.\n - **`1.0.1`** (default): Added task name field, and fixed ID used for shuffling to use stable IDs.\n- **Download size** : `3.08 GiB`\n\n- **Dataset size** : `4.73 GiB`\n\n- **Auto-cached**\n ([documentation](https://www.tensorflow.org/datasets/performances#auto-caching)):\n No\n\n- **Splits**:\n\n| Split | Examples |\n|-----------|-----------|\n| `'train'` | 5,040,134 |\n\n- **Feature structure**:\n\n FeaturesDict({\n 'definition': Text(shape=(), dtype=string),\n 'id': Text(shape=(), dtype=string),\n 'input': Text(shape=(), dtype=string),\n 'output': Text(shape=(), dtype=string),\n 'source': Text(shape=(), dtype=string),\n 'task_name': Text(shape=(), dtype=string),\n })\n\n- **Feature documentation**:\n\n| Feature | Class | Shape | Dtype | Description |\n|------------|--------------|-------|--------|-------------|\n| | FeaturesDict | | | |\n| definition | Text | | string | |\n| id | Text | | string | |\n| input | Text | | string | |\n| output | Text | | string | |\n| source | Text | | string | |\n| task_name | Text | | string | |\n\n- **Supervised keys** (See\n [`as_supervised` doc](https://www.tensorflow.org/datasets/api_docs/python/tfds/load#args)):\n `None`\n\n- **Figure**\n ([tfds.show_examples](https://www.tensorflow.org/datasets/api_docs/python/tfds/visualization/show_examples)):\n Not supported.\n\n- **Examples**\n ([tfds.as_dataframe](https://www.tensorflow.org/datasets/api_docs/python/tfds/as_dataframe)):\n\nDisplay examples... \n\n- **Citation**:\n\n @article{wang2022benchmarking,\n title={Benchmarking generalization via in-context instructions on 1,600+ language tasks},\n author={Wang, Yizhong and Mishra, Swaroop and Alipoormolabashi, Pegah and Kordi, Yeganeh and Mirzaei, Amirreza and Arunkumar, Anjana and Ashok, Arjun and Dhanasekaran, Arut Selvan and Naik, Atharva and Stap, David and others},\n journal={arXiv preprint arXiv:2204.07705},\n year={2022}\n }"]]