drop
Stay organized with collections
Save and categorize content based on your preferences.
With system performance on existing reading comprehension benchmarks nearing or
surpassing human performance, we need a new, hard dataset that improves systems'
capabilities to actually read paragraphs of text. DROP is a crowdsourced,
adversarially-created, 96k-question benchmark, in which a system must resolve
references in a question, perhaps to multiple input positions, and perform
discrete operations over them (such as addition, counting, or sorting). These
operations require a much more comprehensive understanding of the content of
paragraphs than what was necessary for prior datasets.
Split |
Examples |
'dev' |
9,536 |
'train' |
77,409 |
FeaturesDict({
'answer': Text(shape=(), dtype=string),
'passage': Text(shape=(), dtype=string),
'query_id': Text(shape=(), dtype=string),
'question': Text(shape=(), dtype=string),
'validated_answers': Sequence(Text(shape=(), dtype=string)),
})
Feature |
Class |
Shape |
Dtype |
Description |
|
FeaturesDict |
|
|
|
answer |
Text |
|
string |
|
passage |
Text |
|
string |
|
query_id |
Text |
|
string |
|
question |
Text |
|
string |
|
validated_answers |
Sequence(Text) |
(None,) |
string |
|
@inproceedings{Dua2019DROP,
author={Dheeru Dua and Yizhong Wang and Pradeep Dasigi and Gabriel Stanovsky and Sameer Singh and Matt Gardner},
title={ {DROP}: A Reading Comprehension Benchmark Requiring Discrete Reasoning Over Paragraphs},
booktitle={Proc. of NAACL},
year={2019}
}
Except as otherwise noted, the content of this page is licensed under the Creative Commons Attribution 4.0 License, and code samples are licensed under the Apache 2.0 License. For details, see the Google Developers Site Policies. Java is a registered trademark of Oracle and/or its affiliates.
Last updated 2022-12-06 UTC.
[null,null,["Last updated 2022-12-06 UTC."],[],[],null,["# drop\n\n\u003cbr /\u003e\n\n- **Description**:\n\nWith system performance on existing reading comprehension benchmarks nearing or\nsurpassing human performance, we need a new, hard dataset that improves systems'\ncapabilities to actually read paragraphs of text. DROP is a crowdsourced,\nadversarially-created, 96k-question benchmark, in which a system must resolve\nreferences in a question, perhaps to multiple input positions, and perform\ndiscrete operations over them (such as addition, counting, or sorting). These\noperations require a much more comprehensive understanding of the content of\nparagraphs than what was necessary for prior datasets.\n\n- **Additional Documentation** :\n [Explore on Papers With Code\n north_east](https://paperswithcode.com/dataset/drop)\n\n- **Homepage** : \u003chttps://allennlp.org/drop\u003e\n\n- **Source code** :\n [`tfds.text.drop.Drop`](https://github.com/tensorflow/datasets/tree/master/tensorflow_datasets/text/drop/drop.py)\n\n- **Versions**:\n\n - `1.0.0`: Initial release.\n - **`2.0.0`** (default): Add all options for the answers.\n- **Download size** : `7.92 MiB`\n\n- **Dataset size** : `116.24 MiB`\n\n- **Auto-cached**\n ([documentation](https://www.tensorflow.org/datasets/performances#auto-caching)):\n Yes\n\n- **Splits**:\n\n| Split | Examples |\n|-----------|----------|\n| `'dev'` | 9,536 |\n| `'train'` | 77,409 |\n\n- **Feature structure**:\n\n FeaturesDict({\n 'answer': Text(shape=(), dtype=string),\n 'passage': Text(shape=(), dtype=string),\n 'query_id': Text(shape=(), dtype=string),\n 'question': Text(shape=(), dtype=string),\n 'validated_answers': Sequence(Text(shape=(), dtype=string)),\n })\n\n- **Feature documentation**:\n\n| Feature | Class | Shape | Dtype | Description |\n|-------------------|----------------|---------|--------|-------------|\n| | FeaturesDict | | | |\n| answer | Text | | string | |\n| passage | Text | | string | |\n| query_id | Text | | string | |\n| question | Text | | string | |\n| validated_answers | Sequence(Text) | (None,) | string | |\n\n- **Supervised keys** (See\n [`as_supervised` doc](https://www.tensorflow.org/datasets/api_docs/python/tfds/load#args)):\n `None`\n\n- **Figure**\n ([tfds.show_examples](https://www.tensorflow.org/datasets/api_docs/python/tfds/visualization/show_examples)):\n Not supported.\n\n- **Examples**\n ([tfds.as_dataframe](https://www.tensorflow.org/datasets/api_docs/python/tfds/as_dataframe)):\n\nDisplay examples... \n\n- **Citation**:\n\n @inproceedings{Dua2019DROP,\n author={Dheeru Dua and Yizhong Wang and Pradeep Dasigi and Gabriel Stanovsky and Sameer Singh and Matt Gardner},\n title={ {DROP}: A Reading Comprehension Benchmark Requiring Discrete Reasoning Over Paragraphs},\n booktitle={Proc. of NAACL},\n year={2019}\n }"]]