qasc
Stay organized with collections
Save and categorize content based on your preferences.
QASC is a question-answering dataset with a focus on sentence composition. It
consists of 9,980 8-way multiple-choice questions about grade school science
(8,134 train, 926 dev, 920 test), and comes with a corpus of 17M sentences.
Split |
Examples |
'test' |
920 |
'train' |
8,134 |
'validation' |
926 |
FeaturesDict({
'answerKey': Text(shape=(), dtype=string),
'choices': Sequence({
'label': Text(shape=(), dtype=string),
'text': Text(shape=(), dtype=string),
}),
'combinedfact': Text(shape=(), dtype=string),
'fact1': Text(shape=(), dtype=string),
'fact2': Text(shape=(), dtype=string),
'formatted_question': Text(shape=(), dtype=string),
'id': Text(shape=(), dtype=string),
'question': Text(shape=(), dtype=string),
})
Feature |
Class |
Shape |
Dtype |
Description |
|
FeaturesDict |
|
|
|
answerKey |
Text |
|
string |
|
choices |
Sequence |
|
|
|
choices/label |
Text |
|
string |
|
choices/text |
Text |
|
string |
|
combinedfact |
Text |
|
string |
|
fact1 |
Text |
|
string |
|
fact2 |
Text |
|
string |
|
formatted_question |
Text |
|
string |
|
id |
Text |
|
string |
|
question |
Text |
|
string |
|
@article{allenai:qasc,
author = {Tushar Khot and Peter Clark and Michal Guerquin and Peter Jansen and Ashish Sabharwal},
title = {QASC: A Dataset for Question Answering via Sentence Composition},
journal = {arXiv:1910.11473v2},
year = {2020},
}
Except as otherwise noted, the content of this page is licensed under the Creative Commons Attribution 4.0 License, and code samples are licensed under the Apache 2.0 License. For details, see the Google Developers Site Policies. Java is a registered trademark of Oracle and/or its affiliates.
Last updated 2022-12-20 UTC.
[null,null,["Last updated 2022-12-20 UTC."],[],[],null,["# qasc\n\n\u003cbr /\u003e\n\n- **Description**:\n\nQASC is a question-answering dataset with a focus on sentence composition. It\nconsists of 9,980 8-way multiple-choice questions about grade school science\n(8,134 train, 926 dev, 920 test), and comes with a corpus of 17M sentences.\n\n- **Additional Documentation** :\n [Explore on Papers With Code\n north_east](https://paperswithcode.com/dataset/qasc)\n\n- **Homepage** : \u003chttps://allenai.org/data/qasc\u003e\n\n- **Source code** :\n [`tfds.datasets.qasc.Builder`](https://github.com/tensorflow/datasets/tree/master/tensorflow_datasets/datasets/qasc/qasc_dataset_builder.py)\n\n- **Versions**:\n\n - **`0.1.0`** (default): No release notes.\n- **Download size** : `1.54 MiB`\n\n- **Dataset size** : `6.61 MiB`\n\n- **Auto-cached**\n ([documentation](https://www.tensorflow.org/datasets/performances#auto-caching)):\n Yes\n\n- **Splits**:\n\n| Split | Examples |\n|----------------|----------|\n| `'test'` | 920 |\n| `'train'` | 8,134 |\n| `'validation'` | 926 |\n\n- **Feature structure**:\n\n FeaturesDict({\n 'answerKey': Text(shape=(), dtype=string),\n 'choices': Sequence({\n 'label': Text(shape=(), dtype=string),\n 'text': Text(shape=(), dtype=string),\n }),\n 'combinedfact': Text(shape=(), dtype=string),\n 'fact1': Text(shape=(), dtype=string),\n 'fact2': Text(shape=(), dtype=string),\n 'formatted_question': Text(shape=(), dtype=string),\n 'id': Text(shape=(), dtype=string),\n 'question': Text(shape=(), dtype=string),\n })\n\n- **Feature documentation**:\n\n| Feature | Class | Shape | Dtype | Description |\n|--------------------|--------------|-------|--------|-------------|\n| | FeaturesDict | | | |\n| answerKey | Text | | string | |\n| choices | Sequence | | | |\n| choices/label | Text | | string | |\n| choices/text | Text | | string | |\n| combinedfact | Text | | string | |\n| fact1 | Text | | string | |\n| fact2 | Text | | string | |\n| formatted_question | Text | | string | |\n| id | Text | | string | |\n| question | Text | | string | |\n\n- **Supervised keys** (See\n [`as_supervised` doc](https://www.tensorflow.org/datasets/api_docs/python/tfds/load#args)):\n `None`\n\n- **Figure**\n ([tfds.show_examples](https://www.tensorflow.org/datasets/api_docs/python/tfds/visualization/show_examples)):\n Not supported.\n\n- **Examples**\n ([tfds.as_dataframe](https://www.tensorflow.org/datasets/api_docs/python/tfds/as_dataframe)):\n\nDisplay examples... \n\n- **Citation**:\n\n @article{allenai:qasc,\n author = {Tushar Khot and Peter Clark and Michal Guerquin and Peter Jansen and Ashish Sabharwal},\n title = {QASC: A Dataset for Question Answering via Sentence Composition},\n journal = {arXiv:1910.11473v2},\n year = {2020},\n }"]]