- Description:
A dataset of 8.5K high quality linguistically diverse grade school math word problems.
Additional Documentation: Explore on Papers With Code
Source code:
tfds.text.gsm8k.Gsm8kVersions:
1.0.0(default): Initial release.
Download size:
10.77 MiBDataset size:
17.84 MiBAuto-cached (documentation): Yes
Splits:
| Split | Examples |
|---|---|
'test' |
1,319 |
'test_socratic' |
1,319 |
'train' |
7,473 |
'train_socratic' |
7,473 |
- Feature structure:
FeaturesDict({
'annotation': Text(shape=(), dtype=string),
'answer': Text(shape=(), dtype=string),
'question': Text(shape=(), dtype=string),
'short_answer': Text(shape=(), dtype=string),
})
- Feature documentation:
| Feature | Class | Shape | Dtype | Description |
|---|---|---|---|---|
| FeaturesDict | ||||
| annotation | Text | string | ||
| answer | Text | string | ||
| question | Text | string | ||
| short_answer | Text | string |
Supervised keys (See
as_superviseddoc):NoneFigure (tfds.show_examples): Not supported.
Examples (tfds.as_dataframe):
- Citation:
@misc{cobbe2021training,
title={Training Verifiers to Solve Math Word Problems},
author={Karl Cobbe and Vineet Kosaraju and Mohammad Bavarian and Jacob Hilton and Reiichiro Nakano and Christopher Hesse and John Schulman},
year={2021},
eprint={2110.14168},
archivePrefix={arXiv},
primaryClass={cs.LG}
}