TFDS now supports the Croissant 🥐 format! Read the documentation to know more.

gsm8k

Description:

A dataset of 8.5K high quality linguistically diverse grade school math word problems.

Additional Documentation: Explore on Papers With Code
Homepage: https://github.com/openai/grade-school-math
Source code: tfds.text.gsm8k.Gsm8k
Versions:
- 1.0.0 (default): Initial release.
Download size: 10.77 MiB
Dataset size: 17.84 MiB
Auto-cached (documentation): Yes
Splits:

Split	Examples
`'test'`	1,319
`'test_socratic'`	1,319
`'train'`	7,473
`'train_socratic'`	7,473

Feature structure:

FeaturesDict({
    'annotation': Text(shape=(), dtype=string),
    'answer': Text(shape=(), dtype=string),
    'question': Text(shape=(), dtype=string),
    'short_answer': Text(shape=(), dtype=string),
})

Feature documentation:

Feature	Class	Dtype
	FeaturesDict
annotation	Text	string
answer	Text	string
question	Text	string
short_answer	Text	string

Supervised keys (See as_supervised doc): None
Figure (tfds.show_examples): Not supported.
Examples (tfds.as_dataframe):

Citation:

@misc{cobbe2021training,
      title={Training Verifiers to Solve Math Word Problems},
      author={Karl Cobbe and Vineet Kosaraju and Mohammad Bavarian and Jacob Hilton and Reiichiro Nakano and Christopher Hesse and John Schulman},
      year={2021},
      eprint={2110.14168},
      archivePrefix={arXiv},
      primaryClass={cs.LG}
}

gsm8k Stay organized with collections Save and categorize content based on your preferences.

gsm8k