gsm8k

  • Description:

A dataset of 8.5K high quality linguistically diverse grade school math word problems.

Split Examples
'test' 1,319
'test_socratic' 1,319
'train' 7,473
'train_socratic' 7,473
  • Feature structure:
FeaturesDict({
    'annotation': Text(shape=(), dtype=string),
    'answer': Text(shape=(), dtype=string),
    'question': Text(shape=(), dtype=string),
    'short_answer': Text(shape=(), dtype=string),
})
  • Feature documentation:
Feature Class Shape Dtype Description
FeaturesDict
annotation Text string
answer Text string
question Text string
short_answer Text string
  • Citation:
@misc{cobbe2021training,
      title={Training Verifiers to Solve Math Word Problems},
      author={Karl Cobbe and Vineet Kosaraju and Mohammad Bavarian and Jacob Hilton and Reiichiro Nakano and Christopher Hesse and John Schulman},
      year={2021},
      eprint={2110.14168},
      archivePrefix={arXiv},
      primaryClass={cs.LG}
}