quac

  • Description:

Question Answering in Context is a dataset for modeling, understanding, and participating in information seeking dialog. Data instances consist of an interactive dialog between two crowd workers: (1) a student who poses a sequence of freeform questions to learn as much as possible about a hidden Wikipedia text, and (2) a teacher who answers the questions by providing short excerpts (spans) from the text. QuAC introduces challenges not found in existing machine comprehension datasets: its questions are often more open-ended, unanswerable, or only meaningful within the dialog context.

Split Examples
'train' 83,568
'validation' 7,354
  • Feature structure:
FeaturesDict({
    'answers': Sequence({
        'answer_start': int32,
        'text': Text(shape=(), dtype=string),
    }),
    'background': Text(shape=(), dtype=string),
    'context': Text(shape=(), dtype=string),
    'followup': Text(shape=(), dtype=string),
    'orig_answer': FeaturesDict({
        'answer_start': int32,
        'text': Text(shape=(), dtype=string),
    }),
    'question': Text(shape=(), dtype=string),
    'section_title': Text(shape=(), dtype=string),
    'title': Text(shape=(), dtype=string),
    'yesno': Text(shape=(), dtype=string),
})
  • Feature documentation:
Feature Class Shape Dtype Description
FeaturesDict
answers Sequence
answers/answer_start Tensor int32
answers/text Text string
background Text string
context Text string
followup Text string
orig_answer FeaturesDict
orig_answer/answer_start Tensor int32
orig_answer/text Text string
question Text string
section_title Text string
title Text string
yesno Text string
  • Citation:
@article{choi2018quac,
  title={Quac: Question answering in context},
  author={Choi, Eunsol and He, He and Iyyer, Mohit and Yatskar, Mark and Yih, Wen-tau and Choi, Yejin and Liang, Percy and Zettlemoyer, Luke},
  journal={arXiv preprint arXiv:1808.07036},
  year={2018}
}