# natural_questions_open
The NQ-Open task, introduced by Lee et al. (2019), is an open-domain question
answering benchmark derived from Natural Questions. The goal is to predict an
English answer string for an input English question. All questions can be
answered using the contents of English Wikipedia.
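The dataset ships as a TFDS builder, so it can be loaded with the standard `tfds.load` API. A minimal sketch (the first call downloads and prepares the data; split names are the `'train'` and `'validation'` splits of this dataset):

```python
import tensorflow_datasets as tfds  # pip install tensorflow-datasets

# First call downloads (~8.50 MiB) and prepares the dataset locally.
ds_train, ds_val = tfds.load('natural_questions_open',
                             split=['train', 'validation'])

for example in ds_train.take(1):
    print(example['question'])  # scalar tf.string tensor
    print(example['answer'])    # variable-length tf.string tensor
```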
**Splits**:

| Split          | Examples |
|----------------|----------|
| `'train'`      | 87,925   |
| `'validation'` | 3,610    |
**Feature structure**:

    FeaturesDict({
        'answer': Sequence(string),
        'question': string,
    })
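Concretely, each decoded example pairs one question string with a variable-length list of acceptable answer strings. A sketch of what such a record looks like (the specific question and answers shown are illustrative, not guaranteed to appear verbatim in the data):

```python
# A hypothetical decoded example, mirroring the FeaturesDict above:
# 'question' is a single string; 'answer' is a variable-length
# sequence of acceptable answer strings.
example = {
    "question": "when was the last time anyone was on the moon",
    "answer": ["December 1972", "14 December 1972"],
}

assert isinstance(example["question"], str)
assert all(isinstance(a, str) for a in example["answer"])
```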
**Feature documentation**:

| Feature  | Class            | Shape   | Dtype  | Description |
|----------|------------------|---------|--------|-------------|
|          | FeaturesDict     |         |        |             |
| answer   | Sequence(Tensor) | (None,) | string |             |
| question | Tensor           |         | string |             |
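Because `answer` holds several acceptable strings, systems on NQ-Open are conventionally scored by exact match against *any* reference after light normalization (lowercasing, stripping punctuation and articles). A minimal sketch of that convention; the normalization recipe here is the common SQuAD-style one and is an assumption, not part of this dataset:

```python
import re
import string

def normalize(text: str) -> str:
    """Lowercase, strip punctuation and articles, collapse whitespace."""
    text = text.lower()
    text = "".join(ch for ch in text if ch not in string.punctuation)
    text = re.sub(r"\b(a|an|the)\b", " ", text)
    return " ".join(text.split())

def exact_match(prediction: str, answers: list[str]) -> bool:
    """True if the normalized prediction equals any normalized reference."""
    pred = normalize(prediction)
    return any(pred == normalize(a) for a in answers)

print(exact_match("The Moon!", ["moon"]))      # True
print(exact_match("mars", ["moon", "luna"]))   # False
```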
**Citation**:

@inproceedings{orqa,
  title = {Latent Retrieval for Weakly Supervised Open Domain Question Answering},
  author = {Lee, Kenton and Chang, Ming-Wei and Toutanova, Kristina},
  year = {2019},
  month = {01},
  pages = {6086--6096},
  booktitle = {Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics},
  doi = {10.18653/v1/P19-1612}
}
@article{47761,
  title = {Natural Questions: a Benchmark for Question Answering Research},
  author = {Tom Kwiatkowski and Jennimaria Palomaki and Olivia Redfield and Michael Collins and Ankur Parikh and Chris Alberti and Danielle Epstein and Illia Polosukhin and Matthew Kelcey and Jacob Devlin and Kenton Lee and Kristina N. Toutanova and Llion Jones and Ming-Wei Chang and Andrew Dai and Jakob Uszkoreit and Quoc Le and Slav Petrov},
  year = {2019},
  journal = {Transactions of the Association for Computational Linguistics}
}
**Homepage**:
<https://github.com/google-research-datasets/natural-questions/tree/master/nq_open>

**Source code**:
[`tfds.datasets.natural_questions_open.Builder`](https://github.com/tensorflow/datasets/tree/master/tensorflow_datasets/datasets/natural_questions_open/natural_questions_open_dataset_builder.py)

**Versions**:
`1.0.0` (default): No release notes.

**Download size**: `8.50 MiB`

**Dataset size**: `8.70 MiB`

**Auto-cached** ([documentation](https://www.tensorflow.org/datasets/performances#auto-caching)): Yes

**Supervised keys** (see [`as_supervised` doc](https://www.tensorflow.org/datasets/api_docs/python/tfds/load#args)): `None`

**Figure** ([tfds.show_examples](https://www.tensorflow.org/datasets/api_docs/python/tfds/visualization/show_examples)): Not supported.

Except as otherwise noted, the content of this page is licensed under the Creative Commons Attribution 4.0 License, and code samples are licensed under the Apache 2.0 License. For details, see the Google Developers Site Policies. Java is a registered trademark of Oracle and/or its affiliates.

Last updated 2022-12-14 UTC.