- Description:
This dataset consists of 6,642 question/answer pairs. The questions are intended to be answerable using Freebase, a large knowledge graph. Most questions center on a single named entity, and all are popular questions asked on the web (at least as of 2013).
Homepage: https://worksheets.codalab.org/worksheets/0xba659fe363cb46e7a505c5b6a774dc8a
Source code: tfds.question_answering.WebQuestions
Versions: 1.0.0 (default): No release notes.
Download size: 1.21 MiB
Dataset size: 983.88 KiB
Auto-cached (documentation): Yes
Splits:
Split | Examples |
---|---|
'test' | 2,032 |
'train' | 3,778 |
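The table lists only 'train' and 'test' splits, so any development set has to be carved out of 'train'. The following is a minimal sketch of one way to do that with a seeded shuffle over example indices. The helper name `holdout_indices`, the 10% holdout fraction, and the seed are illustrative assumptions, not part of the dataset or of TFDS.

```python
import random

TRAIN_SIZE = 3778  # number of 'train' examples, taken from the splits table


def holdout_indices(n_examples, holdout_fraction=0.1, seed=0):
    """Return (train_indices, dev_indices) from a seeded shuffle.

    Hypothetical helper: the fraction and seed are arbitrary choices.
    """
    indices = list(range(n_examples))
    # A dedicated Random instance keeps the split reproducible
    # without touching the global random state.
    random.Random(seed).shuffle(indices)
    n_holdout = int(n_examples * holdout_fraction)
    return indices[n_holdout:], indices[:n_holdout]


train_idx, dev_idx = holdout_indices(TRAIN_SIZE)
```

Fixing the seed keeps the two index sets stable across runs, so results measured on the held-out portion stay comparable.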
- Feature structure:
FeaturesDict({
    'answers': Sequence(Text(shape=(), dtype=string)),
    'question': Text(shape=(), dtype=string),
    'url': Text(shape=(), dtype=string),
})
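Once converted to Python or NumPy values, a record with this structure can be treated as a plain dictionary with one question, one URL, and a list of acceptable answers. The sketch below shows one possible way to check a predicted answer against that list. It is an illustration only: the sample record is hand-written rather than taken from the dataset, the helper names `to_text`, `normalize`, and `is_correct` are introduced here, and it assumes string fields may arrive as UTF-8 bytes.

```python
import string


def to_text(value):
    """Decode UTF-8 bytes to str; pass str values through unchanged."""
    return value.decode("utf-8") if isinstance(value, bytes) else value


def normalize(text):
    """Lowercase, drop ASCII punctuation, and collapse whitespace."""
    text = to_text(text).lower()
    text = text.translate(str.maketrans("", "", string.punctuation))
    return " ".join(text.split())


def is_correct(prediction, record):
    """True if the prediction matches any gold answer after normalization."""
    gold = {normalize(answer) for answer in record["answers"]}
    return normalize(prediction) in gold


# Hand-written placeholder record mirroring the feature structure;
# the values are NOT real dataset content.
sample = {
    "answers": [b"Example Answer", b"Another Answer"],
    "question": b"what is an example question?",
    "url": b"http://example.com/placeholder",
}

print(to_text(sample["question"]), is_correct("example answer!", sample))
```

Because 'answers' is a sequence, a question can have several acceptable answers. This sketch counts a prediction as correct when it matches any one of them, which is a simplification of set-based scoring.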
- Feature documentation:
Feature | Class | Shape | Dtype | Description |
---|---|---|---|---|
| | FeaturesDict | | | |
answers | Sequence(Text) | (None,) | string | |
question | Text | | string | |
url | Text | | string | |
Supervised keys (See as_supervised doc): None
Figure (tfds.show_examples): Not supported.
- Citation:
@inproceedings{berant-etal-2013-semantic,
title = "Semantic Parsing on {F}reebase from Question-Answer Pairs",
author = "Berant, Jonathan and
Chou, Andrew and
Frostig, Roy and
Liang, Percy",
booktitle = "Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing",
month = oct,
year = "2013",
address = "Seattle, Washington, USA",
publisher = "Association for Computational Linguistics",
url = "https://www.aclweb.org/anthology/D13-1160",
pages = "1533--1544",
}