- Description:
The CFQ dataset (and it's splits) for measuring compositional generalization.
See https://arxiv.org/abs/1912.09713.pdf for background.
A note about the validation set: Since it has the same distribution as the test set and we are interested in measuring the compositional generalization of a model with respect to an unknown test distribution we suggest that any tuning should be done on a subset of the train set only (see section 5.1 of the paper).
Example usage:
data = tfds.load('cfq/mcd1')
Additional Documentation: Explore on Papers With Code
Homepage: https://github.com/google-research/google-research/tree/master/cfq
Source code:
tfds.text.CFQVersions:
1.2.0(default): No release notes.
Download size:
255.20 MiBAuto-cached (documentation): Yes
Feature structure:
FeaturesDict({
'query': Text(shape=(), dtype=string),
'question': Text(shape=(), dtype=string),
})
- Feature documentation:
| Feature | Class | Shape | Dtype | Description |
|---|---|---|---|---|
| FeaturesDict | ||||
| query | Text | string | ||
| question | Text | string |
Supervised keys (See
as_superviseddoc):('question', 'query')Figure (tfds.show_examples): Not supported.
Citation:
@inproceedings{Keysers2020,
title={Measuring Compositional Generalization: A Comprehensive Method on
Realistic Data},
author={Daniel Keysers and Nathanael Sch"{a}rli and Nathan Scales and
Hylke Buisman and Daniel Furrer and Sergii Kashubin and
Nikola Momchev and Danila Sinopalnikov and Lukasz Stafiniak and
Tibor Tihon and Dmitry Tsarkov and Xiao Wang and Marc van Zee and
Olivier Bousquet},
booktitle={ICLR},
year={2020},
url={https://arxiv.org/abs/1912.09713.pdf},
}
cfq/mcd1 (default config)
Dataset size:
49.75 MiBSplits:
| Split | Examples |
|---|---|
'test' |
11,968 |
'train' |
95,743 |
'validation' |
11,968 |
- Examples (tfds.as_dataframe):
cfq/mcd2
Dataset size:
51.39 MiBSplits:
| Split | Examples |
|---|---|
'test' |
11,968 |
'train' |
95,743 |
'validation' |
11,968 |
- Examples (tfds.as_dataframe):
cfq/mcd3
Dataset size:
50.22 MiBSplits:
| Split | Examples |
|---|---|
'test' |
11,968 |
'train' |
95,743 |
'validation' |
11,968 |
- Examples (tfds.as_dataframe):
cfq/question_complexity_split
Dataset size:
52.81 MiBSplits:
| Split | Examples |
|---|---|
'test' |
10,340 |
'train' |
98,999 |
'validation' |
10,339 |
- Examples (tfds.as_dataframe):
cfq/question_pattern_split
Dataset size:
52.81 MiBSplits:
| Split | Examples |
|---|---|
'test' |
11,909 |
'train' |
95,654 |
'validation' |
12,115 |
- Examples (tfds.as_dataframe):
cfq/query_complexity_split
Dataset size:
52.81 MiBSplits:
| Split | Examples |
|---|---|
'test' |
9,512 |
'train' |
100,654 |
'validation' |
9,512 |
- Examples (tfds.as_dataframe):
cfq/query_pattern_split
Dataset size:
52.81 MiBSplits:
| Split | Examples |
|---|---|
'test' |
12,589 |
'train' |
94,600 |
'validation' |
12,489 |
- Examples (tfds.as_dataframe):
cfq/random_split
Dataset size:
52.81 MiBSplits:
| Split | Examples |
|---|---|
'test' |
11,967 |
'train' |
95,744 |
'validation' |
11,967 |
- Examples (tfds.as_dataframe):
cfq/cd0_r1
Dataset size:
49.72 MiBSplits:
| Split | Examples |
|---|---|
'test' |
11,968 |
'train' |
95,743 |
'validation' |
11,968 |
- Examples (tfds.as_dataframe):
cfq/cd0_r2
Dataset size:
49.45 MiBSplits:
| Split | Examples |
|---|---|
'test' |
11,968 |
'train' |
95,743 |
'validation' |
11,968 |
- Examples (tfds.as_dataframe):
cfq/cd0_r3
Dataset size:
49.76 MiBSplits:
| Split | Examples |
|---|---|
'test' |
11,968 |
'train' |
95,743 |
'validation' |
11,968 |
- Examples (tfds.as_dataframe):
cfq/cd0_r4
Dataset size:
50.04 MiBSplits:
| Split | Examples |
|---|---|
'test' |
11,968 |
'train' |
95,743 |
'validation' |
11,968 |
- Examples (tfds.as_dataframe):
cfq/cd0_r5
Dataset size:
49.36 MiBSplits:
| Split | Examples |
|---|---|
'test' |
11,968 |
'train' |
95,743 |
'validation' |
11,968 |
- Examples (tfds.as_dataframe):
cfq/cd0_r6
Dataset size:
49.36 MiBSplits:
| Split | Examples |
|---|---|
'test' |
11,968 |
'train' |
95,743 |
'validation' |
11,968 |
- Examples (tfds.as_dataframe):
cfq/cd0_r7
Dataset size:
49.93 MiBSplits:
| Split | Examples |
|---|---|
'test' |
11,968 |
'train' |
95,743 |
'validation' |
11,968 |
- Examples (tfds.as_dataframe):
cfq/cd0_r8
Dataset size:
48.58 MiBSplits:
| Split | Examples |
|---|---|
'test' |
11,968 |
'train' |
95,743 |
'validation' |
11,968 |
- Examples (tfds.as_dataframe):
cfq/cd0_r9
Dataset size:
49.23 MiBSplits:
| Split | Examples |
|---|---|
'test' |
11,968 |
'train' |
95,743 |
'validation' |
11,968 |
- Examples (tfds.as_dataframe):
cfq/cd0.1_r1
Dataset size:
51.54 MiBSplits:
| Split | Examples |
|---|---|
'test' |
11,968 |
'train' |
95,743 |
'validation' |
11,968 |
- Examples (tfds.as_dataframe):
cfq/cd0.1_r2
Dataset size:
48.18 MiBSplits:
| Split | Examples |
|---|---|
'test' |
11,968 |
'train' |
95,743 |
'validation' |
11,968 |
- Examples (tfds.as_dataframe):
cfq/cd0.1_r3
Dataset size:
49.23 MiBSplits:
| Split | Examples |
|---|---|
'test' |
11,968 |
'train' |
95,743 |
'validation' |
11,968 |
- Examples (tfds.as_dataframe):
cfq/cd0.1_r4
Dataset size:
49.76 MiBSplits:
| Split | Examples |
|---|---|
'test' |
11,968 |
'train' |
95,743 |
'validation' |
11,968 |
- Examples (tfds.as_dataframe):
cfq/cd0.1_r5
Dataset size:
49.10 MiBSplits:
| Split | Examples |
|---|---|
'test' |
11,968 |
'train' |
95,743 |
'validation' |
11,968 |
- Examples (tfds.as_dataframe):
cfq/cd0.1_r6
Dataset size:
49.10 MiBSplits:
| Split | Examples |
|---|---|
'test' |
11,968 |
'train' |
95,743 |
'validation' |
11,968 |
- Examples (tfds.as_dataframe):
cfq/cd0.1_r7
Dataset size:
48.02 MiBSplits:
| Split | Examples |
|---|---|
'test' |
11,968 |
'train' |
95,743 |
'validation' |
11,968 |
- Examples (tfds.as_dataframe):
cfq/cd0.1_r8
Dataset size:
48.13 MiBSplits:
| Split | Examples |
|---|---|
'test' |
11,968 |
'train' |
95,743 |
'validation' |
11,968 |
- Examples (tfds.as_dataframe):
cfq/cd0.1_r9
Dataset size:
48.44 MiBSplits:
| Split | Examples |
|---|---|
'test' |
11,968 |
'train' |
95,743 |
'validation' |
11,968 |
- Examples (tfds.as_dataframe):
cfq/cd0.2_r1
Dataset size:
50.90 MiBSplits:
| Split | Examples |
|---|---|
'test' |
11,968 |
'train' |
95,743 |
'validation' |
11,968 |
- Examples (tfds.as_dataframe):
cfq/cd0.2_r2
Dataset size:
49.12 MiBSplits:
| Split | Examples |
|---|---|
'test' |
11,968 |
'train' |
95,743 |
'validation' |
11,968 |
- Examples (tfds.as_dataframe):
cfq/cd0.2_r3
Dataset size:
50.68 MiBSplits:
| Split | Examples |
|---|---|
'test' |
11,968 |
'train' |
95,743 |
'validation' |
11,968 |
- Examples (tfds.as_dataframe):
cfq/cd0.2_r4
Dataset size:
52.61 MiBSplits:
| Split | Examples |
|---|---|
'test' |
11,968 |
'train' |
95,743 |
'validation' |
11,968 |
- Examples (tfds.as_dataframe):
cfq/cd0.2_r5
Dataset size:
50.63 MiBSplits:
| Split | Examples |
|---|---|
'test' |
11,968 |
'train' |
95,743 |
'validation' |
11,968 |
- Examples (tfds.as_dataframe):
cfq/cd0.2_r6
Dataset size:
50.63 MiBSplits:
| Split | Examples |
|---|---|
'test' |
11,968 |
'train' |
95,743 |
'validation' |
11,968 |
- Examples (tfds.as_dataframe):
cfq/cd0.2_r7
Dataset size:
52.39 MiBSplits:
| Split | Examples |
|---|---|
'test' |
11,968 |
'train' |
95,743 |
'validation' |
11,968 |
- Examples (tfds.as_dataframe):
cfq/cd0.2_r8
Dataset size:
50.53 MiBSplits:
| Split | Examples |
|---|---|
'test' |
11,968 |
'train' |
95,743 |
'validation' |
11,968 |
- Examples (tfds.as_dataframe):
cfq/cd0.2_r9
Dataset size:
50.64 MiBSplits:
| Split | Examples |
|---|---|
'test' |
11,968 |
'train' |
95,743 |
'validation' |
11,968 |
- Examples (tfds.as_dataframe):
cfq/cd0.3_r1
Dataset size:
52.08 MiBSplits:
| Split | Examples |
|---|---|
'test' |
11,968 |
'train' |
95,743 |
'validation' |
11,968 |
- Examples (tfds.as_dataframe):
cfq/cd0.3_r2
Dataset size:
52.11 MiBSplits:
| Split | Examples |
|---|---|
'test' |
11,968 |
'train' |
95,743 |
'validation' |
11,968 |
- Examples (tfds.as_dataframe):
cfq/cd0.3_r3
Dataset size:
51.25 MiBSplits:
| Split | Examples |
|---|---|
'test' |
11,968 |
'train' |
95,743 |
'validation' |
11,968 |
- Examples (tfds.as_dataframe):
cfq/cd0.3_r4
Dataset size:
49.94 MiBSplits:
| Split | Examples |
|---|---|
'test' |
11,968 |
'train' |
95,743 |
'validation' |
11,968 |
- Examples (tfds.as_dataframe):
cfq/cd0.3_r5
Dataset size:
49.71 MiBSplits:
| Split | Examples |
|---|---|
'test' |
11,968 |
'train' |
95,743 |
'validation' |
11,968 |
- Examples (tfds.as_dataframe):
cfq/cd0.3_r6
Dataset size:
49.71 MiBSplits:
| Split | Examples |
|---|---|
'test' |
11,968 |
'train' |
95,743 |
'validation' |
11,968 |
- Examples (tfds.as_dataframe):
cfq/cd0.3_r7
Dataset size:
50.86 MiBSplits:
| Split | Examples |
|---|---|
'test' |
11,968 |
'train' |
95,743 |
'validation' |
11,968 |
- Examples (tfds.as_dataframe):
cfq/cd0.3_r8
Dataset size:
49.43 MiBSplits:
| Split | Examples |
|---|---|
'test' |
11,968 |
'train' |
95,743 |
'validation' |
11,968 |
- Examples (tfds.as_dataframe):
cfq/cd0.3_r9
Dataset size:
50.32 MiBSplits:
| Split | Examples |
|---|---|
'test' |
11,968 |
'train' |
95,743 |
'validation' |
11,968 |
- Examples (tfds.as_dataframe):
cfq/cd0.4_r1
Dataset size:
48.52 MiBSplits:
| Split | Examples |
|---|---|
'test' |
11,968 |
'train' |
95,743 |
'validation' |
11,968 |
- Examples (tfds.as_dataframe):
cfq/cd0.4_r2
Dataset size:
47.40 MiBSplits:
| Split | Examples |
|---|---|
'test' |
11,968 |
'train' |
95,743 |
'validation' |
11,968 |
- Examples (tfds.as_dataframe):
cfq/cd0.4_r3
Dataset size:
49.18 MiBSplits:
| Split | Examples |
|---|---|
'test' |
11,968 |
'train' |
95,743 |
'validation' |
11,968 |
- Examples (tfds.as_dataframe):
cfq/cd0.4_r4
Dataset size:
47.86 MiBSplits:
| Split | Examples |
|---|---|
'test' |
11,968 |
'train' |
95,743 |
'validation' |
11,968 |
- Examples (tfds.as_dataframe):
cfq/cd0.4_r5
Dataset size:
49.01 MiBSplits:
| Split | Examples |
|---|---|
'test' |
11,968 |
'train' |
95,743 |
'validation' |
11,968 |
- Examples (tfds.as_dataframe):
cfq/cd0.4_r6
Dataset size:
49.01 MiBSplits:
| Split | Examples |
|---|---|
'test' |
11,968 |
'train' |
95,743 |
'validation' |
11,968 |
- Examples (tfds.as_dataframe):
cfq/cd0.4_r7
Dataset size:
51.70 MiBSplits:
| Split | Examples |
|---|---|
'test' |
11,968 |
'train' |
95,743 |
'validation' |
11,968 |
- Examples (tfds.as_dataframe):
cfq/cd0.4_r8
Dataset size:
50.94 MiBSplits:
| Split | Examples |
|---|---|
'test' |
11,968 |
'train' |
95,743 |
'validation' |
11,968 |
- Examples (tfds.as_dataframe):
cfq/cd0.4_r9
Dataset size:
51.98 MiBSplits:
| Split | Examples |
|---|---|
'test' |
11,968 |
'train' |
95,743 |
'validation' |
11,968 |
- Examples (tfds.as_dataframe):
cfq/cd0.5_r1
Dataset size:
51.00 MiBSplits:
| Split | Examples |
|---|---|
'test' |
11,968 |
'train' |
95,743 |
'validation' |
11,968 |
- Examples (tfds.as_dataframe):
cfq/cd0.5_r2
Dataset size:
49.91 MiBSplits:
| Split | Examples |
|---|---|
'test' |
11,968 |
'train' |
95,743 |
'validation' |
11,968 |
- Examples (tfds.as_dataframe):
cfq/cd0.5_r3
Dataset size:
51.26 MiBSplits:
| Split | Examples |
|---|---|
'test' |
11,968 |
'train' |
95,743 |
'validation' |
11,968 |
- Examples (tfds.as_dataframe):
cfq/cd0.5_r4
Dataset size:
51.27 MiBSplits:
| Split | Examples |
|---|---|
'test' |
11,968 |
'train' |
95,743 |
'validation' |
11,968 |
- Examples (tfds.as_dataframe):
cfq/cd0.5_r5
Dataset size:
51.87 MiBSplits:
| Split | Examples |
|---|---|
'test' |
11,968 |
'train' |
95,743 |
'validation' |
11,968 |
- Examples (tfds.as_dataframe):
cfq/cd0.5_r6
Dataset size:
51.87 MiBSplits:
| Split | Examples |
|---|---|
'test' |
11,968 |
'train' |
95,743 |
'validation' |
11,968 |
- Examples (tfds.as_dataframe):
cfq/cd0.5_r7
Dataset size:
48.52 MiBSplits:
| Split | Examples |
|---|---|
'test' |
11,968 |
'train' |
95,743 |
'validation' |
11,968 |
- Examples (tfds.as_dataframe):
cfq/cd0.5_r8
Dataset size:
48.62 MiBSplits:
| Split | Examples |
|---|---|
'test' |
11,968 |
'train' |
95,743 |
'validation' |
11,968 |
- Examples (tfds.as_dataframe):
cfq/cd0.5_r9
Dataset size:
48.48 MiBSplits:
| Split | Examples |
|---|---|
'test' |
11,968 |
'train' |
95,743 |
'validation' |
11,968 |
- Examples (tfds.as_dataframe):
cfq/cd0.6_r1
Dataset size:
49.32 MiBSplits:
| Split | Examples |
|---|---|
'test' |
11,968 |
'train' |
95,743 |
'validation' |
11,968 |
- Examples (tfds.as_dataframe):
cfq/cd0.6_r2
Dataset size:
49.36 MiBSplits:
| Split | Examples |
|---|---|
'test' |
11,968 |
'train' |
95,743 |
'validation' |
11,968 |
- Examples (tfds.as_dataframe):
cfq/cd0.6_r3
Dataset size:
49.85 MiBSplits:
| Split | Examples |
|---|---|
'test' |
11,968 |
'train' |
95,743 |
'validation' |
11,968 |
- Examples (tfds.as_dataframe):
cfq/cd0.6_r4
Dataset size:
50.07 MiBSplits:
| Split | Examples |
|---|---|
'test' |
11,968 |
'train' |
95,743 |
'validation' |
11,968 |
- Examples (tfds.as_dataframe):
cfq/cd0.6_r5
Dataset size:
49.86 MiBSplits:
| Split | Examples |
|---|---|
'test' |
11,968 |
'train' |
95,743 |
'validation' |
11,968 |
- Examples (tfds.as_dataframe):
cfq/cd0.6_r6
Dataset size:
49.86 MiBSplits:
| Split | Examples |
|---|---|
'test' |
11,968 |
'train' |
95,743 |
'validation' |
11,968 |
- Examples (tfds.as_dataframe):
cfq/cd0.6_r7
Dataset size:
49.42 MiBSplits:
| Split | Examples |
|---|---|
'test' |
11,968 |
'train' |
95,743 |
'validation' |
11,968 |
- Examples (tfds.as_dataframe):
cfq/cd0.6_r8
Dataset size:
48.52 MiBSplits:
| Split | Examples |
|---|---|
'test' |
11,968 |
'train' |
95,743 |
'validation' |
11,968 |
- Examples (tfds.as_dataframe):
cfq/cd0.6_r9
Dataset size:
49.54 MiBSplits:
| Split | Examples |
|---|---|
'test' |
11,968 |
'train' |
95,743 |
'validation' |
11,968 |
- Examples (tfds.as_dataframe):
cfq/cd1_r1
Dataset size:
49.75 MiBSplits:
| Split | Examples |
|---|---|
'test' |
11,968 |
'train' |
95,743 |
'validation' |
11,968 |
- Examples (tfds.as_dataframe):
cfq/cd1_r2
Dataset size:
49.07 MiBSplits:
| Split | Examples |
|---|---|
'test' |
11,968 |
'train' |
95,743 |
'validation' |
11,968 |
- Examples (tfds.as_dataframe):
cfq/cd1_r3
Dataset size:
50.22 MiBSplits:
| Split | Examples |
|---|---|
'test' |
11,968 |
'train' |
95,743 |
'validation' |
11,968 |
- Examples (tfds.as_dataframe):
cfq/cd1_r4
Dataset size:
49.12 MiBSplits:
| Split | Examples |
|---|---|
'test' |
11,968 |
'train' |
95,743 |
'validation' |
11,968 |
- Examples (tfds.as_dataframe):
cfq/cd1_r5
Dataset size:
48.80 MiBSplits:
| Split | Examples |
|---|---|
'test' |
11,968 |
'train' |
95,743 |
'validation' |
11,968 |
- Examples (tfds.as_dataframe):
cfq/cd1_r6
Dataset size:
48.80 MiBSplits:
| Split | Examples |
|---|---|
'test' |
11,968 |
'train' |
95,743 |
'validation' |
11,968 |
- Examples (tfds.as_dataframe):
cfq/cd1_r7
Dataset size:
50.69 MiBSplits:
| Split | Examples |
|---|---|
'test' |
11,968 |
'train' |
95,743 |
'validation' |
11,968 |
- Examples (tfds.as_dataframe):
cfq/cd1_r8
Dataset size:
51.39 MiBSplits:
| Split | Examples |
|---|---|
'test' |
11,968 |
'train' |
95,743 |
'validation' |
11,968 |
- Examples (tfds.as_dataframe):
cfq/cd1_r9
Dataset size:
49.94 MiBSplits:
| Split | Examples |
|---|---|
'test' |
11,968 |
'train' |
95,743 |
'validation' |
11,968 |
- Examples (tfds.as_dataframe):