चतुर

विवरण :

CLEVR एक डायग्नोस्टिक डेटासेट है जो दृश्य तर्क क्षमताओं की एक श्रृंखला का परीक्षण करता है। इसमें न्यूनतम पूर्वाग्रह हैं और प्रत्येक प्रश्न के लिए आवश्यक तर्क का वर्णन करने वाली विस्तृत टिप्पणियाँ हैं।

अतिरिक्त दस्तावेज़ीकरण : कोड वाले कागजात पर अन्वेषण करें
मुखपृष्ठ : https://cs.stanford.edu/people/jcjohns/clevr/
स्रोत कोड : tfds.datasets.clevr.Builder
संस्करण :
- 3.0.0 : कोई रिलीज़ नोट नहीं।
- 3.1.0 (डिफ़ॉल्ट): प्रश्न/उत्तर पाठ जोड़ें।
डाउनलोड आकार : 17.72 GiB
डेटासेट का आकार : 17.75 GiB
ऑटो-कैश्ड ( दस्तावेज़ीकरण ): नहीं
विभाजन :

विभाजित करना	उदाहरण
`'test'`	15,000
`'train'`	70,000
`'validation'`	15,000

फ़ीचर संरचना :

FeaturesDict({
    'file_name': Text(shape=(), dtype=string),
    'image': Image(shape=(None, None, 3), dtype=uint8),
    'objects': Sequence({
        '3d_coords': Tensor(shape=(3,), dtype=float32),
        'color': ClassLabel(shape=(), dtype=int64, num_classes=8),
        'material': ClassLabel(shape=(), dtype=int64, num_classes=2),
        'pixel_coords': Tensor(shape=(3,), dtype=float32),
        'rotation': float32,
        'shape': ClassLabel(shape=(), dtype=int64, num_classes=3),
        'size': ClassLabel(shape=(), dtype=int64, num_classes=2),
    }),
    'question_answer': Sequence({
        'answer': Text(shape=(), dtype=string),
        'question': Text(shape=(), dtype=string),
    }),
})

फ़ीचर दस्तावेज़ीकरण :

विशेषता	कक्षा	आकार	डीप्रकार
	फीचर्सडिक्ट
फ़ाइल का नाम	मूलपाठ		डोरी
छवि	छवि	(कोई नहीं, कोई नहीं, 3)	uint8
वस्तुओं	अनुक्रम
ऑब्जेक्ट/3डी_कोर्ड्स	टेन्सर	(3,)	फ्लोट32
वस्तुएं/रंग	क्लास लेबल		int64
वस्तुएं/सामग्री	क्लास लेबल		int64
ऑब्जेक्ट/पिक्सेल_कोर्ड्स	टेन्सर	(3,)	फ्लोट32
ऑब्जेक्ट/रोटेशन	टेन्सर		फ्लोट32
वस्तुएं/आकार	क्लास लेबल		int64
वस्तुएं/आकार	क्लास लेबल		int64
प्रश्न जवाब	अनुक्रम
प्रश्न_उत्तर/उत्तर	मूलपाठ		डोरी
प्रश्न_उत्तर/प्रश्न	मूलपाठ		डोरी

पर्यवेक्षित कुंजियाँ ( as_supervised doc देखें): None
चित्र ( tfds.show_examples ):

VISUALIZATION

उदाहरण ( tfds.as_dataframe ):

उद्धरण :

@inproceedings{johnson2017clevr,
  title={ {CLEVR}: A diagnostic dataset for compositional language and elementary visual reasoning},
  author={Johnson, Justin and Hariharan, Bharath and van der Maaten, Laurens and Fei-Fei, Li and Lawrence Zitnick, C and Girshick, Ross},
  booktitle={Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition},
  year={2017}
}