TFDS now supports the Croissant 🥐 format! Read the documentation to know more.

open_images_challenge2019_detection

Description:

Open Images is a collaborative release of ~9 million images annotated with image-level labels, object bounding boxes, object segmentation masks, and visual relationships. This uniquely large and diverse dataset is designed to spur state of the art advances in analyzing and understanding images.

This contains the data from thee Object Detection track of the competition. The goal in this track is to predict a tight bounding box around all object instances of 500 classes.

The images are annotated with positive image-level labels, indicating certain object classes are present, and with negative image-level labels, indicating certain classes are absent. In the competition, all other unannotated classes are excluded from evaluation in that image. For each positive image-level label in an image, every instance of that object class in the image was annotated.

Homepage: https://storage.googleapis.com/openimages/web/challenge2019.html
Source code: tfds.datasets.open_images_challenge2019_detection.Builder
Versions:
- 1.0.0 (default): No release notes.
Download size: 534.63 GiB
Auto-cached (documentation): No
Splits:

Split	Examples
`'test'`	99,999
`'train'`	1,743,042
`'validation'`	41,620

Feature structure:

FeaturesDict({
    'bobjects': Sequence({
        'bbox': BBoxFeature(shape=(4,), dtype=float32),
        'is_group_of': bool,
        'label': ClassLabel(shape=(), dtype=int64, num_classes=500),
    }),
    'id': Text(shape=(), dtype=string),
    'image': Image(shape=(None, None, 3), dtype=uint8),
    'objects': Sequence({
        'confidence': float32,
        'label': ClassLabel(shape=(), dtype=int64, num_classes=500),
        'source': Text(shape=(), dtype=string),
    }),
})

Feature documentation:

Feature	Class	Shape	Dtype
	FeaturesDict
bobjects	Sequence
bobjects/bbox	BBoxFeature	(4,)	float32
bobjects/is_group_of	Tensor		bool
bobjects/label	ClassLabel		int64
id	Text		string
image	Image	(None, None, 3)	uint8
objects	Sequence
objects/confidence	Tensor		float32
objects/label	ClassLabel		int64
objects/source	Text		string