open_images_challenge2019_detection

  • Description:

Open Images is a collaborative release of ~9 million images annotated with image-level labels, object bounding boxes, object segmentation masks, and visual relationships. This uniquely large and diverse dataset is designed to spur state of the art advances in analyzing and understanding images.

This contains the data from thee Object Detection track of the competition. The goal in this track is to predict a tight bounding box around all object instances of 500 classes.

The images are annotated with positive image-level labels, indicating certain object classes are present, and with negative image-level labels, indicating certain classes are absent. In the competition, all other unannotated classes are excluded from evaluation in that image. For each positive image-level label in an image, every instance of that object class in the image was annotated.

Split Examples
'test' 99,999
'train' 1,743,042
'validation' 41,620
  • Feature structure:
FeaturesDict({
    'bobjects': Sequence({
        'bbox': BBoxFeature(shape=(4,), dtype=float32),
        'is_group_of': bool,
        'label': ClassLabel(shape=(), dtype=int64, num_classes=500),
    }),
    'id': Text(shape=(), dtype=string),
    'image': Image(shape=(None, None, 3), dtype=uint8),
    'objects': Sequence({
        'confidence': float32,
        'label': ClassLabel(shape=(), dtype=int64, num_classes=500),
        'source': Text(shape=(), dtype=string),
    }),
})
  • Feature documentation:
Feature Class Shape Dtype Description
FeaturesDict
bobjects Sequence
bobjects/bbox BBoxFeature (4,) float32
bobjects/is_group_of Tensor bool
bobjects/label ClassLabel int64
id Text string
image Image (None, None, 3) uint8
objects Sequence
objects/confidence Tensor float32
objects/label ClassLabel int64
objects/source Text string

open_images_challenge2019_detection/200k (default config)

  • Config description: Images have at most 200,000 pixels, at 72 JPEG quality.

  • Dataset size: 59.06 GiB

  • Figure (tfds.show_examples):

Visualization

open_images_challenge2019_detection/300k

  • Config description: Images have at most 300,000 pixels, at 72 JPEG quality.

  • Dataset size: 80.10 GiB

  • Figure (tfds.show_examples):

Visualization