sun397

The database contains 108,753 images of 397 categories, used in the Scene UNderstanding (SUN) benchmark. The number of images varies across categories, but there are at least 100 images per category.

Several configs of the dataset are made available through TFDS:

  • A custom (random) partition of the whole dataset with 76,128 training images, 10,875 validation images and 21,750 test images. Images have been resized to have at most 120,000 pixels, and encoded as JPEG with quality of 72.

  • "standard-part1-120k", "standard-part2-120k", ..., "standard-part10-120k": Each of the 10 official train/test partitions with 50 images per class in each split. Images have been resized to have at most 120,000 pixels, and encoded as JPEG with quality of 72.

  • Additional Documentation: Explore on Papers With Code

  • Homepage: https://vision.princeton.edu/projects/2010/SUN/

  • Source code: tfds.datasets.sun397.Builder

  • Versions:

    • 4.0.0 (default): No release notes.
  • Download size: 36.41 GiB

  • Dataset size: Unknown size

  • Auto-cached (documentation): Unknown

  • Feature structure:

FeaturesDict({
    'file_name': Text(shape=(), dtype=string),
    'image': Image(shape=(None, None, 3), dtype=uint8),
    'label': ClassLabel(shape=(), dtype=int64, num_classes=397),
})
  • Feature documentation:
Feature Class Shape Dtype Description
FeaturesDict
file_name Text string
image Image (None, None, 3) uint8
label ClassLabel int64
@INPROCEEDINGS{Xiao:2010,
author={J. {Xiao} and J. {Hays} and K. A. {Ehinger} and A. {Oliva} and A. {Torralba} },
booktitle={2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition},
title={SUN database: Large-scale scene recognition from abbey to zoo},
year={2010},
volume={},
number={},
pages={3485-3492},
keywords={computer vision;human factors;image classification;object recognition;visual databases;SUN database;large-scale scene recognition;abbey;zoo;scene categorization;computer vision;scene understanding research;scene category;object categorization;scene understanding database;state-of-the-art algorithms;human scene classification performance;finer-grained scene representation;Sun;Large-scale systems;Layout;Humans;Image databases;Computer vision;Anthropometry;Bridges;Legged locomotion;Spatial databases},
doi={10.1109/CVPR.2010.5539970},
ISSN={1063-6919},
month={June},}

sun397/tfds (default config)

  • Config description: TFDS partition with random train/validation/test splits with 70%/10%/20% of the images, respectively. Images are resized to have at most 120,000 pixels, and are compressed with 72 JPEG quality.

  • Splits:

Split Examples
'test' 21,750
'train' 76,128
'validation' 10,875

Visualization

sun397/standard-part1-120k

  • Config description: Train and test splits from the official partition number 1. Images are resized to have at most 120,000 pixels, and compressed with 72 JPEG quality.

  • Splits:

Split Examples
'test' 19,850
'train' 19,850

Visualization

sun397/standard-part2-120k

  • Config description: Train and test splits from the official partition number 2. Images are resized to have at most 120,000 pixels, and compressed with 72 JPEG quality.

  • Splits:

Split Examples
'test' 19,850
'train' 19,850

Visualization

sun397/standard-part3-120k

  • Config description: Train and test splits from the official partition number 3. Images are resized to have at most 120,000 pixels, and compressed with 72 JPEG quality.

  • Splits:

Split Examples
'test' 19,850
'train' 19,850

Visualization

sun397/standard-part4-120k

  • Config description: Train and test splits from the official partition number 4. Images are resized to have at most 120,000 pixels, and compressed with 72 JPEG quality.

  • Splits:

Split Examples
'test' 19,850
'train' 19,850

Visualization

sun397/standard-part5-120k

  • Config description: Train and test splits from the official partition number 5. Images are resized to have at most 120,000 pixels, and compressed with 72 JPEG quality.

  • Splits:

Split Examples
'test' 19,850
'train' 19,850

Visualization

sun397/standard-part6-120k

  • Config description: Train and test splits from the official partition number 6. Images are resized to have at most 120,000 pixels, and compressed with 72 JPEG quality.

  • Splits:

Split Examples
'test' 19,850
'train' 19,850

Visualization

sun397/standard-part7-120k

  • Config description: Train and test splits from the official partition number 7. Images are resized to have at most 120,000 pixels, and compressed with 72 JPEG quality.

  • Splits:

Split Examples
'test' 19,850
'train' 19,850

Visualization

sun397/standard-part8-120k

  • Config description: Train and test splits from the official partition number 8. Images are resized to have at most 120,000 pixels, and compressed with 72 JPEG quality.

  • Splits:

Split Examples
'test' 19,850
'train' 19,850

Visualization

sun397/standard-part9-120k

  • Config description: Train and test splits from the official partition number 9. Images are resized to have at most 120,000 pixels, and compressed with 72 JPEG quality.

  • Splits:

Split Examples
'test' 19,850
'train' 19,850

Visualization

sun397/standard-part10-120k

  • Config description: Train and test splits from the official partition number 10. Images are resized to have at most 120,000 pixels, and compressed with 72 JPEG quality.

  • Splits:

Split Examples
'test' 19,850
'train' 19,850

Visualization