TFDS now supports the Croissant 🥐 format! Read the documentation to know more.

imagenet_lt

Description:

ImageNet-LT is a subset of original ImageNet ILSVRC 2012 dataset. The training set is subsampled such that the number of images per class follows a long-tailed distribution. The class with the maximum number of images contains 1,280 examples, whereas the class with the minumum number of images contains only 5 examples. The dataset also has a balanced validation set, which is also a subset of the ImageNet ILSVRC 2012 training set and contains 20 images per class. The test set of this dataset is the same as the validation set of the original ImageNet ILSVRC 2012 dataset.

The original ImageNet ILSVRC 2012 dataset must be downloaded manually, and its path should be set with --manual_dir in order to generate this dataset.

Additional Documentation: Explore on Papers With Code
Homepage: https://github.com/zhmiao/OpenLongTailRecognition-OLTR
Source code: tfds.datasets.imagenet_lt.Builder
Versions:
- 1.0.0 (default): Initial release.
Download size: 5.21 MiB
Dataset size: 20.92 GiB
Manual download instructions: This dataset requires you to download the source data manually into download_config.manual_dir (defaults to ~/tensorflow_datasets/downloads/manual/):
manual_dir should contain two files: ILSVRC2012_img_train.tar and ILSVRC2012_img_val.tar. You need to register on http://www.image-net.org/download-images in order to get the link to download the dataset.
Auto-cached (documentation): No
Splits:

Split	Examples
`'test'`	50,000
`'train'`	115,846
`'validation'`	20,000

Feature structure:

FeaturesDict({
    'file_name': Text(shape=(), dtype=string),
    'image': Image(shape=(None, None, 3), dtype=uint8),
    'label': ClassLabel(shape=(), dtype=int64, num_classes=1000),
})

Feature documentation:

Feature	Class	Shape	Dtype
	FeaturesDict
file_name	Text		string
image	Image	(None, None, 3)	uint8
label	ClassLabel		int64

Supervised keys (See as_supervised doc): ('image', 'label')
Figure (tfds.show_examples):

Visualization

Examples (tfds.as_dataframe):

Citation:

\
@inproceedings{openlongtailrecognition,
  title={Large-Scale Long-Tailed Recognition in an Open World},
  author={Liu, Ziwei and Miao, Zhongqi and Zhan, Xiaohang and Wang, Jiayun and Gong, Boqing and Yu, Stella X.},
  booktitle={IEEE Conference on Computer Vision and Pattern Recognition (CVPR)},
  year={2019},
  url={https://github.com/zhmiao/OpenLongTailRecognition-OLTR}
}