stanford_dogs

説明:

Stanford Dogs データセットには、世界中の 120 種類の犬の画像が含まれています。このデータセットは、画像を細かく分類するタスクのために、ImageNet の画像と注釈を使用して構築されています。 20,580 枚の画像があり、そのうち 12,000 枚がトレーニングに使用され、8580 枚がテストに使用されます。クラスラベルとバウンディングボックスの注釈は、12,000 枚の画像すべてに提供されます。

追加のドキュメント:コードを使用したペーパーの探索
ホームページ: http://vision.stanford.edu/aditya86/ImageNetDogs/main.html
ソースコード: tfds.datasets.stanford_dogs.Builder
バージョン:
- 0.2.0 (デフォルト): リリースノートはありません。
ダウンロードサイズ: 778.12 MiB
データセットのサイズ: 744.72 MiB
自動キャッシュ(ドキュメント): いいえ
スプリット:

スプリット	例
`'test'`	8,580
`'train'`	12,000

機能構造:

FeaturesDict({
    'image': Image(shape=(None, None, 3), dtype=uint8),
    'image/filename': Text(shape=(), dtype=string),
    'label': ClassLabel(shape=(), dtype=int64, num_classes=120),
    'objects': Sequence({
        'bbox': BBoxFeature(shape=(4,), dtype=float32),
    }),
})

機能のドキュメント:

特徴	クラス	形	Dtype
	特徴辞書
画像	画像	(なし、なし、3)	uint8
画像/ファイル名	文章		ストリング
ラベル	クラスラベル		int64
オブジェクト	順序
オブジェクト/bbox	BBoxFeature	(4)	float32

監視されたキー( as_supervised docを参照): ('image', 'label')
図( tfds.show_examples ):

視覚化

例( tfds.as_dataframe ):

引用：

@inproceedings{KhoslaYaoJayadevaprakashFeiFei_FGVC2011,
author = "Aditya Khosla and Nityananda Jayadevaprakash and Bangpeng Yao and
          Li Fei-Fei",
title = "Novel Dataset for Fine-Grained Image Categorization",
booktitle = "First Workshop on Fine-Grained Visual Categorization,
             IEEE Conference on Computer Vision and Pattern Recognition",
year = "2011",
month = "June",
address = "Colorado Springs, CO",
}
@inproceedings{imagenet_cvpr09,
        AUTHOR = {Deng, J. and Dong, W. and Socher, R. and Li, L.-J. and
                  Li, K. and Fei-Fei, L.},
        TITLE = { {ImageNet: A Large-Scale Hierarchical Image Database} },
        BOOKTITLE = {CVPR09},
        YEAR = {2009},
        BIBSOURCE = "http://www.image-net.org/papers/imagenet_cvpr09.bib"}

stanford_dogs コレクションでコンテンツを整理 必要に応じて、コンテンツの保存と分類を行います。

stanford_dogs