ウェイクビジョン

説明：

Wake Vision は、600 万を超える画像を特徴とする大規模で高品質のデータセットであり、現在の tinyML データセット (100 倍) の規模と多様性を大幅に超えています。このデータセットには、各画像に人物が含まれているかどうかの注釈が付いた画像が含まれています。さらに、公平性と堅牢性を評価するための包括的なきめの細かいベンチマークが組み込まれており、知覚される性別、知覚される年齢、被写体の距離、照明条件、描写をカバーします。 Wake Vision ラベルは、CC BY 4.0 ライセンスに基づいて Google LLC からライセンスを取得した Open Image の注釈から派生しています。画像は CC BY 2.0 ライセンスを持つものとしてリストされています。 Open Images からの注記: 「クリエイティブコモンズ表示ライセンスに基づいてライセンスが付与されている画像を特定しようとしましたが、各画像のライセンスステータスに関していかなる表明も保証も行いません。各画像のライセンスはご自身で確認してください。」

ホームページ: https://dataverse.harvard.edu/dataset.xhtml?persistentId=doi%3A10.7910%2FDVN%2F1HOPXC
ソースコード: tfds.datasets.wake_vision.Builder
バージョン:
- 1.0.0 (デフォルト): TensorFlow データセットの初期リリース。これは Harvard Dataverse の Wake Vision バージョン 2.0 に基づいていることに注意してください。
ダウンロードサイズ: Unknown size
データセットのサイズ: 239.25 GiB
自動キャッシュ(ドキュメント): いいえ
分割:

スプリット	例
`'test'`	55,763
`'train_large'`	5,760,428
`'train_quality'`	1,248,230
`'validation'`	18,582

機能の構造:

FeaturesDict({
    'age_unknown': ClassLabel(shape=(), dtype=int64, num_classes=2),
    'body_part': ClassLabel(shape=(), dtype=int64, num_classes=2),
    'bright': ClassLabel(shape=(), dtype=int64, num_classes=2),
    'dark': ClassLabel(shape=(), dtype=int64, num_classes=2),
    'depiction': ClassLabel(shape=(), dtype=int64, num_classes=2),
    'far': ClassLabel(shape=(), dtype=int64, num_classes=2),
    'filename': Text(shape=(), dtype=string),
    'gender_unknown': ClassLabel(shape=(), dtype=int64, num_classes=2),
    'image': Image(shape=(None, None, 3), dtype=uint8),
    'medium_distance': ClassLabel(shape=(), dtype=int64, num_classes=2),
    'middle_age': ClassLabel(shape=(), dtype=int64, num_classes=2),
    'near': ClassLabel(shape=(), dtype=int64, num_classes=2),
    'non-person_depiction': ClassLabel(shape=(), dtype=int64, num_classes=2),
    'non-person_non-depiction': ClassLabel(shape=(), dtype=int64, num_classes=2),
    'normal_lighting': ClassLabel(shape=(), dtype=int64, num_classes=2),
    'older': ClassLabel(shape=(), dtype=int64, num_classes=2),
    'person': ClassLabel(shape=(), dtype=int64, num_classes=2),
    'person_depiction': ClassLabel(shape=(), dtype=int64, num_classes=2),
    'predominantly_female': ClassLabel(shape=(), dtype=int64, num_classes=2),
    'predominantly_male': ClassLabel(shape=(), dtype=int64, num_classes=2),
    'young': ClassLabel(shape=(), dtype=int64, num_classes=2),
})

機能ドキュメント:

特徴	クラス	形	Dタイプ
	特徴辞書
年齢不明	クラスラベル		int64
ボディパーツ	クラスラベル		int64
明るい	クラスラベル		int64
暗い	クラスラベル		int64
描写	クラスラベル		int64
遠い	クラスラベル		int64
ファイル名	文章		弦
性別不明	クラスラベル		int64
画像	画像	(なし、なし、3)	uint8
中距離	クラスラベル		int64
中年	クラスラベル		int64
近く	クラスラベル		int64
人物以外の描写	クラスラベル		int64
非人物_非描写	クラスラベル		int64
通常_照明	クラスラベル		int64
年上の	クラスラベル		int64
人	クラスラベル		int64
人物の描写	クラスラベル		int64
主に女性	クラスラベル		int64
主に男性	クラスラベル		int64
若い	クラスラベル		int64

監視キー( as_supervised doc を参照): ('image', 'person')
図( tfds.show_examples ):

視覚化

例( tfds.as_dataframe ):

引用：

@article{banbury2024wake,
  title={Wake Vision: A Large-scale, Diverse Dataset and Benchmark Suite for TinyML Person Detection},
  author={Banbury, Colby and Njor, Emil and Stewart, Matthew and Warden, Pete and Kudlur, Manjunath and Jeffries, Nat and Fafoutis, Xenofon and Reddi, Vijay Janapa},
  journal={arXiv preprint arXiv:2405.00892},
  year={2024}
}

ウェイクビジョン コレクションでコンテンツを整理 必要に応じて、コンテンツの保存と分類を行います。

ウェイクビジョン