wake_vision

תיאור :

Wake Vision הוא מערך נתונים גדול ואיכותי הכולל למעלה מ-6 מיליון תמונות, החורג משמעותית מהקנה המידה והמגוון של מערכי נתונים זעירים ML הנוכחיים (פי 100). מערך נתונים זה כולל תמונות עם הערות לגבי האם כל תמונה מכילה אדם. בנוסף, הוא משלב מדד דק מקיף להערכת הוגנות וחוסן, המכסה מגדר נתפס, גיל נתפס, מרחק נושא, תנאי תאורה ותיאורים. התוויות של Wake Vision נגזרות מההערות של Open Image המורשות על ידי Google LLC תחת רישיון CC BY 4.0. התמונות מופיעות כבעלות רישיון CC BY 2.0. הערה מ-Open Images: "בעוד שניסינו לזהות תמונות ברישיון Creative Commons Attribution, אנחנו לא נותנים מצגים או התחייבויות לגבי סטטוס הרישיון של כל תמונה ועליך לאמת את הרישיון עבור כל תמונה בעצמך."

דף הבית : https://dataverse.harvard.edu/dataset.xhtml?persistentId=doi%3A10.7910%2FDVN%2F1HOPXC
קוד מקור : tfds.datasets.wake_vision.Builder
גרסאות :
- 1.0.0 (ברירת מחדל): שחרור ראשוני של ערכות נתונים של TensorFlow. שימו לב שזה מבוסס על גרסת 2.0 של Wake Vision ב-Harvard Dataverse.
גודל הורדה : Unknown size
גודל מערך נתונים : 239.25 GiB
שמירה אוטומטית במטמון ( תיעוד ): לא
פיצולים :

לְפַצֵל	דוגמאות
`'test'`	55,763
`'train_large'`	5,760,428
`'train_quality'`	1,248,230
`'validation'`	18,582

מבנה תכונה :

FeaturesDict({
    'age_unknown': ClassLabel(shape=(), dtype=int64, num_classes=2),
    'body_part': ClassLabel(shape=(), dtype=int64, num_classes=2),
    'bright': ClassLabel(shape=(), dtype=int64, num_classes=2),
    'dark': ClassLabel(shape=(), dtype=int64, num_classes=2),
    'depiction': ClassLabel(shape=(), dtype=int64, num_classes=2),
    'far': ClassLabel(shape=(), dtype=int64, num_classes=2),
    'filename': Text(shape=(), dtype=string),
    'gender_unknown': ClassLabel(shape=(), dtype=int64, num_classes=2),
    'image': Image(shape=(None, None, 3), dtype=uint8),
    'medium_distance': ClassLabel(shape=(), dtype=int64, num_classes=2),
    'middle_age': ClassLabel(shape=(), dtype=int64, num_classes=2),
    'near': ClassLabel(shape=(), dtype=int64, num_classes=2),
    'non-person_depiction': ClassLabel(shape=(), dtype=int64, num_classes=2),
    'non-person_non-depiction': ClassLabel(shape=(), dtype=int64, num_classes=2),
    'normal_lighting': ClassLabel(shape=(), dtype=int64, num_classes=2),
    'older': ClassLabel(shape=(), dtype=int64, num_classes=2),
    'person': ClassLabel(shape=(), dtype=int64, num_classes=2),
    'person_depiction': ClassLabel(shape=(), dtype=int64, num_classes=2),
    'predominantly_female': ClassLabel(shape=(), dtype=int64, num_classes=2),
    'predominantly_male': ClassLabel(shape=(), dtype=int64, num_classes=2),
    'young': ClassLabel(shape=(), dtype=int64, num_classes=2),
})

תיעוד תכונה :

תכונה	מַחלָקָה	צוּרָה	Dtype
	FeaturesDict
age_unknown	ClassLabel		int64
חלק_גוף	ClassLabel		int64
בָּהִיר	ClassLabel		int64
כֵּהֶה	ClassLabel		int64
תֵאוּר	ClassLabel		int64
רָחוֹק	ClassLabel		int64
שם הקובץ	טֶקסט		חוּט
מגדר_לא ידוע	ClassLabel		int64
תְמוּנָה	תְמוּנָה	(אין, אין, 3)	uint8
מרחק_בינוני	ClassLabel		int64
גיל העמידה	ClassLabel		int64
לְיַד	ClassLabel		int64
תיאור_לא-אדם	ClassLabel		int64
non-person_non-depiction	ClassLabel		int64
תאורה רגילה	ClassLabel		int64
מבוגרים יותר	ClassLabel		int64
אָדָם	ClassLabel		int64
תיאור_אדם	ClassLabel		int64
בעיקר_נקבה	ClassLabel		int64
בעיקר_זכר	ClassLabel		int64
צָעִיר	ClassLabel		int64

מפתחות בפיקוח (ראה as_supervised doc ): ('image', 'person')
איור ( tfds.show_examples ):

רְאִיָה

דוגמאות ( tfds.as_dataframe ): חסר.
ציטוט :

@article{banbury2024wake,
  title={Wake Vision: A Large-scale, Diverse Dataset and Benchmark Suite for TinyML Person Detection},
  author={Banbury, Colby and Njor, Emil and Stewart, Matthew and Warden, Pete and Kudlur, Manjunath and Jeffries, Nat and Fafoutis, Xenofon and Reddi, Vijay Janapa},
  journal={arXiv preprint arXiv:2405.00892},
  year={2024}
}