pass
Stay organized with collections
Save and categorize content based on your preferences.
PASS is a large-scale image dataset that does not include any humans, human
parts, or other personally identifiable information. It can be used for
high-quality self-supervised pretraining while significantly reducing privacy
concerns.
PASS contains 1,439,589 images without any labels sourced from YFCC-100M.
All images in this dataset are licenced under the CC-BY licence, as is the
dataset itself. For YFCC-100M see http://www.multimediacommons.org/
Split |
Examples |
'train' |
1,439,588 |
FeaturesDict({
'image': Image(shape=(None, None, 3), dtype=uint8),
'image/creator_uname': Text(shape=(), dtype=string),
'image/date_taken': Text(shape=(), dtype=string),
'image/gps_lat': float32,
'image/gps_lon': float32,
'image/hash': Text(shape=(), dtype=string),
})
Feature |
Class |
Shape |
Dtype |
Description |
|
FeaturesDict |
|
|
|
image |
Image |
(None, None, 3) |
uint8 |
|
image/creator_uname |
Text |
|
string |
|
image/date_taken |
Text |
|
string |
|
image/gps_lat |
Tensor |
|
float32 |
|
image/gps_lon |
Tensor |
|
float32 |
|
image/hash |
Text |
|
string |
|

@Article{asano21pass,
author = "Yuki M. Asano and Christian Rupprecht and Andrew Zisserman and Andrea Vedaldi",
title = "PASS: An ImageNet replacement for self-supervised pretraining without humans",
journal = "NeurIPS Track on Datasets and Benchmarks",
year = "2021"
}
Except as otherwise noted, the content of this page is licensed under the Creative Commons Attribution 4.0 License, and code samples are licensed under the Apache 2.0 License. For details, see the Google Developers Site Policies. Java is a registered trademark of Oracle and/or its affiliates.
Last updated 2024-06-01 UTC.
[null,null,["Last updated 2024-06-01 UTC."],[],[],null,["# pass\n\n\u003cbr /\u003e\n\n- **Description**:\n\nPASS is a large-scale image dataset that does not include any humans, human\nparts, or other personally identifiable information. It can be used for\nhigh-quality self-supervised pretraining while significantly reducing privacy\nconcerns.\n\nPASS contains 1,439,589 images without any labels sourced from YFCC-100M.\n\nAll images in this dataset are licenced under the CC-BY licence, as is the\ndataset itself. For YFCC-100M see \u003chttp://www.multimediacommons.org/\u003e\n\n- **Additional Documentation** :\n [Explore on Papers With Code\n north_east](https://paperswithcode.com/dataset/pass)\n\n- **Homepage** :\n [https://www.robots.ox.ac.uk/\\~vgg/data/pass/](https://www.robots.ox.ac.uk/%7Evgg/data/pass/)\n\n- **Source code** :\n [`tfds.datasets.pass.Builder`](https://github.com/tensorflow/datasets/tree/master/tensorflow_datasets/datasets/pass/pass_dataset_builder.py)\n\n- **Versions**:\n\n - `1.0.0`: Initial release.\n - `2.0.0`: v2: Removed 472 images from v1 as they contained humans. Also added metadata: datetaken and GPS.\n - **`3.0.0`** (default): v3: Removed 131 images from v2 as they contained humans/tattos.\n- **Download size** : `167.30 GiB`\n\n- **Dataset size** : `166.43 GiB`\n\n- **Auto-cached**\n ([documentation](https://www.tensorflow.org/datasets/performances#auto-caching)):\n No\n\n- **Splits**:\n\n| Split | Examples |\n|-----------|-----------|\n| `'train'` | 1,439,588 |\n\n- **Feature structure**:\n\n FeaturesDict({\n 'image': Image(shape=(None, None, 3), dtype=uint8),\n 'image/creator_uname': Text(shape=(), dtype=string),\n 'image/date_taken': Text(shape=(), dtype=string),\n 'image/gps_lat': float32,\n 'image/gps_lon': float32,\n 'image/hash': Text(shape=(), dtype=string),\n })\n\n- **Feature documentation**:\n\n| Feature | Class | Shape | Dtype | Description |\n|---------------------|--------------|-----------------|---------|-------------|\n| | FeaturesDict | | | |\n| image | Image | (None, None, 3) | uint8 | |\n| image/creator_uname | Text | | string | |\n| image/date_taken | Text | | string | |\n| image/gps_lat | Tensor | | float32 | |\n| image/gps_lon | Tensor | | float32 | |\n| image/hash | Text | | string | |\n\n- **Supervised keys** (See\n [`as_supervised` doc](https://www.tensorflow.org/datasets/api_docs/python/tfds/load#args)):\n `None`\n\n- **Figure**\n ([tfds.show_examples](https://www.tensorflow.org/datasets/api_docs/python/tfds/visualization/show_examples)):\n\n- **Examples** ([tfds.as_dataframe](https://www.tensorflow.org/datasets/api_docs/python/tfds/as_dataframe)):\n\nDisplay examples... \n\n- **Citation**:\n\n @Article{asano21pass,\n author = \"Yuki M. Asano and Christian Rupprecht and Andrew Zisserman and Andrea Vedaldi\",\n title = \"PASS: An ImageNet replacement for self-supervised pretraining without humans\",\n journal = \"NeurIPS Track on Datasets and Benchmarks\",\n year = \"2021\"\n }"]]