i_naturalist2021
Stay organized with collections
Save and categorize content based on your preferences.
The iNaturalist dataset 2021 contains a total of 10,000 species. The full
training dataset contains nearly 2.7M images. To make the dataset more
accessible we have also created a "mini" training dataset with 50 examples per
species for a total of 500K images. The full training train
split overlaps
with the mini
split. The val set contains for each species 10 validation
images (100K in total). There are a total of 500,000 test images in the
public_test
split (without ground-truth labels).
Split |
Examples |
'mini' |
500,000 |
'test' |
500,000 |
'train' |
2,686,843 |
'val' |
100,000 |
FeaturesDict({
'file_id': Text(shape=(), dtype=string),
'id': Scalar(shape=(), dtype=int64),
'image': Image(shape=(None, None, 3), dtype=uint8),
'label': ClassLabel(shape=(), dtype=int64, num_classes=10000),
'supercategory': ClassLabel(shape=(), dtype=int64, num_classes=11),
})
Feature |
Class |
Shape |
Dtype |
Description |
|
FeaturesDict |
|
|
|
file_id |
Text |
|
string |
|
id |
Scalar |
|
int64 |
|
image |
Image |
(None, None, 3) |
uint8 |
|
label |
ClassLabel |
|
int64 |
|
supercategory |
ClassLabel |
|
int64 |
|

\
@misc{inaturalist21,
Howpublished = {~\url{https://github.com/visipedia/inat_comp/tree/master/2021} },
Title = { {iNaturalist} 2021 competition dataset.},
Year = {2021},
key = { {iNaturalist} 2021 competition dataset},
}
Except as otherwise noted, the content of this page is licensed under the Creative Commons Attribution 4.0 License, and code samples are licensed under the Apache 2.0 License. For details, see the Google Developers Site Policies. Java is a registered trademark of Oracle and/or its affiliates.
Last updated 2023-09-09 UTC.
[null,null,["Last updated 2023-09-09 UTC."],[],[],null,["# i_naturalist2021\n\n\u003cbr /\u003e\n\n- **Description**:\n\nThe iNaturalist dataset 2021 contains a total of 10,000 species. The full\ntraining dataset contains nearly 2.7M images. To make the dataset more\naccessible we have also created a \"mini\" training dataset with 50 examples per\nspecies for a total of 500K images. The full training `train` split overlaps\nwith the `mini` split. The val set contains for each species 10 validation\nimages (100K in total). There are a total of 500,000 test images in the\n`public_test` split (without ground-truth labels).\n\n- **Homepage** :\n \u003chttps://github.com/visipedia/inat_comp/tree/master/2021\u003e\n\n- **Source code** :\n [`tfds.image_classification.i_naturalist2021.INaturalist2021`](https://github.com/tensorflow/datasets/tree/master/tensorflow_datasets/image_classification/i_naturalist2021/i_naturalist2021.py)\n\n- **Versions**:\n\n - `1.0.0`: Initial release.\n - `2.0.0`: Update: Class indices follow the order in the JSON train file.\n - **`2.0.1`** (default): Update: Include the example id as provided in the JSON files.\n- **Download size** : `316.54 GiB`\n\n- **Dataset size** : `318.45 GiB`\n\n- **Auto-cached**\n ([documentation](https://www.tensorflow.org/datasets/performances#auto-caching)):\n No\n\n- **Splits**:\n\n| Split | Examples |\n|-----------|-----------|\n| `'mini'` | 500,000 |\n| `'test'` | 500,000 |\n| `'train'` | 2,686,843 |\n| `'val'` | 100,000 |\n\n- **Feature structure**:\n\n FeaturesDict({\n 'file_id': Text(shape=(), dtype=string),\n 'id': Scalar(shape=(), dtype=int64),\n 'image': Image(shape=(None, None, 3), dtype=uint8),\n 'label': ClassLabel(shape=(), dtype=int64, num_classes=10000),\n 'supercategory': ClassLabel(shape=(), dtype=int64, num_classes=11),\n })\n\n- **Feature documentation**:\n\n| Feature | Class | Shape | Dtype | Description |\n|---------------|--------------|-----------------|--------|-------------|\n| | FeaturesDict | | | |\n| file_id | Text | | string | |\n| id | Scalar | | int64 | |\n| image | Image | (None, None, 3) | uint8 | |\n| label | ClassLabel | | int64 | |\n| supercategory | ClassLabel | | int64 | |\n\n- **Supervised keys** (See\n [`as_supervised` doc](https://www.tensorflow.org/datasets/api_docs/python/tfds/load#args)):\n `('image', 'label')`\n\n- **Figure**\n ([tfds.show_examples](https://www.tensorflow.org/datasets/api_docs/python/tfds/visualization/show_examples)):\n\n- **Examples** ([tfds.as_dataframe](https://www.tensorflow.org/datasets/api_docs/python/tfds/as_dataframe)):\n\nDisplay examples... \n\n- **Citation**:\n\n \\\n @misc{inaturalist21,\n Howpublished = {~\\url{https://github.com/visipedia/inat_comp/tree/master/2021} },\n Title = { {iNaturalist} 2021 competition dataset.},\n Year = {2021},\n key = { {iNaturalist} 2021 competition dataset},\n }"]]