symmetric_solids
Stay organized with collections
Save and categorize content based on your preferences.
This is a pose estimation dataset, consisting of symmetric 3D shapes where
multiple orientations are visually indistinguishable. The challenge is to
predict all equivalent orientations when only one orientation is paired with
each image during training (as is the scenario for most pose estimation
datasets). In contrast to most pose estimation datasets, the full set of
equivalent orientations is available for evaluation.
There are eight shapes total, each rendered from 50,000 viewpoints distributed
uniformly at random over the full space of 3D rotations. Five of the shapes are
featureless -- tetrahedron, cube, icosahedron, cone, and cylinder. Of those, the
three Platonic solids (tetrahedron, cube, icosahedron) are annotated with their
12-, 24-, and 60-fold discrete symmetries, respectively. The cone and cylinder
are annotated with their continuous symmetries discretized at 1 degree
intervals. These symmetries are provided for evaluation; the intended
supervision is only a single rotation with each image.
The remaining three shapes are marked with a distinguishing feature. There is a
tetrahedron with one red-colored face, a cylinder with an off-center dot, and a
sphere with an X capped by a dot. Whether or not the distinguishing feature is
visible, the space of possible orientations is reduced. We do not provide the
set of equivalent rotations for these shapes.
Each example contains of
- the 224x224 RGB image
a shape index so that the dataset may be filtered by shape.
The indices correspond to:
- 0 = tetrahedron
- 1 = cube
- 2 = icosahedron
- 3 = cone
- 4 = cylinder
- 5 = marked tetrahedron
- 6 = marked cylinder
- 7 = marked sphere
the rotation used in the rendering process, represented as a 3x3 rotation
matrix
the set of known equivalent rotations under symmetry, for evaluation.
In the case of the three marked shapes, this is only the rendering rotation.
Split |
Examples |
'test' |
40,000 |
'train' |
360,000 |
FeaturesDict({
'image': Image(shape=(224, 224, 3), dtype=uint8),
'label_shape': ClassLabel(shape=(), dtype=int64, num_classes=8),
'rotation': Tensor(shape=(3, 3), dtype=float32),
'rotations_equivalent': Tensor(shape=(None, 3, 3), dtype=float32),
})
Feature |
Class |
Shape |
Dtype |
Description |
|
FeaturesDict |
|
|
|
image |
Image |
(224, 224, 3) |
uint8 |
|
label_shape |
ClassLabel |
|
int64 |
|
rotation |
Tensor |
(3, 3) |
float32 |
|
rotations_equivalent |
Tensor |
(None, 3, 3) |
float32 |
|

@inproceedings{implicitpdf2021,
title = {Implicit Representation of Probability Distributions on the Rotation
Manifold},
author = {Murphy, Kieran and Esteves, Carlos and Jampani, Varun and
Ramalingam, Srikumar and Makadia, Ameesh}
booktitle = {International Conference on Machine Learning}
year = {2021}
}
Except as otherwise noted, the content of this page is licensed under the Creative Commons Attribution 4.0 License, and code samples are licensed under the Apache 2.0 License. For details, see the Google Developers Site Policies. Java is a registered trademark of Oracle and/or its affiliates.
Last updated 2022-11-23 UTC.
[null,null,["Last updated 2022-11-23 UTC."],[],[],null,["# symmetric_solids\n\n\u003cbr /\u003e\n\n- **Description**:\n\nThis is a pose estimation dataset, consisting of symmetric 3D shapes where\nmultiple orientations are visually indistinguishable. The challenge is to\npredict all equivalent orientations when only one orientation is paired with\neach image during training (as is the scenario for most pose estimation\ndatasets). In contrast to most pose estimation datasets, the full set of\nequivalent orientations is available for evaluation.\n\nThere are eight shapes total, each rendered from 50,000 viewpoints distributed\nuniformly at random over the full space of 3D rotations. Five of the shapes are\nfeatureless -- tetrahedron, cube, icosahedron, cone, and cylinder. Of those, the\nthree Platonic solids (tetrahedron, cube, icosahedron) are annotated with their\n12-, 24-, and 60-fold discrete symmetries, respectively. The cone and cylinder\nare annotated with their continuous symmetries discretized at 1 degree\nintervals. These symmetries are provided for evaluation; the intended\nsupervision is only a single rotation with each image.\n\nThe remaining three shapes are marked with a distinguishing feature. There is a\ntetrahedron with one red-colored face, a cylinder with an off-center dot, and a\nsphere with an X capped by a dot. Whether or not the distinguishing feature is\nvisible, the space of possible orientations is reduced. We do not provide the\nset of equivalent rotations for these shapes.\n\nEach example contains of\n\n- the 224x224 RGB image\n- a shape index so that the dataset may be filtered by shape. \n\n The indices correspond to:\n\n - 0 = tetrahedron\n - 1 = cube\n - 2 = icosahedron\n - 3 = cone\n - 4 = cylinder\n - 5 = marked tetrahedron\n - 6 = marked cylinder\n - 7 = marked sphere\n- the rotation used in the rendering process, represented as a 3x3 rotation\n matrix\n\n- the set of known equivalent rotations under symmetry, for evaluation.\n\nIn the case of the three marked shapes, this is only the rendering rotation.\n\n- **Additional Documentation** :\n [Explore on Papers With Code\n north_east](https://paperswithcode.com/dataset/symmetric-solids)\n\n- **Homepage** :\n \u003chttps://implicit-pdf.github.io\u003e\n\n- **Source code** :\n [`tfds.datasets.symmetric_solids.Builder`](https://github.com/tensorflow/datasets/tree/master/tensorflow_datasets/datasets/symmetric_solids/symmetric_solids_dataset_builder.py)\n\n- **Versions**:\n\n - **`1.0.0`** (default): Initial release.\n- **Download size** : `3.10 GiB`\n\n- **Dataset size** : `3.94 GiB`\n\n- **Auto-cached**\n ([documentation](https://www.tensorflow.org/datasets/performances#auto-caching)):\n No\n\n- **Splits**:\n\n| Split | Examples |\n|-----------|----------|\n| `'test'` | 40,000 |\n| `'train'` | 360,000 |\n\n- **Feature structure**:\n\n FeaturesDict({\n 'image': Image(shape=(224, 224, 3), dtype=uint8),\n 'label_shape': ClassLabel(shape=(), dtype=int64, num_classes=8),\n 'rotation': Tensor(shape=(3, 3), dtype=float32),\n 'rotations_equivalent': Tensor(shape=(None, 3, 3), dtype=float32),\n })\n\n- **Feature documentation**:\n\n| Feature | Class | Shape | Dtype | Description |\n|----------------------|--------------|---------------|---------|-------------|\n| | FeaturesDict | | | |\n| image | Image | (224, 224, 3) | uint8 | |\n| label_shape | ClassLabel | | int64 | |\n| rotation | Tensor | (3, 3) | float32 | |\n| rotations_equivalent | Tensor | (None, 3, 3) | float32 | |\n\n- **Supervised keys** (See\n [`as_supervised` doc](https://www.tensorflow.org/datasets/api_docs/python/tfds/load#args)):\n `('image', 'rotation')`\n\n- **Figure**\n ([tfds.show_examples](https://www.tensorflow.org/datasets/api_docs/python/tfds/visualization/show_examples)):\n\n- **Examples** ([tfds.as_dataframe](https://www.tensorflow.org/datasets/api_docs/python/tfds/as_dataframe)):\n\nDisplay examples... \n\n- **Citation**:\n\n @inproceedings{implicitpdf2021,\n title = {Implicit Representation of Probability Distributions on the Rotation\n Manifold},\n author = {Murphy, Kieran and Esteves, Carlos and Jampani, Varun and\n Ramalingam, Srikumar and Makadia, Ameesh}\n booktitle = {International Conference on Machine Learning}\n year = {2021}\n }"]]