스몰노브

설명 :

이 데이터베이스는 모양을 통한 3D 물체 인식 실험을 위한 것입니다. 여기에는 네 발 달린 동물, 인물, 비행기, 트럭, 자동차 등 5가지 일반적인 카테고리에 속하는 50개의 장난감 이미지가 포함되어 있습니다. 물체는 6가지 조명 조건, 9가지 고도(5도마다 30~70도), 18방위(20도마다 0~340)에서 두 대의 카메라로 이미지화되었습니다.

훈련 세트는 각 카테고리별로 5개의 인스턴스(인스턴스 4, 6, 7, 8, 9)와 나머지 5개의 인스턴스(인스턴스 0, 1, 2, 3, 5)로 구성된 테스트 세트로 구성됩니다.

추가 문서 : 코드 가 포함된 논문 탐색
홈페이지 : https://cs.nyu.edu/~ylclab/data/norb-v1.0-small/
소스 코드 : tfds.datasets.smallnorb.Builder
버전 :
- 2.0.0 (기본값): 새로운 분할 API( https://tensorflow.org/datasets/splits )
- 2.1.0 : 릴리스 노트가 없습니다.
다운로드 크기 : 250.60 MiB
데이터 세트 크기 : Unknown size
자동 캐시 ( 문서 ): 알 수 없음
분할 :

나뉘다	예
`'test'`	24,300
`'train'`	24,300

기능 구조 :

FeaturesDict({
    'image': Image(shape=(96, 96, 1), dtype=uint8),
    'image2': Image(shape=(96, 96, 1), dtype=uint8),
    'instance': ClassLabel(shape=(), dtype=int64, num_classes=10),
    'label_azimuth': ClassLabel(shape=(), dtype=int64, num_classes=18),
    'label_category': ClassLabel(shape=(), dtype=int64, num_classes=5),
    'label_elevation': ClassLabel(shape=(), dtype=int64, num_classes=9),
    'label_lighting': ClassLabel(shape=(), dtype=int64, num_classes=6),
})

기능 문서 :

특징	수업	모양	Dtype
	특징Dict
영상	영상	(96, 96, 1)	uint8
이미지 2	영상	(96, 96, 1)	uint8
사례	클래스 라벨		정수64
label_방위각	클래스 라벨		정수64
라벨_카테고리	클래스 라벨		정수64
label_elevation	클래스 라벨		정수64
라벨_조명	클래스 라벨		정수64

감독되는 키 ( as_supervised doc 참조): ('image', 'label_category')
그림 ( tfds.show_examples ): 지원되지 않습니다.
예 ( tfds.as_dataframe ):

인용 :

@article{LeCun2004LearningMF,
  title={Learning methods for generic object recognition with invariance to pose and lighting},
  author={Yann LeCun and Fu Jie Huang and L{\'e}on Bottou},
  journal={Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition},
  year={2004},
  volume={2},
  pages={II-104 Vol.2}
}