dmlab

설명 :

Dmlab 데이터 세트에는 DeepMind Lab 환경에서 작동하는 에이전트가 관찰한 프레임이 포함되어 있으며, 에이전트와 환경에 존재하는 다양한 개체 사이의 거리에 대한 주석이 추가되어 있습니다. 목표는 3D 환경에서 시각적 입력으로부터의 거리를 추론하는 시각적 모델의 능력을 평가하는 것입니다. Dmlab 데이터 세트는 6개 클래스의 360x480 컬러 이미지로 구성됩니다. 클래스는 각각 {가까운, 먼, 아주 먼} x {긍정적 보상, 부정적인 보상}입니다.

홈페이지 : https://github.com/google-research/task_adaptation
소스코드 : tfds.image_classification.Dmlab
버전 :
- 2.0.1 (기본값): 릴리스 노트가 없습니다.
다운로드 크기 : 2.81 GiB
데이터세트 크기 : 3.13 GiB
자동 캐시 ( 문서 ): 아니요
분할 :

나뉘다	예
`'test'`	22,735
`'train'`	65,550
`'validation'`	22,628

기능 구조 :

FeaturesDict({
    'filename': Text(shape=(), dtype=string),
    'image': Image(shape=(360, 480, 3), dtype=uint8),
    'label': ClassLabel(shape=(), dtype=int64, num_classes=6),
})

기능 문서 :

특징	수업	모양	Dtype
	특징Dict
파일 이름	텍스트		끈
영상	영상	(360, 480, 3)	uint8
상표	클래스 라벨		정수64

감독되는 키 ( as_supervised doc 참조): ('image', 'label')
그림 ( tfds.show_examples ):

예 ( tfds.as_dataframe ):

인용 :

@article{zhai2019visual,
        title={The Visual Task Adaptation Benchmark},
        author={Xiaohua Zhai and Joan Puigcerver and Alexander Kolesnikov and
               Pierre Ruyssen and Carlos Riquelme and Mario Lucic and
               Josip Djolonga and Andre Susano Pinto and Maxim Neumann and
               Alexey Dosovitskiy and Lucas Beyer and Olivier Bachem and
               Michael Tschannen and Marcin Michalski and Olivier Bousquet and
               Sylvain Gelly and Neil Houlsby},
                              year={2019},
                              eprint={1910.04867},
                              archivePrefix={arXiv},
                              primaryClass={cs.CV},
                              url = {https://arxiv.org/abs/1910.04867}
                          }