키티

설명 :

Kitti에는 자율 주행 플랫폼을 사용하여 구축된 비전 작업 제품군이 포함되어 있습니다. 전체 벤치마크에는 스테레오, 광학 흐름, 시각적 주행 거리 측정 등과 같은 많은 작업이 포함되어 있습니다. 이 데이터 세트에는 단안 이미지 및 경계 상자를 포함한 객체 감지 데이터 세트가 포함되어 있습니다. 데이터 세트에는 3D 경계 상자로 주석이 달린 7481개의 훈련 이미지가 포함되어 있습니다. 주석에 대한 전체 설명은 Kitti 홈페이지의 개체 개발 키트 추가 정보의 추가 정보에서 찾을 수 있습니다.

추가 문서 : 코드 가 포함된 논문 탐색
홈페이지 : http://www.cvlibs.net/datasets/kitti/
소스 코드 : tfds.datasets.kitti.Builder
버전 :
- 3.1.0 : 릴리스 노트가 없습니다.
- 3.2.0 : Devkit이 업데이트되었습니다.
- 3.3.0 (기본값): occluded 기능에 대한 레이블이 추가되었습니다.
다운로드 크기 : 11.71 GiB
데이터세트 크기 : 5.27 GiB
자동 캐시 ( 문서 ): 아니요
분할 :

나뉘다	예
`'test'`	711
`'train'`	6,347
`'validation'`	423

기능 구조 :

FeaturesDict({
    'image': Image(shape=(None, None, 3), dtype=uint8),
    'image/file_name': Text(shape=(), dtype=string),
    'objects': Sequence({
        'alpha': float32,
        'bbox': BBoxFeature(shape=(4,), dtype=float32, description=2D bounding box of object in the image),
        'dimensions': Tensor(shape=(3,), dtype=float32, description=3D object dimensions: height, width, length (in meters)),
        'location': Tensor(shape=(3,), dtype=float32, description=3D object location x,y,z in camera coordinates (in meters)),
        'occluded': ClassLabel(shape=(), dtype=int64, num_classes=4),
        'rotation_y': float32,
        'truncated': float32,
        'type': ClassLabel(shape=(), dtype=int64, num_classes=8),
    }),
})

기능 문서 :

특징	수업	모양	Dtype	설명
	특징Dict
영상	영상	(없음, 없음, 3)	uint8
이미지/파일_이름	텍스트		끈
사물	순서
객체/알파	텐서		float32	물체의 관찰 각도, 범위 [-pi..pi]
객체/bbox	B박스특징	(4,)	float32	이미지 내 객체의 2D 경계 상자
개체/차원	텐서	(3,)	float32	3D 개체 크기: 높이, 너비, 길이(미터)
객체/위치	텐서	(3,)	float32	카메라 좌표의 3D 객체 위치 x,y,z(미터)
객체/폐쇄됨	클래스 라벨		정수64	폐색 상태를 나타내는 정수(0,1,2,3): 0 = 완전히 표시됨, 1 = 부분적으로 가려짐2 = 크게 가려짐, 3 = 알 수 없음
객체/rotation_y	텐서		float32	카메라 좌표에서 Y축을 중심으로 한 회전 [-pi..pi]
객체/잘림	텐서		float32	0(잘리지 않음)에서 1(잘림)까지 부동합니다. 여기서 잘림은 이미지 경계를 벗어나는 객체를 나타냅니다.
객체/유형	클래스 라벨		정수64	객체 유형(예: 'Car' 또는 'Van')

감독되는 키 ( as_supervised doc 참조): None
그림 ( tfds.show_examples ):

예 ( tfds.as_dataframe ):

인용 :

@inproceedings{Geiger2012CVPR,
  author = {Andreas Geiger and Philip Lenz and Raquel Urtasun},
  title = {Are we ready for Autonomous Driving? The KITTI Vision Benchmark Suite},
  booktitle = {Conference on Computer Vision and Pattern Recognition (CVPR)},
  year = {2012}
}