References:
region_descriptions_v1.0.0
To load this dataset in TFDS, use the following command:
ds = tfds.load('huggingface:visual_genome/region_descriptions_v1.0.0')
- Description:
Visual Genome enables modeling of objects and the relationships between them.
It collects dense annotations of objects, attributes, and relationships within each image.
Specifically, the dataset contains more than 108K images, each with an average of 35 objects, 26 attributes, and 21 pairwise relationships between objects.
- License: Creative Commons Attribution 4.0 International License
- Version: 1.0.0
- Splits:
Split | Examples |
---|---|
'train' | 108077 |
- Features:
{
"image": {
"decode": true,
"id": null,
"_type": "Image"
},
"image_id": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"url": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"width": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"height": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"coco_id": {
"dtype": "int64",
"id": null,
"_type": "Value"
},
"flickr_id": {
"dtype": "int64",
"id": null,
"_type": "Value"
},
"regions": [
{
"region_id": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"image_id": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"phrase": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"x": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"y": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"width": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"height": {
"dtype": "int32",
"id": null,
"_type": "Value"
}
}
]
}
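The schema above gives each region a phrase together with `x`, `y`, `width`, and `height` fields. As a minimal sketch of how those fields might be consumed, the helper below converts each region into corner coordinates clipped to the image size. It assumes that `x` and `y` mark the top-left corner in pixels and that an example arrives as a plain Python dict whose `regions` value is a list of dicts; the loaded dataset may present nested features differently. The name `region_boxes` and the sample record are hypothetical and used only for illustration.

```python
from typing import Any, Dict, List, Tuple

# (left, top, right, bottom) in pixels
Box = Tuple[int, int, int, int]


def region_boxes(example: Dict[str, Any]) -> List[Tuple[str, Box]]:
    """Return (phrase, box) pairs, with each box clipped to the image bounds.

    Assumes x/y are the top-left corner of the region (an assumption,
    not stated in the schema itself).
    """
    img_w, img_h = example["width"], example["height"]
    results: List[Tuple[str, Box]] = []
    for region in example["regions"]:
        left = max(0, region["x"])
        top = max(0, region["y"])
        right = min(img_w, region["x"] + region["width"])
        bottom = min(img_h, region["y"] + region["height"])
        # Skip regions that fall entirely outside the image.
        if right > left and bottom > top:
            results.append((region["phrase"], (left, top, right, bottom)))
    return results


# Hypothetical record shaped like the feature schema (image bytes omitted).
sample = {
    "image_id": 1,
    "width": 800,
    "height": 600,
    "regions": [
        {"region_id": 1, "image_id": 1, "phrase": "a red car",
         "x": 10, "y": 20, "width": 100, "height": 50},
        {"region_id": 2, "image_id": 1, "phrase": "edge of a sign",
         "x": 750, "y": 580, "width": 100, "height": 100},
        {"region_id": 3, "image_id": 1, "phrase": "outside the frame",
         "x": 900, "y": 10, "width": 10, "height": 10},
    ],
}

boxes = region_boxes(sample)
```

Clipping is a design choice rather than a requirement of the dataset: it keeps the resulting boxes safe to pass to an image-cropping routine even when an annotation extends past the image border.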