بخش_هرچیزی

توضیحات :

SA-1B دانلود کنید

Segment Anything 1 Billion (SA-1B) مجموعه داده ای است که برای آموزش مدل های تقسیم بندی اشیاء همه منظوره از تصاویر جهان باز طراحی شده است. مجموعه داده در مقاله "Segment Anything" معرفی شد.

مجموعه داده SA-1B شامل 11 میلیون تصویر متنوع، با وضوح بالا، دارای مجوز و محافظت از حریم خصوصی و حاشیه نویسی ماسک 1.1B است. ماسک‌ها در قالب کدگذاری طول اجرا (RLE) COCO ارائه می‌شوند و کلاس ندارند.

مجوز سفارشی است. لطفا شرایط و ضوابط کامل را در https://ai.facebook.com/datasets/segment-anything-downloads بخوانید

همه ویژگی ها در مجموعه داده اصلی هستند به جز image.content (محتوای تصویر).

شما می توانید ماسک های تقسیم بندی را با موارد زیر رمزگشایی کنید:

import tensorflow_datasets as tfds

pycocotools = tfds.core.lazy_imports.pycocotools

ds = tfds.load('segment_anything', split='train')
for example in tfds.as_numpy(ds):
  segmentation = example['annotations']['segmentation']
  for counts, size in zip(segmentation['counts'], segmentation['size']):
    encoded_mask = {'size': size, 'counts': counts}
    mask = pycocotools.decode(encoded_mask)  # np.array(dtype=uint8) mask
    ...

صفحه اصلی : https://ai.facebook.com/datasets/segment-anything-downloads
کد منبع : tfds.datasets.segment_anything.Builder
نسخه ها :
- 1.0.0 (پیش فرض): انتشار اولیه.
حجم دانلود : 10.28 TiB
اندازه مجموعه داده : 10.59 TiB
دستورالعمل‌های دانلود دستی : این مجموعه داده از شما می‌خواهد که داده‌های منبع را به صورت دستی در download_config.manual_dir دانلود کنید (پیش‌فرض ~/tensorflow_datasets/downloads/manual/ ):
فایل پیوندها را از https://ai.facebook.com/datasets/segment-anything-downloads manual_dir دانلود کنید باید حاوی فایل پیوندهایی باشد که به عنوان segment_anything_links.txt ذخیره شده است.
ذخیره خودکار ( اسناد ): خیر
تقسیم ها :

تقسیم کنید	نمونه ها
`'train'`	11,185,362

ساختار ویژگی :

FeaturesDict({
    'annotations': Sequence({
        'area': Scalar(shape=(), dtype=uint64, description=The area in pixels of the mask.),
        'bbox': BBoxFeature(shape=(4,), dtype=float32, description=The box around the mask, in TFDS format.),
        'crop_box': BBoxFeature(shape=(4,), dtype=float32, description=The crop of the image used to generate the mask, in TFDS format.),
        'id': Scalar(shape=(), dtype=uint64, description=Identifier for the annotation.),
        'point_coords': Tensor(shape=(1, 2), dtype=float64, description=The point coordinates input to the model to generate the mask.),
        'predicted_iou': Scalar(shape=(), dtype=float64, description=The model's own prediction of the mask's quality.),
        'segmentation': FeaturesDict({
            'counts': string,
            'size': Tensor(shape=(2,), dtype=uint64),
        }),
        'stability_score': Scalar(shape=(), dtype=float64, description=A measure of the mask's quality.),
    }),
    'image': FeaturesDict({
        'content': Image(shape=(None, None, 3), dtype=uint8, description=Content of the image.),
        'file_name': string,
        'height': uint64,
        'image_id': uint64,
        'width': uint64,
    }),
})

مستندات ویژگی :

ویژگی	کلاس	شکل	نوع D	توضیحات
	FeaturesDict
حاشیه نویسی	دنباله
حاشیه نویسی/منطقه	اسکالر		uint64	ناحیه بر حسب پیکسل ماسک.
حاشیه نویسی/bbox	ویژگی BBox	(4،)	float32	جعبه دور ماسک، با فرمت TFDS.
حاشیه نویسی/crop_box	ویژگی BBox	(4،)	float32	برش تصویر مورد استفاده برای تولید ماسک، در قالب TFDS.
حاشیه نویسی / شناسه	اسکالر		uint64	شناسه برای حاشیه نویسی.
حاشیه نویسی/point_coords	تانسور	(1، 2)	float64	نقطه، ورودی مدل را برای تولید ماسک مختصات می کند.
حاشیه نویسی/predicted_iou	اسکالر		float64	پیش بینی خود مدل از کیفیت ماسک.
حاشیه نویسی / بخش بندی	FeaturesDict			ماسک تقسیم بندی رمزگذاری شده در قالب COCO RLE (دیکت با `size` کلیدها و `counts` ).
حاشیه نویسی / بخش بندی / شمارش	تانسور		رشته
حاشیه نویسی / بخش بندی / اندازه	تانسور	(2،)	uint64
حاشیه نویسی/پایداری_نمره	اسکالر		float64	معیاری برای سنجش کیفیت ماسک
تصویر	FeaturesDict
تصویر/محتوا	تصویر	(هیچ، هیچ، 3)	uint8	محتوای تصویر.
تصویر/نام_فایل	تانسور		رشته
تصویر/ارتفاع	تانسور		uint64
image/image_id	تانسور		uint64
تصویر/عرض	تانسور		uint64

کلیدهای نظارت شده (به as_supervised doc مراجعه کنید): None
شکل ( tfds.show_examples ): پشتیبانی نمی شود.
مثال‌ها ( tfds.as_dataframe ):

نقل قول :

@misc{kirillov2023segment,
  title={Segment Anything},
  author={Alexander Kirillov and Eric Mintun and Nikhila Ravi and Hanzi Mao and Chloe Rolland and Laura Gustafson and Tete Xiao and Spencer Whitehead and Alexander C. Berg and Wan-Yen Lo and Piotr Dollár and Ross Girshick},
  year={2023},
  eprint={2304.02643},
  archivePrefix={arXiv},
  primaryClass={cs.CV}
}