TFDS는 이제 Croissant 🥐 형식을 지원합니다! 자세한 내용은 설명서를 읽어보세요.

이 페이지는 Cloud Translation API를 통해 번역되었습니다.

rlu_rwrl

설명 :

RL Unplugged는 오프라인 강화 학습을 위한 벤치마크 모음입니다. RL Unplugged는 다음 고려 사항을 중심으로 설계되었습니다. 사용 편의성을 높이기 위해 일반 파이프라인이 설정되면 실무자가 제품군의 모든 데이터로 쉽게 작업할 수 있도록 하는 통합 API와 함께 데이터 세트를 제공합니다.

데이터 세트는 RLDS 형식 을 따라 단계와 에피소드를 나타냅니다.

데이터세트의 예는 https://arxiv.org/abs/1904.12901에 설명된 대로 부분적으로 온라인으로 훈련된 에이전트를 실행할 때 저장된 SAR 전환을 나타냅니다. 우리는 https://github.com/google-research 에 지정된 RLDS 데이터세트 형식을 따릅니다. /rlds#dataset-format

총 8가지 작업에 대한 40개의 데이터 세트를 공개합니다. 카트폴, 보행기, 네 발 달린 동물 및 휴머노이드 작업에 대한 결합된 도전과 쉬운 결합 도전이 없습니다. 각 작업에는 1%, 5%, 20%, 40% 및 100%의 5가지 크기의 데이터 세트가 포함됩니다. 더 작은 데이터 세트가 더 큰 데이터 세트의 하위 집합이 되는 것은 아닙니다. 데이터 세트 생성 방법에 대한 자세한 내용은 논문을 참조하십시오.

홈페이지 : https://github.com/deepmind/deepmind-research/tree/master/rl_unplugged
소스 코드 : tfds.rl_unplugged.rlu_rwrl.RluRwrl
버전 :
- 1.0.0 : 최초 릴리스.
- 1.0.1 (기본값): 휴머노이드 데이터세트 중 하나에 중복된 에피소드 ID가 있는 RLU RWRL 데이터세트의 버그를 수정합니다.
다운로드 크기 : Unknown size
감독된 키 ( as_supervised 문서 참조): None
그림 ( tfds.show_examples ): 지원되지 않습니다.
인용 :

@misc{gulcehre2020rl,
    title={RL Unplugged: Benchmarks for Offline Reinforcement Learning},
    author={Caglar Gulcehre and Ziyu Wang and Alexander Novikov and Tom Le Paine
        and  Sergio Gómez Colmenarejo and Konrad Zolna and Rishabh Agarwal and
        Josh Merel and Daniel Mankowitz and Cosmin Paduraru and Gabriel
        Dulac-Arnold and Jerry Li and Mohammad Norouzi and Matt Hoffman and
        Ofir Nachum and George Tucker and Nicolas Heess and Nando deFreitas},
    year={2020},
    eprint={2006.13888},
    archivePrefix={arXiv},
    primaryClass={cs.LG}
}

rlu_rwrl/cartpole_swingup_combined_challenge_none_1_percent(기본 구성)

데이터 세트 크기 : 172.43 KiB
자동 캐시 ( 문서 ): 예
분할 :

나뉘다	예
`'train'`	5

기능 구조 :

FeaturesDict({
    'episode_return': float32,
    'steps': Dataset({
        'action': Tensor(shape=(1,), dtype=float32),
        'discount': Tensor(shape=(1,), dtype=float32),
        'is_first': bool,
        'is_last': bool,
        'is_terminal': bool,
        'observation': FeaturesDict({
            'position': Tensor(shape=(3,), dtype=float32),
            'velocity': Tensor(shape=(2,), dtype=float32),
        }),
        'reward': Tensor(shape=(1,), dtype=float32),
    }),
})

기능 문서 :

특징	수업	모양	D타입
	풍모Dict
episode_return	텐서		float32
단계	데이터세트
단계/액션	텐서	(1,)	float32
단계/할인	텐서	(1,)	float32
단계/is_first	텐서		부울
단계/is_last	텐서		부울
단계/is_terminal	텐서		부울
단계/관찰	풍모Dict
단계/관찰/위치	텐서	(삼,)	float32
단계/관찰/속도	텐서	(2,)	float32
단계/보상	텐서	(1,)	float32

예 ( tfds.as_dataframe ):

rlu_rwrl/cartpole_swingup_combined_challenge_none_5_percent

데이터 세트 크기 : 862.13 KiB
자동 캐시 ( 문서 ): 예
분할 :

나뉘다	예
`'train'`	25

기능 구조 :

FeaturesDict({
    'episode_return': float32,
    'steps': Dataset({
        'action': Tensor(shape=(1,), dtype=float32),
        'discount': Tensor(shape=(1,), dtype=float32),
        'is_first': bool,
        'is_last': bool,
        'is_terminal': bool,
        'observation': FeaturesDict({
            'position': Tensor(shape=(3,), dtype=float32),
            'velocity': Tensor(shape=(2,), dtype=float32),
        }),
        'reward': Tensor(shape=(1,), dtype=float32),
    }),
})

기능 문서 :

특징	수업	모양	D타입
	풍모Dict
episode_return	텐서		float32
단계	데이터 세트
단계/액션	텐서	(1,)	float32
단계/할인	텐서	(1,)	float32
단계/is_first	텐서		부울
단계/is_last	텐서		부울
단계/is_terminal	텐서		부울
단계/관찰	풍모Dict
단계/관찰/위치	텐서	(삼,)	float32
단계/관찰/속도	텐서	(2,)	float32
단계/보상	텐서	(1,)	float32

예 ( tfds.as_dataframe ):

rlu_rwrl/cartpole_swingup_combined_challenge_none_20_percent

데이터 세트 크기 : 3.37 MiB
자동 캐시 ( 문서 ): 예
분할 :

나뉘다	예
`'train'`	100

기능 구조 :

FeaturesDict({
    'episode_return': float32,
    'steps': Dataset({
        'action': Tensor(shape=(1,), dtype=float32),
        'discount': Tensor(shape=(1,), dtype=float32),
        'is_first': bool,
        'is_last': bool,
        'is_terminal': bool,
        'observation': FeaturesDict({
            'position': Tensor(shape=(3,), dtype=float32),
            'velocity': Tensor(shape=(2,), dtype=float32),
        }),
        'reward': Tensor(shape=(1,), dtype=float32),
    }),
})

기능 문서 :

특징	수업	모양	D타입
	풍모Dict
episode_return	텐서		float32
단계	데이터 세트
단계/액션	텐서	(1,)	float32
단계/할인	텐서	(1,)	float32
단계/is_first	텐서		부울
단계/is_last	텐서		부울
단계/is_terminal	텐서		부울
단계/관찰	풍모Dict
단계/관찰/위치	텐서	(삼,)	float32
단계/관찰/속도	텐서	(2,)	float32
단계/보상	텐서	(1,)	float32

예 ( tfds.as_dataframe ):

rlu_rwrl/cartpole_swingup_combined_challenge_none_40_percent

데이터 세트 크기 : 6.74 MiB
자동 캐시 ( 문서 ): 예
분할 :

나뉘다	예
`'train'`	200

기능 구조 :

FeaturesDict({
    'episode_return': float32,
    'steps': Dataset({
        'action': Tensor(shape=(1,), dtype=float32),
        'discount': Tensor(shape=(1,), dtype=float32),
        'is_first': bool,
        'is_last': bool,
        'is_terminal': bool,
        'observation': FeaturesDict({
            'position': Tensor(shape=(3,), dtype=float32),
            'velocity': Tensor(shape=(2,), dtype=float32),
        }),
        'reward': Tensor(shape=(1,), dtype=float32),
    }),
})

기능 문서 :

특징	수업	모양	D타입
	풍모Dict
episode_return	텐서		float32
단계	데이터세트
단계/액션	텐서	(1,)	float32
단계/할인	텐서	(1,)	float32
단계/is_first	텐서		부울
단계/is_last	텐서		부울
단계/is_terminal	텐서		부울
단계/관찰	풍모Dict
단계/관찰/위치	텐서	(삼,)	float32
단계/관찰/속도	텐서	(2,)	float32
단계/보상	텐서	(1,)	float32

예 ( tfds.as_dataframe ):

rlu_rwrl/cartpole_swingup_combined_challenge_none_100_percent

데이터 세트 크기 : 16.84 MiB
자동 캐시 ( 문서 ): 예
분할 :

나뉘다	예
`'train'`	500

기능 구조 :

FeaturesDict({
    'episode_return': float32,
    'steps': Dataset({
        'action': Tensor(shape=(1,), dtype=float32),
        'discount': Tensor(shape=(1,), dtype=float32),
        'is_first': bool,
        'is_last': bool,
        'is_terminal': bool,
        'observation': FeaturesDict({
            'position': Tensor(shape=(3,), dtype=float32),
            'velocity': Tensor(shape=(2,), dtype=float32),
        }),
        'reward': Tensor(shape=(1,), dtype=float32),
    }),
})

기능 문서 :

특징	수업	모양	D타입
	풍모Dict
episode_return	텐서		float32
단계	데이터세트
단계/액션	텐서	(1,)	float32
단계/할인	텐서	(1,)	float32
단계/is_first	텐서		부울
단계/is_last	텐서		부울
단계/is_terminal	텐서		부울
단계/관찰	풍모Dict
단계/관찰/위치	텐서	(삼,)	float32
단계/관찰/속도	텐서	(2,)	float32
단계/보상	텐서	(1,)	float32

예 ( tfds.as_dataframe ):

rlu_rwrl/quadruped_walk_combined_challenge_none_1_percent

데이터 세트 크기 : 1.77 MiB
자동 캐시 ( 문서 ): 예
분할 :

나뉘다	예
`'train'`	5

기능 구조 :

FeaturesDict({
    'episode_return': float32,
    'steps': Dataset({
        'action': Tensor(shape=(12,), dtype=float32),
        'discount': Tensor(shape=(1,), dtype=float32),
        'is_first': bool,
        'is_last': bool,
        'is_terminal': bool,
        'observation': FeaturesDict({
            'egocentric_state': Tensor(shape=(44,), dtype=float32),
            'force_torque': Tensor(shape=(24,), dtype=float32),
            'imu': Tensor(shape=(6,), dtype=float32),
            'torso_upright': Tensor(shape=(1,), dtype=float32),
            'torso_velocity': Tensor(shape=(3,), dtype=float32),
        }),
        'reward': Tensor(shape=(1,), dtype=float32),
    }),
})

기능 문서 :

특징	수업	모양	D타입
	풍모Dict
episode_return	텐서		float32
단계	데이터 세트
단계/액션	텐서	(12,)	float32
단계/할인	텐서	(1,)	float32
단계/is_first	텐서		부울
단계/is_last	텐서		부울
단계/is_terminal	텐서		부울
단계/관찰	풍모Dict
단계/관찰/egocentric_state	텐서	(44,)	float32
단계/관찰/force_torque	텐서	(24,)	float32
단계/관찰/imu	텐서	(6,)	float32
단계/관찰/torso_upright	텐서	(1,)	float32
걸음 수/관찰/torso_velocity	텐서	(삼,)	float32
단계/보상	텐서	(1,)	float32

예 ( tfds.as_dataframe ):

rlu_rwrl/quadruped_walk_combined_challenge_none_5_percent

데이터 세트 크기 : 8.86 MiB
자동 캐시 ( 문서 ): 예
분할 :

나뉘다	예
`'train'`	25

기능 구조 :

FeaturesDict({
    'episode_return': float32,
    'steps': Dataset({
        'action': Tensor(shape=(12,), dtype=float32),
        'discount': Tensor(shape=(1,), dtype=float32),
        'is_first': bool,
        'is_last': bool,
        'is_terminal': bool,
        'observation': FeaturesDict({
            'egocentric_state': Tensor(shape=(44,), dtype=float32),
            'force_torque': Tensor(shape=(24,), dtype=float32),
            'imu': Tensor(shape=(6,), dtype=float32),
            'torso_upright': Tensor(shape=(1,), dtype=float32),
            'torso_velocity': Tensor(shape=(3,), dtype=float32),
        }),
        'reward': Tensor(shape=(1,), dtype=float32),
    }),
})

기능 문서 :

특징	수업	모양	D타입
	풍모Dict
episode_return	텐서		float32
단계	데이터 세트
단계/액션	텐서	(12,)	float32
단계/할인	텐서	(1,)	float32
단계/is_first	텐서		부울
단계/is_last	텐서		부울
단계/is_terminal	텐서		부울
단계/관찰	풍모Dict
단계/관찰/egocentric_state	텐서	(44,)	float32
단계/관찰/force_torque	텐서	(24,)	float32
단계/관찰/imu	텐서	(6,)	float32
단계/관찰/torso_upright	텐서	(1,)	float32
걸음 수/관찰/torso_velocity	텐서	(삼,)	float32
단계/보상	텐서	(1,)	float32

예 ( tfds.as_dataframe ):

rlu_rwrl/quadruped_walk_combined_challenge_none_20_percent

데이터 세트 크기 : 35.46 MiB
자동 캐시 ( 문서 ): 예
분할 :

나뉘다	예
`'train'`	100

기능 구조 :

FeaturesDict({
    'episode_return': float32,
    'steps': Dataset({
        'action': Tensor(shape=(12,), dtype=float32),
        'discount': Tensor(shape=(1,), dtype=float32),
        'is_first': bool,
        'is_last': bool,
        'is_terminal': bool,
        'observation': FeaturesDict({
            'egocentric_state': Tensor(shape=(44,), dtype=float32),
            'force_torque': Tensor(shape=(24,), dtype=float32),
            'imu': Tensor(shape=(6,), dtype=float32),
            'torso_upright': Tensor(shape=(1,), dtype=float32),
            'torso_velocity': Tensor(shape=(3,), dtype=float32),
        }),
        'reward': Tensor(shape=(1,), dtype=float32),
    }),
})

기능 문서 :

특징	수업	모양	D타입
	풍모Dict
episode_return	텐서		float32
단계	데이터세트
단계/액션	텐서	(12,)	float32
단계/할인	텐서	(1,)	float32
단계/is_first	텐서		부울
단계/is_last	텐서		부울
단계/is_terminal	텐서		부울
단계/관찰	풍모Dict
단계/관찰/egocentric_state	텐서	(44,)	float32
단계/관찰/force_torque	텐서	(24,)	float32
단계/관찰/imu	텐서	(6,)	float32
단계/관찰/torso_upright	텐서	(1,)	float32
걸음 수/관찰/torso_velocity	텐서	(삼,)	float32
단계/보상	텐서	(1,)	float32

예 ( tfds.as_dataframe ):

rlu_rwrl/quadruped_walk_combined_challenge_none_40_percent

데이터 세트 크기 : 70.92 MiB
자동 캐시 ( 문서 ): 예
분할 :

나뉘다	예
`'train'`	200

기능 구조 :

FeaturesDict({
    'episode_return': float32,
    'steps': Dataset({
        'action': Tensor(shape=(12,), dtype=float32),
        'discount': Tensor(shape=(1,), dtype=float32),
        'is_first': bool,
        'is_last': bool,
        'is_terminal': bool,
        'observation': FeaturesDict({
            'egocentric_state': Tensor(shape=(44,), dtype=float32),
            'force_torque': Tensor(shape=(24,), dtype=float32),
            'imu': Tensor(shape=(6,), dtype=float32),
            'torso_upright': Tensor(shape=(1,), dtype=float32),
            'torso_velocity': Tensor(shape=(3,), dtype=float32),
        }),
        'reward': Tensor(shape=(1,), dtype=float32),
    }),
})

기능 문서 :

특징	수업	모양	D타입
	풍모Dict
episode_return	텐서		float32
단계	데이터세트
단계/액션	텐서	(12,)	float32
단계/할인	텐서	(1,)	float32
단계/is_first	텐서		부울
단계/is_last	텐서		부울
단계/is_terminal	텐서		부울
단계/관찰	풍모Dict
단계/관찰/egocentric_state	텐서	(44,)	float32
단계/관찰/force_torque	텐서	(24,)	float32
단계/관찰/imu	텐서	(6,)	float32
단계/관찰/torso_upright	텐서	(1,)	float32
걸음 수/관찰/torso_velocity	텐서	(삼,)	float32
단계/보상	텐서	(1,)	float32

예 ( tfds.as_dataframe ):

rlu_rwrl/quadruped_walk_combined_challenge_none_100_percent

데이터 세트 크기 : 177.29 MiB
자동 캐시 됨( 문서 ): shuffle_files=False (트레인)인 경우에만
분할 :

나뉘다	예
`'train'`	500

기능 구조 :

FeaturesDict({
    'episode_return': float32,
    'steps': Dataset({
        'action': Tensor(shape=(12,), dtype=float32),
        'discount': Tensor(shape=(1,), dtype=float32),
        'is_first': bool,
        'is_last': bool,
        'is_terminal': bool,
        'observation': FeaturesDict({
            'egocentric_state': Tensor(shape=(44,), dtype=float32),
            'force_torque': Tensor(shape=(24,), dtype=float32),
            'imu': Tensor(shape=(6,), dtype=float32),
            'torso_upright': Tensor(shape=(1,), dtype=float32),
            'torso_velocity': Tensor(shape=(3,), dtype=float32),
        }),
        'reward': Tensor(shape=(1,), dtype=float32),
    }),
})

기능 문서 :

특징	수업	모양	D타입
	풍모Dict
episode_return	텐서		float32
단계	데이터 세트
단계/액션	텐서	(12,)	float32
단계/할인	텐서	(1,)	float32
단계/is_first	텐서		부울
단계/is_last	텐서		부울
단계/is_terminal	텐서		부울
단계/관찰	풍모Dict
단계/관찰/egocentric_state	텐서	(44,)	float32
단계/관찰/force_torque	텐서	(24,)	float32
단계/관찰/imu	텐서	(6,)	float32
단계/관찰/torso_upright	텐서	(1,)	float32
걸음 수/관찰/torso_velocity	텐서	(삼,)	float32
단계/보상	텐서	(1,)	float32

예 ( tfds.as_dataframe ):

rlu_rwrl/walker_walk_combined_challenge_none_1_percent

데이터 세트 크기 : 6.27 MiB
자동 캐시 ( 문서 ): 예
분할 :

나뉘다	예
`'train'`	50

기능 구조 :

FeaturesDict({
    'episode_return': float32,
    'steps': Dataset({
        'action': Tensor(shape=(6,), dtype=float32),
        'discount': Tensor(shape=(1,), dtype=float32),
        'is_first': bool,
        'is_last': bool,
        'is_terminal': bool,
        'observation': FeaturesDict({
            'height': Tensor(shape=(1,), dtype=float32),
            'orientations': Tensor(shape=(14,), dtype=float32),
            'velocity': Tensor(shape=(9,), dtype=float32),
        }),
        'reward': Tensor(shape=(1,), dtype=float32),
    }),
})

기능 문서 :

특징	수업	모양	D타입
	풍모Dict
episode_return	텐서		float32
단계	데이터 세트
단계/액션	텐서	(6,)	float32
단계/할인	텐서	(1,)	float32
단계/is_first	텐서		부울
단계/is_last	텐서		부울
단계/is_terminal	텐서		부울
단계/관찰	풍모Dict
계단/관찰/높이	텐서	(1,)	float32
단계/관찰/방향	텐서	(14,)	float32
단계/관찰/속도	텐서	(9,)	float32
단계/보상	텐서	(1,)	float32

예 ( tfds.as_dataframe ):

rlu_rwrl/walker_walk_combined_challenge_none_5_percent

데이터 세트 크기 : 31.34 MiB
자동 캐시 ( 문서 ): 예
분할 :

나뉘다	예
`'train'`	250

기능 구조 :

FeaturesDict({
    'episode_return': float32,
    'steps': Dataset({
        'action': Tensor(shape=(6,), dtype=float32),
        'discount': Tensor(shape=(1,), dtype=float32),
        'is_first': bool,
        'is_last': bool,
        'is_terminal': bool,
        'observation': FeaturesDict({
            'height': Tensor(shape=(1,), dtype=float32),
            'orientations': Tensor(shape=(14,), dtype=float32),
            'velocity': Tensor(shape=(9,), dtype=float32),
        }),
        'reward': Tensor(shape=(1,), dtype=float32),
    }),
})

기능 문서 :

특징	수업	모양	D타입
	풍모Dict
episode_return	텐서		float32
단계	데이터 세트
단계/액션	텐서	(6,)	float32
단계/할인	텐서	(1,)	float32
단계/is_first	텐서		부울
단계/is_last	텐서		부울
단계/is_terminal	텐서		부울
단계/관찰	풍모Dict
계단/관찰/높이	텐서	(1,)	float32
단계/관찰/방향	텐서	(14,)	float32
단계/관찰/속도	텐서	(9,)	float32
단계/보상	텐서	(1,)	float32

예 ( tfds.as_dataframe ):

rlu_rwrl/walker_walk_combined_challenge_none_20_percent

데이터 세트 크기 : 125.37 MiB
자동 캐시 ( 문서 ): 예
분할 :

나뉘다	예
`'train'`	1,000

기능 구조 :

FeaturesDict({
    'episode_return': float32,
    'steps': Dataset({
        'action': Tensor(shape=(6,), dtype=float32),
        'discount': Tensor(shape=(1,), dtype=float32),
        'is_first': bool,
        'is_last': bool,
        'is_terminal': bool,
        'observation': FeaturesDict({
            'height': Tensor(shape=(1,), dtype=float32),
            'orientations': Tensor(shape=(14,), dtype=float32),
            'velocity': Tensor(shape=(9,), dtype=float32),
        }),
        'reward': Tensor(shape=(1,), dtype=float32),
    }),
})

기능 문서 :

특징	수업	모양	D타입
	풍모Dict
episode_return	텐서		float32
단계	데이터 세트
단계/액션	텐서	(6,)	float32
단계/할인	텐서	(1,)	float32
단계/is_first	텐서		부울
단계/is_last	텐서		부울
단계/is_terminal	텐서		부울
단계/관찰	풍모Dict
계단/관찰/높이	텐서	(1,)	float32
단계/관찰/방향	텐서	(14,)	float32
단계/관찰/속도	텐서	(9,)	float32
단계/보상	텐서	(1,)	float32

예 ( tfds.as_dataframe ):

rlu_rwrl/walker_walk_combined_challenge_none_40_percent

데이터 세트 크기 : 250.75 MiB
자동 캐시 ( 문서 ): 아니요
분할 :

나뉘다	예
`'train'`	2,000

기능 구조 :

FeaturesDict({
    'episode_return': float32,
    'steps': Dataset({
        'action': Tensor(shape=(6,), dtype=float32),
        'discount': Tensor(shape=(1,), dtype=float32),
        'is_first': bool,
        'is_last': bool,
        'is_terminal': bool,
        'observation': FeaturesDict({
            'height': Tensor(shape=(1,), dtype=float32),
            'orientations': Tensor(shape=(14,), dtype=float32),
            'velocity': Tensor(shape=(9,), dtype=float32),
        }),
        'reward': Tensor(shape=(1,), dtype=float32),
    }),
})

기능 문서 :

특징	수업	모양	D타입
	풍모Dict
episode_return	텐서		float32
단계	데이터 세트
단계/액션	텐서	(6,)	float32
단계/할인	텐서	(1,)	float32
단계/is_first	텐서		부울
단계/is_last	텐서		부울
단계/is_terminal	텐서		부울
단계/관찰	풍모Dict
계단/관찰/높이	텐서	(1,)	float32
단계/관찰/방향	텐서	(14,)	float32
단계/관찰/속도	텐서	(9,)	float32
단계/보상	텐서	(1,)	float32

예 ( tfds.as_dataframe ):

rlu_rwrl/walker_walk_combined_challenge_none_100_percent

데이터 세트 크기 : 626.86 MiB
자동 캐시 ( 문서 ): 아니요
분할 :

나뉘다	예
`'train'`	5,000

기능 구조 :

FeaturesDict({
    'episode_return': float32,
    'steps': Dataset({
        'action': Tensor(shape=(6,), dtype=float32),
        'discount': Tensor(shape=(1,), dtype=float32),
        'is_first': bool,
        'is_last': bool,
        'is_terminal': bool,
        'observation': FeaturesDict({
            'height': Tensor(shape=(1,), dtype=float32),
            'orientations': Tensor(shape=(14,), dtype=float32),
            'velocity': Tensor(shape=(9,), dtype=float32),
        }),
        'reward': Tensor(shape=(1,), dtype=float32),
    }),
})

기능 문서 :

특징	수업	모양	D타입
	풍모Dict
episode_return	텐서		float32
단계	데이터 세트
단계/액션	텐서	(6,)	float32
단계/할인	텐서	(1,)	float32
단계/is_first	텐서		부울
단계/is_last	텐서		부울
단계/is_terminal	텐서		부울
단계/관찰	풍모Dict
계단/관찰/높이	텐서	(1,)	float32
단계/관찰/방향	텐서	(14,)	float32
단계/관찰/속도	텐서	(9,)	float32
단계/보상	텐서	(1,)	float32

예 ( tfds.as_dataframe ):

rlu_rwrl/humanoid_walk_combined_challenge_none_1_percent

데이터 세트 크기 : 69.40 MiB
자동 캐시 ( 문서 ): 예
분할 :

나뉘다	예
`'train'`	200

기능 구조 :

FeaturesDict({
    'episode_return': float32,
    'steps': Dataset({
        'action': Tensor(shape=(21,), dtype=float32),
        'discount': Tensor(shape=(1,), dtype=float32),
        'is_first': bool,
        'is_last': bool,
        'is_terminal': bool,
        'observation': FeaturesDict({
            'com_velocity': Tensor(shape=(3,), dtype=float32),
            'extremities': Tensor(shape=(12,), dtype=float32),
            'head_height': Tensor(shape=(1,), dtype=float32),
            'joint_angles': Tensor(shape=(21,), dtype=float32),
            'torso_vertical': Tensor(shape=(3,), dtype=float32),
            'velocity': Tensor(shape=(27,), dtype=float32),
        }),
        'reward': Tensor(shape=(1,), dtype=float32),
    }),
})

기능 문서 :

특징	수업	모양	D타입
	풍모Dict
episode_return	텐서		float32
단계	데이터 세트
단계/액션	텐서	(21,)	float32
단계/할인	텐서	(1,)	float32
단계/is_first	텐서		부울
단계/is_last	텐서		부울
단계/is_terminal	텐서		부울
단계/관찰	풍모Dict
단계/관찰/com_velocity	텐서	(삼,)	float32
단계/관찰/사지	텐서	(12,)	float32
단계/관찰/head_height	텐서	(1,)	float32
단계/관측/joint_angles	텐서	(21,)	float32
단계/관찰/torso_vertical	텐서	(삼,)	float32
단계/관찰/속도	텐서	(27,)	float32
단계/보상	텐서	(1,)	float32

예 ( tfds.as_dataframe ):

rlu_rwrl/humanoid_walk_combined_challenge_none_5_percent

데이터 세트 크기 : 346.98 MiB
자동 캐시 ( 문서 ): 아니요
분할 :

나뉘다	예
`'train'`	1,000

기능 구조 :

FeaturesDict({
    'episode_return': float32,
    'steps': Dataset({
        'action': Tensor(shape=(21,), dtype=float32),
        'discount': Tensor(shape=(1,), dtype=float32),
        'is_first': bool,
        'is_last': bool,
        'is_terminal': bool,
        'observation': FeaturesDict({
            'com_velocity': Tensor(shape=(3,), dtype=float32),
            'extremities': Tensor(shape=(12,), dtype=float32),
            'head_height': Tensor(shape=(1,), dtype=float32),
            'joint_angles': Tensor(shape=(21,), dtype=float32),
            'torso_vertical': Tensor(shape=(3,), dtype=float32),
            'velocity': Tensor(shape=(27,), dtype=float32),
        }),
        'reward': Tensor(shape=(1,), dtype=float32),
    }),
})

기능 문서 :

특징	수업	모양	D타입
	풍모Dict
episode_return	텐서		float32
단계	데이터 세트
단계/액션	텐서	(21,)	float32
단계/할인	텐서	(1,)	float32
단계/is_first	텐서		부울
단계/is_last	텐서		부울
단계/is_terminal	텐서		부울
단계/관찰	풍모Dict
단계/관찰/com_velocity	텐서	(삼,)	float32
단계/관찰/사지	텐서	(12,)	float32
단계/관찰/head_height	텐서	(1,)	float32
단계/관측/joint_angles	텐서	(21,)	float32
단계/관찰/torso_vertical	텐서	(삼,)	float32
단계/관찰/속도	텐서	(27,)	float32
단계/보상	텐서	(1,)	float32

예 ( tfds.as_dataframe ):

rlu_rwrl/humanoid_walk_combined_challenge_none_20_percent

데이터세트 크기 : 1.36 GiB
자동 캐시 ( 문서 ): 아니요
분할 :

나뉘다	예
`'train'`	4,000

기능 구조 :

FeaturesDict({
    'episode_return': float32,
    'steps': Dataset({
        'action': Tensor(shape=(21,), dtype=float32),
        'discount': Tensor(shape=(1,), dtype=float32),
        'is_first': bool,
        'is_last': bool,
        'is_terminal': bool,
        'observation': FeaturesDict({
            'com_velocity': Tensor(shape=(3,), dtype=float32),
            'extremities': Tensor(shape=(12,), dtype=float32),
            'head_height': Tensor(shape=(1,), dtype=float32),
            'joint_angles': Tensor(shape=(21,), dtype=float32),
            'torso_vertical': Tensor(shape=(3,), dtype=float32),
            'velocity': Tensor(shape=(27,), dtype=float32),
        }),
        'reward': Tensor(shape=(1,), dtype=float32),
    }),
})

기능 문서 :

특징	수업	모양	D타입
	풍모Dict
episode_return	텐서		float32
단계	데이터 세트
단계/액션	텐서	(21,)	float32
단계/할인	텐서	(1,)	float32
단계/is_first	텐서		부울
단계/is_last	텐서		부울
단계/is_terminal	텐서		부울
단계/관찰	풍모Dict
단계/관찰/com_velocity	텐서	(삼,)	float32
단계/관찰/사지	텐서	(12,)	float32
단계/관찰/head_height	텐서	(1,)	float32
단계/관측/joint_angles	텐서	(21,)	float32
단계/관찰/torso_vertical	텐서	(삼,)	float32
단계/관찰/속도	텐서	(27,)	float32
단계/보상	텐서	(1,)	float32

예 ( tfds.as_dataframe ):

rlu_rwrl/humanoid_walk_combined_challenge_none_40_percent

데이터세트 크기 : 2.71 GiB
자동 캐시 ( 문서 ): 아니요
분할 :

나뉘다	예
`'train'`	8,000

기능 구조 :

FeaturesDict({
    'episode_return': float32,
    'steps': Dataset({
        'action': Tensor(shape=(21,), dtype=float32),
        'discount': Tensor(shape=(1,), dtype=float32),
        'is_first': bool,
        'is_last': bool,
        'is_terminal': bool,
        'observation': FeaturesDict({
            'com_velocity': Tensor(shape=(3,), dtype=float32),
            'extremities': Tensor(shape=(12,), dtype=float32),
            'head_height': Tensor(shape=(1,), dtype=float32),
            'joint_angles': Tensor(shape=(21,), dtype=float32),
            'torso_vertical': Tensor(shape=(3,), dtype=float32),
            'velocity': Tensor(shape=(27,), dtype=float32),
        }),
        'reward': Tensor(shape=(1,), dtype=float32),
    }),
})

기능 문서 :

특징	수업	모양	D타입
	풍모Dict
episode_return	텐서		float32
단계	데이터세트
단계/액션	텐서	(21,)	float32
단계/할인	텐서	(1,)	float32
단계/is_first	텐서		부울
단계/is_last	텐서		부울
단계/is_terminal	텐서		부울
단계/관찰	풍모Dict
단계/관찰/com_velocity	텐서	(삼,)	float32
단계/관찰/사지	텐서	(12,)	float32
단계/관찰/head_height	텐서	(1,)	float32
단계/관측/joint_angles	텐서	(21,)	float32
단계/관찰/torso_vertical	텐서	(삼,)	float32
단계/관찰/속도	텐서	(27,)	float32
단계/보상	텐서	(1,)	float32

예 ( tfds.as_dataframe ):

rlu_rwrl/humanoid_walk_combined_challenge_none_100_percent

데이터세트 크기 : 6.78 GiB
자동 캐시 ( 문서 ): 아니요
분할 :

나뉘다	예
`'train'`	20,000

기능 구조 :

FeaturesDict({
    'episode_return': float32,
    'steps': Dataset({
        'action': Tensor(shape=(21,), dtype=float32),
        'discount': Tensor(shape=(1,), dtype=float32),
        'is_first': bool,
        'is_last': bool,
        'is_terminal': bool,
        'observation': FeaturesDict({
            'com_velocity': Tensor(shape=(3,), dtype=float32),
            'extremities': Tensor(shape=(12,), dtype=float32),
            'head_height': Tensor(shape=(1,), dtype=float32),
            'joint_angles': Tensor(shape=(21,), dtype=float32),
            'torso_vertical': Tensor(shape=(3,), dtype=float32),
            'velocity': Tensor(shape=(27,), dtype=float32),
        }),
        'reward': Tensor(shape=(1,), dtype=float32),
    }),
})

기능 문서 :

특징	수업	모양	D타입
	풍모Dict
episode_return	텐서		float32
단계	데이터 세트
단계/액션	텐서	(21,)	float32
단계/할인	텐서	(1,)	float32
단계/is_first	텐서		부울
단계/is_last	텐서		부울
단계/is_terminal	텐서		부울
단계/관찰	풍모Dict
단계/관찰/com_velocity	텐서	(삼,)	float32
단계/관찰/사지	텐서	(12,)	float32
단계/관찰/head_height	텐서	(1,)	float32
단계/관측/joint_angles	텐서	(21,)	float32
단계/관찰/torso_vertical	텐서	(삼,)	float32
단계/관찰/속도	텐서	(27,)	float32
단계/보상	텐서	(1,)	float32

예 ( tfds.as_dataframe ):

rlu_rwrl/cartpole_swingup_combined_challenge_easy_1_percent

데이터 세트 크기 : 369.84 KiB
자동 캐시 ( 문서 ): 예
분할 :

나뉘다	예
`'train'`	5

기능 구조 :

FeaturesDict({
    'episode_return': float32,
    'steps': Dataset({
        'action': Tensor(shape=(1,), dtype=float32),
        'discount': Tensor(shape=(1,), dtype=float32),
        'is_first': bool,
        'is_last': bool,
        'is_terminal': bool,
        'observation': FeaturesDict({
            'dummy-0': Tensor(shape=(1,), dtype=float32),
            'dummy-1': Tensor(shape=(1,), dtype=float32),
            'dummy-2': Tensor(shape=(1,), dtype=float32),
            'dummy-3': Tensor(shape=(1,), dtype=float32),
            'dummy-4': Tensor(shape=(1,), dtype=float32),
            'dummy-5': Tensor(shape=(1,), dtype=float32),
            'dummy-6': Tensor(shape=(1,), dtype=float32),
            'dummy-7': Tensor(shape=(1,), dtype=float32),
            'dummy-8': Tensor(shape=(1,), dtype=float32),
            'dummy-9': Tensor(shape=(1,), dtype=float32),
            'position': Tensor(shape=(3,), dtype=float32),
            'velocity': Tensor(shape=(2,), dtype=float32),
        }),
        'reward': Tensor(shape=(1,), dtype=float32),
    }),
})

기능 문서 :

특징	수업	모양	D타입
	풍모Dict
episode_return	텐서		float32
단계	데이터 세트
단계/액션	텐서	(1,)	float32
단계/할인	텐서	(1,)	float32
단계/is_first	텐서		부울
단계/is_last	텐서		부울
단계/is_terminal	텐서		부울
단계/관찰	풍모Dict
단계/관찰/dummy-0	텐서	(1,)	float32
단계/관찰/dummy-1	텐서	(1,)	float32
단계/관찰/더미-2	텐서	(1,)	float32
단계/관찰/더미-3	텐서	(1,)	float32
단계/관찰/더미-4	텐서	(1,)	float32
단계/관찰/dummy-5	텐서	(1,)	float32
단계/관찰/dummy-6	텐서	(1,)	float32
단계/관찰/dummy-7	텐서	(1,)	float32
단계/관찰/dummy-8	텐서	(1,)	float32
단계/관찰/dummy-9	텐서	(1,)	float32
단계/관찰/위치	텐서	(삼,)	float32
단계/관찰/속도	텐서	(2,)	float32
단계/보상	텐서	(1,)	float32

예 ( tfds.as_dataframe ):

rlu_rwrl/cartpole_swingup_combined_challenge_easy_5_percent

데이터 세트 크기 : 1.81 MiB
자동 캐시 ( 문서 ): 예
분할 :

나뉘다	예
`'train'`	25

기능 구조 :

FeaturesDict({
    'episode_return': float32,
    'steps': Dataset({
        'action': Tensor(shape=(1,), dtype=float32),
        'discount': Tensor(shape=(1,), dtype=float32),
        'is_first': bool,
        'is_last': bool,
        'is_terminal': bool,
        'observation': FeaturesDict({
            'dummy-0': Tensor(shape=(1,), dtype=float32),
            'dummy-1': Tensor(shape=(1,), dtype=float32),
            'dummy-2': Tensor(shape=(1,), dtype=float32),
            'dummy-3': Tensor(shape=(1,), dtype=float32),
            'dummy-4': Tensor(shape=(1,), dtype=float32),
            'dummy-5': Tensor(shape=(1,), dtype=float32),
            'dummy-6': Tensor(shape=(1,), dtype=float32),
            'dummy-7': Tensor(shape=(1,), dtype=float32),
            'dummy-8': Tensor(shape=(1,), dtype=float32),
            'dummy-9': Tensor(shape=(1,), dtype=float32),
            'position': Tensor(shape=(3,), dtype=float32),
            'velocity': Tensor(shape=(2,), dtype=float32),
        }),
        'reward': Tensor(shape=(1,), dtype=float32),
    }),
})

기능 문서 :

특징	수업	모양	D타입
	풍모Dict
episode_return	텐서		float32
단계	데이터 세트
단계/액션	텐서	(1,)	float32
단계/할인	텐서	(1,)	float32
단계/is_first	텐서		부울
단계/is_last	텐서		부울
단계/is_terminal	텐서		부울
단계/관찰	풍모Dict
단계/관찰/dummy-0	텐서	(1,)	float32
단계/관찰/dummy-1	텐서	(1,)	float32
단계/관찰/더미-2	텐서	(1,)	float32
단계/관찰/더미-3	텐서	(1,)	float32
단계/관찰/더미-4	텐서	(1,)	float32
단계/관찰/dummy-5	텐서	(1,)	float32
단계/관찰/dummy-6	텐서	(1,)	float32
단계/관찰/dummy-7	텐서	(1,)	float32
단계/관찰/dummy-8	텐서	(1,)	float32
단계/관찰/dummy-9	텐서	(1,)	float32
단계/관찰/위치	텐서	(삼,)	float32
단계/관찰/속도	텐서	(2,)	float32
단계/보상	텐서	(1,)	float32

예 ( tfds.as_dataframe ):

rlu_rwrl/cartpole_swingup_combined_challenge_easy_20_percent

데이터 세트 크기 : 7.22 MiB
자동 캐시 ( 문서 ): 예
분할 :

나뉘다	예
`'train'`	100

기능 구조 :

FeaturesDict({
    'episode_return': float32,
    'steps': Dataset({
        'action': Tensor(shape=(1,), dtype=float32),
        'discount': Tensor(shape=(1,), dtype=float32),
        'is_first': bool,
        'is_last': bool,
        'is_terminal': bool,
        'observation': FeaturesDict({
            'dummy-0': Tensor(shape=(1,), dtype=float32),
            'dummy-1': Tensor(shape=(1,), dtype=float32),
            'dummy-2': Tensor(shape=(1,), dtype=float32),
            'dummy-3': Tensor(shape=(1,), dtype=float32),
            'dummy-4': Tensor(shape=(1,), dtype=float32),
            'dummy-5': Tensor(shape=(1,), dtype=float32),
            'dummy-6': Tensor(shape=(1,), dtype=float32),
            'dummy-7': Tensor(shape=(1,), dtype=float32),
            'dummy-8': Tensor(shape=(1,), dtype=float32),
            'dummy-9': Tensor(shape=(1,), dtype=float32),
            'position': Tensor(shape=(3,), dtype=float32),
            'velocity': Tensor(shape=(2,), dtype=float32),
        }),
        'reward': Tensor(shape=(1,), dtype=float32),
    }),
})

기능 문서 :

특징	수업	모양	D타입
	풍모Dict
episode_return	텐서		float32
단계	데이터세트
단계/액션	텐서	(1,)	float32
단계/할인	텐서	(1,)	float32
단계/is_first	텐서		부울
단계/is_last	텐서		부울
단계/is_terminal	텐서		부울
단계/관찰	풍모Dict
단계/관찰/dummy-0	텐서	(1,)	float32
단계/관찰/dummy-1	텐서	(1,)	float32
단계/관찰/더미-2	텐서	(1,)	float32
단계/관찰/더미-3	텐서	(1,)	float32
단계/관찰/더미-4	텐서	(1,)	float32
단계/관찰/dummy-5	텐서	(1,)	float32
단계/관찰/dummy-6	텐서	(1,)	float32
단계/관찰/dummy-7	텐서	(1,)	float32
단계/관찰/dummy-8	텐서	(1,)	float32
단계/관찰/dummy-9	텐서	(1,)	float32
단계/관찰/위치	텐서	(삼,)	float32
단계/관찰/속도	텐서	(2,)	float32
단계/보상	텐서	(1,)	float32

예 ( tfds.as_dataframe ):

rlu_rwrl/cartpole_swingup_combined_challenge_easy_40_percent

데이터 세트 크기 : 14.45 MiB
자동 캐시 ( 문서 ): 예
분할 :

나뉘다	예
`'train'`	200

기능 구조 :

FeaturesDict({
    'episode_return': float32,
    'steps': Dataset({
        'action': Tensor(shape=(1,), dtype=float32),
        'discount': Tensor(shape=(1,), dtype=float32),
        'is_first': bool,
        'is_last': bool,
        'is_terminal': bool,
        'observation': FeaturesDict({
            'dummy-0': Tensor(shape=(1,), dtype=float32),
            'dummy-1': Tensor(shape=(1,), dtype=float32),
            'dummy-2': Tensor(shape=(1,), dtype=float32),
            'dummy-3': Tensor(shape=(1,), dtype=float32),
            'dummy-4': Tensor(shape=(1,), dtype=float32),
            'dummy-5': Tensor(shape=(1,), dtype=float32),
            'dummy-6': Tensor(shape=(1,), dtype=float32),
            'dummy-7': Tensor(shape=(1,), dtype=float32),
            'dummy-8': Tensor(shape=(1,), dtype=float32),
            'dummy-9': Tensor(shape=(1,), dtype=float32),
            'position': Tensor(shape=(3,), dtype=float32),
            'velocity': Tensor(shape=(2,), dtype=float32),
        }),
        'reward': Tensor(shape=(1,), dtype=float32),
    }),
})

기능 문서 :

특징	수업	모양	D타입
	풍모Dict
episode_return	텐서		float32
단계	데이터 세트
단계/액션	텐서	(1,)	float32
단계/할인	텐서	(1,)	float32
단계/is_first	텐서		부울
단계/is_last	텐서		부울
단계/is_terminal	텐서		부울
단계/관찰	풍모Dict
단계/관찰/dummy-0	텐서	(1,)	float32
단계/관찰/dummy-1	텐서	(1,)	float32
단계/관찰/더미-2	텐서	(1,)	float32
단계/관찰/더미-3	텐서	(1,)	float32
단계/관찰/더미-4	텐서	(1,)	float32
단계/관찰/dummy-5	텐서	(1,)	float32
단계/관찰/dummy-6	텐서	(1,)	float32
단계/관찰/dummy-7	텐서	(1,)	float32
단계/관찰/dummy-8	텐서	(1,)	float32
단계/관찰/dummy-9	텐서	(1,)	float32
단계/관찰/위치	텐서	(삼,)	float32
단계/관찰/속도	텐서	(2,)	float32
단계/보상	텐서	(1,)	float32

예 ( tfds.as_dataframe ):

rlu_rwrl/cartpole_swingup_combined_challenge_easy_100_percent

데이터 세트 크기 : 36.12 MiB
자동 캐시 ( 문서 ): 예
분할 :

나뉘다	예
`'train'`	500

기능 구조 :

FeaturesDict({
    'episode_return': float32,
    'steps': Dataset({
        'action': Tensor(shape=(1,), dtype=float32),
        'discount': Tensor(shape=(1,), dtype=float32),
        'is_first': bool,
        'is_last': bool,
        'is_terminal': bool,
        'observation': FeaturesDict({
            'dummy-0': Tensor(shape=(1,), dtype=float32),
            'dummy-1': Tensor(shape=(1,), dtype=float32),
            'dummy-2': Tensor(shape=(1,), dtype=float32),
            'dummy-3': Tensor(shape=(1,), dtype=float32),
            'dummy-4': Tensor(shape=(1,), dtype=float32),
            'dummy-5': Tensor(shape=(1,), dtype=float32),
            'dummy-6': Tensor(shape=(1,), dtype=float32),
            'dummy-7': Tensor(shape=(1,), dtype=float32),
            'dummy-8': Tensor(shape=(1,), dtype=float32),
            'dummy-9': Tensor(shape=(1,), dtype=float32),
            'position': Tensor(shape=(3,), dtype=float32),
            'velocity': Tensor(shape=(2,), dtype=float32),
        }),
        'reward': Tensor(shape=(1,), dtype=float32),
    }),
})

기능 문서 :

특징	수업	모양	D타입
	풍모Dict
episode_return	텐서		float32
단계	데이터 세트
단계/액션	텐서	(1,)	float32
단계/할인	텐서	(1,)	float32
단계/is_first	텐서		부울
단계/is_last	텐서		부울
단계/is_terminal	텐서		부울
단계/관찰	풍모Dict
단계/관찰/dummy-0	텐서	(1,)	float32
단계/관찰/dummy-1	텐서	(1,)	float32
단계/관찰/더미-2	텐서	(1,)	float32
단계/관찰/더미-3	텐서	(1,)	float32
단계/관찰/더미-4	텐서	(1,)	float32
단계/관찰/dummy-5	텐서	(1,)	float32
단계/관찰/dummy-6	텐서	(1,)	float32
단계/관찰/dummy-7	텐서	(1,)	float32
단계/관찰/dummy-8	텐서	(1,)	float32
단계/관찰/dummy-9	텐서	(1,)	float32
단계/관찰/위치	텐서	(삼,)	float32
단계/관찰/속도	텐서	(2,)	float32
단계/보상	텐서	(1,)	float32

예 ( tfds.as_dataframe ):

rlu_rwrl/quadruped_walk_combined_challenge_easy_1_percent

데이터 세트 크기 : 1.97 MiB
자동 캐시 ( 문서 ): 예
분할 :

나뉘다	예
`'train'`	5

기능 구조 :

FeaturesDict({
    'episode_return': float32,
    'steps': Dataset({
        'action': Tensor(shape=(12,), dtype=float32),
        'discount': Tensor(shape=(1,), dtype=float32),
        'is_first': bool,
        'is_last': bool,
        'is_terminal': bool,
        'observation': FeaturesDict({
            'dummy-0': Tensor(shape=(1,), dtype=float32),
            'dummy-1': Tensor(shape=(1,), dtype=float32),
            'dummy-2': Tensor(shape=(1,), dtype=float32),
            'dummy-3': Tensor(shape=(1,), dtype=float32),
            'dummy-4': Tensor(shape=(1,), dtype=float32),
            'dummy-5': Tensor(shape=(1,), dtype=float32),
            'dummy-6': Tensor(shape=(1,), dtype=float32),
            'dummy-7': Tensor(shape=(1,), dtype=float32),
            'dummy-8': Tensor(shape=(1,), dtype=float32),
            'dummy-9': Tensor(shape=(1,), dtype=float32),
            'egocentric_state': Tensor(shape=(44,), dtype=float32),
            'force_torque': Tensor(shape=(24,), dtype=float32),
            'imu': Tensor(shape=(6,), dtype=float32),
            'torso_upright': Tensor(shape=(1,), dtype=float32),
            'torso_velocity': Tensor(shape=(3,), dtype=float32),
        }),
        'reward': Tensor(shape=(1,), dtype=float32),
    }),
})

기능 문서 :

특징	수업	모양	D타입
	풍모Dict
episode_return	텐서		float32
단계	데이터세트
단계/액션	텐서	(12,)	float32
단계/할인	텐서	(1,)	float32
단계/is_first	텐서		부울
단계/is_last	텐서		부울
단계/is_terminal	텐서		부울
단계/관찰	풍모Dict
단계/관찰/dummy-0	텐서	(1,)	float32
단계/관찰/dummy-1	텐서	(1,)	float32
단계/관찰/더미-2	텐서	(1,)	float32
단계/관찰/더미-3	텐서	(1,)	float32
단계/관찰/더미-4	텐서	(1,)	float32
단계/관찰/dummy-5	텐서	(1,)	float32
단계/관찰/dummy-6	텐서	(1,)	float32
단계/관찰/dummy-7	텐서	(1,)	float32
단계/관찰/dummy-8	텐서	(1,)	float32
단계/관찰/dummy-9	텐서	(1,)	float32
단계/관찰/egocentric_state	텐서	(44,)	float32
단계/관찰/force_torque	텐서	(24,)	float32
단계/관찰/imu	텐서	(6,)	float32
단계/관찰/torso_upright	텐서	(1,)	float32
걸음 수/관찰/torso_velocity	텐서	(삼,)	float32
단계/보상	텐서	(1,)	float32

예 ( tfds.as_dataframe ):

rlu_rwrl/quadruped_walk_combined_challenge_easy_5_percent

데이터 세트 크기 : 9.83 MiB
자동 캐시 ( 문서 ): 예
분할 :

나뉘다	예
`'train'`	25

기능 구조 :

FeaturesDict({
    'episode_return': float32,
    'steps': Dataset({
        'action': Tensor(shape=(12,), dtype=float32),
        'discount': Tensor(shape=(1,), dtype=float32),
        'is_first': bool,
        'is_last': bool,
        'is_terminal': bool,
        'observation': FeaturesDict({
            'dummy-0': Tensor(shape=(1,), dtype=float32),
            'dummy-1': Tensor(shape=(1,), dtype=float32),
            'dummy-2': Tensor(shape=(1,), dtype=float32),
            'dummy-3': Tensor(shape=(1,), dtype=float32),
            'dummy-4': Tensor(shape=(1,), dtype=float32),
            'dummy-5': Tensor(shape=(1,), dtype=float32),
            'dummy-6': Tensor(shape=(1,), dtype=float32),
            'dummy-7': Tensor(shape=(1,), dtype=float32),
            'dummy-8': Tensor(shape=(1,), dtype=float32),
            'dummy-9': Tensor(shape=(1,), dtype=float32),
            'egocentric_state': Tensor(shape=(44,), dtype=float32),
            'force_torque': Tensor(shape=(24,), dtype=float32),
            'imu': Tensor(shape=(6,), dtype=float32),
            'torso_upright': Tensor(shape=(1,), dtype=float32),
            'torso_velocity': Tensor(shape=(3,), dtype=float32),
        }),
        'reward': Tensor(shape=(1,), dtype=float32),
    }),
})

기능 문서 :

특징	수업	모양	D타입
	풍모Dict
episode_return	텐서		float32
단계	데이터 세트
단계/액션	텐서	(12,)	float32
단계/할인	텐서	(1,)	float32
단계/is_first	텐서		부울
단계/is_last	텐서		부울
단계/is_terminal	텐서		부울
단계/관찰	풍모Dict
단계/관찰/dummy-0	텐서	(1,)	float32
단계/관찰/dummy-1	텐서	(1,)	float32
단계/관찰/더미-2	텐서	(1,)	float32
단계/관찰/더미-3	텐서	(1,)	float32
단계/관찰/더미-4	텐서	(1,)	float32
단계/관찰/dummy-5	텐서	(1,)	float32
단계/관찰/dummy-6	텐서	(1,)	float32
단계/관찰/dummy-7	텐서	(1,)	float32
단계/관찰/dummy-8	텐서	(1,)	float32
단계/관찰/dummy-9	텐서	(1,)	float32
단계/관찰/egocentric_state	텐서	(44,)	float32
단계/관찰/force_torque	텐서	(24,)	float32
단계/관찰/imu	텐서	(6,)	float32
단계/관찰/torso_upright	텐서	(1,)	float32
걸음 수/관찰/torso_velocity	텐서	(삼,)	float32
단계/보상	텐서	(1,)	float32

예 ( tfds.as_dataframe ):

rlu_rwrl/quadruped_walk_combined_challenge_easy_20_percent

데이터 세트 크기 : 39.31 MiB
자동 캐시 ( 문서 ): 예
분할 :

나뉘다	예
`'train'`	100

기능 구조 :

FeaturesDict({
    'episode_return': float32,
    'steps': Dataset({
        'action': Tensor(shape=(12,), dtype=float32),
        'discount': Tensor(shape=(1,), dtype=float32),
        'is_first': bool,
        'is_last': bool,
        'is_terminal': bool,
        'observation': FeaturesDict({
            'dummy-0': Tensor(shape=(1,), dtype=float32),
            'dummy-1': Tensor(shape=(1,), dtype=float32),
            'dummy-2': Tensor(shape=(1,), dtype=float32),
            'dummy-3': Tensor(shape=(1,), dtype=float32),
            'dummy-4': Tensor(shape=(1,), dtype=float32),
            'dummy-5': Tensor(shape=(1,), dtype=float32),
            'dummy-6': Tensor(shape=(1,), dtype=float32),
            'dummy-7': Tensor(shape=(1,), dtype=float32),
            'dummy-8': Tensor(shape=(1,), dtype=float32),
            'dummy-9': Tensor(shape=(1,), dtype=float32),
            'egocentric_state': Tensor(shape=(44,), dtype=float32),
            'force_torque': Tensor(shape=(24,), dtype=float32),
            'imu': Tensor(shape=(6,), dtype=float32),
            'torso_upright': Tensor(shape=(1,), dtype=float32),
            'torso_velocity': Tensor(shape=(3,), dtype=float32),
        }),
        'reward': Tensor(shape=(1,), dtype=float32),
    }),
})

기능 문서 :

특징	수업	모양	D타입
	풍모Dict
episode_return	텐서		float32
단계	데이터세트
단계/액션	텐서	(12,)	float32
단계/할인	텐서	(1,)	float32
단계/is_first	텐서		부울
단계/is_last	텐서		부울
단계/is_terminal	텐서		부울
단계/관찰	풍모Dict
단계/관찰/dummy-0	텐서	(1,)	float32
단계/관찰/dummy-1	텐서	(1,)	float32
단계/관찰/더미-2	텐서	(1,)	float32
단계/관찰/더미-3	텐서	(1,)	float32
단계/관찰/더미-4	텐서	(1,)	float32
단계/관찰/dummy-5	텐서	(1,)	float32
단계/관찰/dummy-6	텐서	(1,)	float32
단계/관찰/dummy-7	텐서	(1,)	float32
단계/관찰/dummy-8	텐서	(1,)	float32
단계/관찰/dummy-9	텐서	(1,)	float32
단계/관찰/egocentric_state	텐서	(44,)	float32
단계/관찰/force_torque	텐서	(24,)	float32
단계/관찰/imu	텐서	(6,)	float32
단계/관찰/torso_upright	텐서	(1,)	float32
걸음 수/관찰/torso_velocity	텐서	(삼,)	float32
단계/보상	텐서	(1,)	float32

예 ( tfds.as_dataframe ):

rlu_rwrl/quadruped_walk_combined_challenge_easy_40_percent

데이터 세트 크기 : 78.63 MiB
자동 캐시 ( 문서 ): 예
분할 :

나뉘다	예
`'train'`	200

기능 구조 :

FeaturesDict({
    'episode_return': float32,
    'steps': Dataset({
        'action': Tensor(shape=(12,), dtype=float32),
        'discount': Tensor(shape=(1,), dtype=float32),
        'is_first': bool,
        'is_last': bool,
        'is_terminal': bool,
        'observation': FeaturesDict({
            'dummy-0': Tensor(shape=(1,), dtype=float32),
            'dummy-1': Tensor(shape=(1,), dtype=float32),
            'dummy-2': Tensor(shape=(1,), dtype=float32),
            'dummy-3': Tensor(shape=(1,), dtype=float32),
            'dummy-4': Tensor(shape=(1,), dtype=float32),
            'dummy-5': Tensor(shape=(1,), dtype=float32),
            'dummy-6': Tensor(shape=(1,), dtype=float32),
            'dummy-7': Tensor(shape=(1,), dtype=float32),
            'dummy-8': Tensor(shape=(1,), dtype=float32),
            'dummy-9': Tensor(shape=(1,), dtype=float32),
            'egocentric_state': Tensor(shape=(44,), dtype=float32),
            'force_torque': Tensor(shape=(24,), dtype=float32),
            'imu': Tensor(shape=(6,), dtype=float32),
            'torso_upright': Tensor(shape=(1,), dtype=float32),
            'torso_velocity': Tensor(shape=(3,), dtype=float32),
        }),
        'reward': Tensor(shape=(1,), dtype=float32),
    }),
})

기능 문서 :

특징	수업	모양	D타입
	풍모Dict
episode_return	텐서		float32
단계	데이터 세트
단계/액션	텐서	(12,)	float32
단계/할인	텐서	(1,)	float32
단계/is_first	텐서		부울
단계/is_last	텐서		부울
단계/is_terminal	텐서		부울
단계/관찰	풍모Dict
단계/관찰/dummy-0	텐서	(1,)	float32
단계/관찰/dummy-1	텐서	(1,)	float32
단계/관찰/더미-2	텐서	(1,)	float32
단계/관찰/더미-3	텐서	(1,)	float32
단계/관찰/더미-4	텐서	(1,)	float32
단계/관찰/dummy-5	텐서	(1,)	float32
단계/관찰/dummy-6	텐서	(1,)	float32
단계/관찰/dummy-7	텐서	(1,)	float32
단계/관찰/dummy-8	텐서	(1,)	float32
단계/관찰/dummy-9	텐서	(1,)	float32
단계/관찰/egocentric_state	텐서	(44,)	float32
단계/관찰/force_torque	텐서	(24,)	float32
단계/관찰/imu	텐서	(6,)	float32
단계/관찰/torso_upright	텐서	(1,)	float32
걸음 수/관찰/torso_velocity	텐서	(삼,)	float32
단계/보상	텐서	(1,)	float32

예 ( tfds.as_dataframe ):

rlu_rwrl/quadruped_walk_combined_challenge_easy_100_percent

데이터 세트 크기 : 196.57 MiB
자동 캐시 됨( 문서 ): shuffle_files=False (트레인)인 경우에만
분할 :

나뉘다	예
`'train'`	500

기능 구조 :

FeaturesDict({
    'episode_return': float32,
    'steps': Dataset({
        'action': Tensor(shape=(12,), dtype=float32),
        'discount': Tensor(shape=(1,), dtype=float32),
        'is_first': bool,
        'is_last': bool,
        'is_terminal': bool,
        'observation': FeaturesDict({
            'dummy-0': Tensor(shape=(1,), dtype=float32),
            'dummy-1': Tensor(shape=(1,), dtype=float32),
            'dummy-2': Tensor(shape=(1,), dtype=float32),
            'dummy-3': Tensor(shape=(1,), dtype=float32),
            'dummy-4': Tensor(shape=(1,), dtype=float32),
            'dummy-5': Tensor(shape=(1,), dtype=float32),
            'dummy-6': Tensor(shape=(1,), dtype=float32),
            'dummy-7': Tensor(shape=(1,), dtype=float32),
            'dummy-8': Tensor(shape=(1,), dtype=float32),
            'dummy-9': Tensor(shape=(1,), dtype=float32),
            'egocentric_state': Tensor(shape=(44,), dtype=float32),
            'force_torque': Tensor(shape=(24,), dtype=float32),
            'imu': Tensor(shape=(6,), dtype=float32),
            'torso_upright': Tensor(shape=(1,), dtype=float32),
            'torso_velocity': Tensor(shape=(3,), dtype=float32),
        }),
        'reward': Tensor(shape=(1,), dtype=float32),
    }),
})

기능 문서 :

특징	수업	모양	D타입
	풍모Dict
episode_return	텐서		float32
단계	데이터 세트
단계/액션	텐서	(12,)	float32
단계/할인	텐서	(1,)	float32
단계/is_first	텐서		부울
단계/is_last	텐서		부울
단계/is_terminal	텐서		부울
단계/관찰	풍모Dict
단계/관찰/dummy-0	텐서	(1,)	float32
단계/관찰/dummy-1	텐서	(1,)	float32
단계/관찰/더미-2	텐서	(1,)	float32
단계/관찰/더미-3	텐서	(1,)	float32
단계/관찰/더미-4	텐서	(1,)	float32
단계/관찰/dummy-5	텐서	(1,)	float32
단계/관찰/dummy-6	텐서	(1,)	float32
단계/관찰/dummy-7	텐서	(1,)	float32
단계/관찰/dummy-8	텐서	(1,)	float32
단계/관찰/dummy-9	텐서	(1,)	float32
단계/관찰/egocentric_state	텐서	(44,)	float32
단계/관찰/force_torque	텐서	(24,)	float32
단계/관찰/imu	텐서	(6,)	float32
단계/관찰/torso_upright	텐서	(1,)	float32
걸음 수/관찰/torso_velocity	텐서	(삼,)	float32
단계/보상	텐서	(1,)	float32

예 ( tfds.as_dataframe ):

rlu_rwrl/walker_walk_combined_challenge_easy_1_percent

데이터 세트 크기 : 8.20 MiB
자동 캐시 ( 문서 ): 예
분할 :

나뉘다	예
`'train'`	50

기능 구조 :

FeaturesDict({
    'episode_return': float32,
    'steps': Dataset({
        'action': Tensor(shape=(6,), dtype=float32),
        'discount': Tensor(shape=(1,), dtype=float32),
        'is_first': bool,
        'is_last': bool,
        'is_terminal': bool,
        'observation': FeaturesDict({
            'dummy-0': Tensor(shape=(1,), dtype=float32),
            'dummy-1': Tensor(shape=(1,), dtype=float32),
            'dummy-2': Tensor(shape=(1,), dtype=float32),
            'dummy-3': Tensor(shape=(1,), dtype=float32),
            'dummy-4': Tensor(shape=(1,), dtype=float32),
            'dummy-5': Tensor(shape=(1,), dtype=float32),
            'dummy-6': Tensor(shape=(1,), dtype=float32),
            'dummy-7': Tensor(shape=(1,), dtype=float32),
            'dummy-8': Tensor(shape=(1,), dtype=float32),
            'dummy-9': Tensor(shape=(1,), dtype=float32),
            'height': Tensor(shape=(1,), dtype=float32),
            'orientations': Tensor(shape=(14,), dtype=float32),
            'velocity': Tensor(shape=(9,), dtype=float32),
        }),
        'reward': Tensor(shape=(1,), dtype=float32),
    }),
})

기능 문서 :

특징	수업	모양	D타입
	풍모Dict
episode_return	텐서		float32
단계	데이터 세트
단계/액션	텐서	(6,)	float32
단계/할인	텐서	(1,)	float32
단계/is_first	텐서		부울
단계/is_last	텐서		부울
단계/is_terminal	텐서		부울
단계/관찰	풍모Dict
단계/관찰/dummy-0	텐서	(1,)	float32
단계/관찰/dummy-1	텐서	(1,)	float32
단계/관찰/더미-2	텐서	(1,)	float32
단계/관찰/더미-3	텐서	(1,)	float32
단계/관찰/더미-4	텐서	(1,)	float32
단계/관찰/dummy-5	텐서	(1,)	float32
단계/관찰/dummy-6	텐서	(1,)	float32
단계/관찰/dummy-7	텐서	(1,)	float32
단계/관찰/dummy-8	텐서	(1,)	float32
단계/관찰/dummy-9	텐서	(1,)	float32
계단/관찰/높이	텐서	(1,)	float32
단계/관찰/방향	텐서	(14,)	float32
단계/관찰/속도	텐서	(9,)	float32
단계/보상	텐서	(1,)	float32

예 ( tfds.as_dataframe ):

rlu_rwrl/walker_walk_combined_challenge_easy_5_percent

데이터 세트 크기 : 40.98 MiB
자동 캐시 ( 문서 ): 예
분할 :

나뉘다	예
`'train'`	250

기능 구조 :

FeaturesDict({
    'episode_return': float32,
    'steps': Dataset({
        'action': Tensor(shape=(6,), dtype=float32),
        'discount': Tensor(shape=(1,), dtype=float32),
        'is_first': bool,
        'is_last': bool,
        'is_terminal': bool,
        'observation': FeaturesDict({
            'dummy-0': Tensor(shape=(1,), dtype=float32),
            'dummy-1': Tensor(shape=(1,), dtype=float32),
            'dummy-2': Tensor(shape=(1,), dtype=float32),
            'dummy-3': Tensor(shape=(1,), dtype=float32),
            'dummy-4': Tensor(shape=(1,), dtype=float32),
            'dummy-5': Tensor(shape=(1,), dtype=float32),
            'dummy-6': Tensor(shape=(1,), dtype=float32),
            'dummy-7': Tensor(shape=(1,), dtype=float32),
            'dummy-8': Tensor(shape=(1,), dtype=float32),
            'dummy-9': Tensor(shape=(1,), dtype=float32),
            'height': Tensor(shape=(1,), dtype=float32),
            'orientations': Tensor(shape=(14,), dtype=float32),
            'velocity': Tensor(shape=(9,), dtype=float32),
        }),
        'reward': Tensor(shape=(1,), dtype=float32),
    }),
})

기능 문서 :

특징	수업	모양	D타입
	풍모Dict
episode_return	텐서		float32
단계	데이터세트
단계/액션	텐서	(6,)	float32
단계/할인	텐서	(1,)	float32
단계/is_first	텐서		부울
단계/is_last	텐서		부울
단계/is_terminal	텐서		부울
단계/관찰	풍모Dict
단계/관찰/dummy-0	텐서	(1,)	float32
단계/관찰/dummy-1	텐서	(1,)	float32
단계/관찰/더미-2	텐서	(1,)	float32
단계/관찰/더미-3	텐서	(1,)	float32
단계/관찰/더미-4	텐서	(1,)	float32
단계/관찰/dummy-5	텐서	(1,)	float32
단계/관찰/dummy-6	텐서	(1,)	float32
단계/관찰/dummy-7	텐서	(1,)	float32
단계/관찰/dummy-8	텐서	(1,)	float32
단계/관찰/dummy-9	텐서	(1,)	float32
계단/관찰/높이	텐서	(1,)	float32
단계/관찰/방향	텐서	(14,)	float32
단계/관찰/속도	텐서	(9,)	float32
단계/보상	텐서	(1,)	float32

예 ( tfds.as_dataframe ):

rlu_rwrl/walker_walk_combined_challenge_easy_20_percent

데이터 세트 크기 : 163.93 MiB
자동 캐시 됨( 문서 ): shuffle_files=False (트레인)인 경우에만
분할 :

나뉘다	예
`'train'`	1,000

기능 구조 :

FeaturesDict({
    'episode_return': float32,
    'steps': Dataset({
        'action': Tensor(shape=(6,), dtype=float32),
        'discount': Tensor(shape=(1,), dtype=float32),
        'is_first': bool,
        'is_last': bool,
        'is_terminal': bool,
        'observation': FeaturesDict({
            'dummy-0': Tensor(shape=(1,), dtype=float32),
            'dummy-1': Tensor(shape=(1,), dtype=float32),
            'dummy-2': Tensor(shape=(1,), dtype=float32),
            'dummy-3': Tensor(shape=(1,), dtype=float32),
            'dummy-4': Tensor(shape=(1,), dtype=float32),
            'dummy-5': Tensor(shape=(1,), dtype=float32),
            'dummy-6': Tensor(shape=(1,), dtype=float32),
            'dummy-7': Tensor(shape=(1,), dtype=float32),
            'dummy-8': Tensor(shape=(1,), dtype=float32),
            'dummy-9': Tensor(shape=(1,), dtype=float32),
            'height': Tensor(shape=(1,), dtype=float32),
            'orientations': Tensor(shape=(14,), dtype=float32),
            'velocity': Tensor(shape=(9,), dtype=float32),
        }),
        'reward': Tensor(shape=(1,), dtype=float32),
    }),
})

기능 문서 :

특징	수업	모양	D타입
	풍모Dict
episode_return	텐서		float32
단계	데이터 세트
단계/액션	텐서	(6,)	float32
단계/할인	텐서	(1,)	float32
단계/is_first	텐서		부울
단계/is_last	텐서		부울
단계/is_terminal	텐서		부울
단계/관찰	풍모Dict
단계/관찰/dummy-0	텐서	(1,)	float32
단계/관찰/dummy-1	텐서	(1,)	float32
단계/관찰/더미-2	텐서	(1,)	float32
단계/관찰/더미-3	텐서	(1,)	float32
단계/관찰/더미-4	텐서	(1,)	float32
단계/관찰/dummy-5	텐서	(1,)	float32
단계/관찰/dummy-6	텐서	(1,)	float32
단계/관찰/dummy-7	텐서	(1,)	float32
단계/관찰/dummy-8	텐서	(1,)	float32
단계/관찰/dummy-9	텐서	(1,)	float32
계단/관찰/높이	텐서	(1,)	float32
단계/관찰/방향	텐서	(14,)	float32
단계/관찰/속도	텐서	(9,)	float32
단계/보상	텐서	(1,)	float32

예 ( tfds.as_dataframe ):

rlu_rwrl/walker_walk_combined_challenge_easy_40_percent

데이터 세트 크기 : 327.86 MiB
자동 캐시 ( 문서 ): 아니요
분할 :

나뉘다	예
`'train'`	2,000

기능 구조 :

FeaturesDict({
    'episode_return': float32,
    'steps': Dataset({
        'action': Tensor(shape=(6,), dtype=float32),
        'discount': Tensor(shape=(1,), dtype=float32),
        'is_first': bool,
        'is_last': bool,
        'is_terminal': bool,
        'observation': FeaturesDict({
            'dummy-0': Tensor(shape=(1,), dtype=float32),
            'dummy-1': Tensor(shape=(1,), dtype=float32),
            'dummy-2': Tensor(shape=(1,), dtype=float32),
            'dummy-3': Tensor(shape=(1,), dtype=float32),
            'dummy-4': Tensor(shape=(1,), dtype=float32),
            'dummy-5': Tensor(shape=(1,), dtype=float32),
            'dummy-6': Tensor(shape=(1,), dtype=float32),
            'dummy-7': Tensor(shape=(1,), dtype=float32),
            'dummy-8': Tensor(shape=(1,), dtype=float32),
            'dummy-9': Tensor(shape=(1,), dtype=float32),
            'height': Tensor(shape=(1,), dtype=float32),
            'orientations': Tensor(shape=(14,), dtype=float32),
            'velocity': Tensor(shape=(9,), dtype=float32),
        }),
        'reward': Tensor(shape=(1,), dtype=float32),
    }),
})

기능 문서 :

특징	수업	모양	D타입
	풍모Dict
episode_return	텐서		float32
단계	데이터 세트
단계/액션	텐서	(6,)	float32
단계/할인	텐서	(1,)	float32
단계/is_first	텐서		부울
단계/is_last	텐서		부울
단계/is_terminal	텐서		부울
단계/관찰	풍모Dict
단계/관찰/dummy-0	텐서	(1,)	float32
단계/관찰/dummy-1	텐서	(1,)	float32
단계/관찰/더미-2	텐서	(1,)	float32
단계/관찰/더미-3	텐서	(1,)	float32
단계/관찰/더미-4	텐서	(1,)	float32
단계/관찰/dummy-5	텐서	(1,)	float32
단계/관찰/dummy-6	텐서	(1,)	float32
단계/관찰/dummy-7	텐서	(1,)	float32
단계/관찰/dummy-8	텐서	(1,)	float32
단계/관찰/dummy-9	텐서	(1,)	float32
계단/관찰/높이	텐서	(1,)	float32
단계/관찰/방향	텐서	(14,)	float32
단계/관찰/속도	텐서	(9,)	float32
단계/보상	텐서	(1,)	float32

예 ( tfds.as_dataframe ):

rlu_rwrl/walker_walk_combined_challenge_easy_100_percent

데이터 세트 크기 : 819.65 MiB
자동 캐시 ( 문서 ): 아니요
분할 :

나뉘다	예
`'train'`	5,000

기능 구조 :

FeaturesDict({
    'episode_return': float32,
    'steps': Dataset({
        'action': Tensor(shape=(6,), dtype=float32),
        'discount': Tensor(shape=(1,), dtype=float32),
        'is_first': bool,
        'is_last': bool,
        'is_terminal': bool,
        'observation': FeaturesDict({
            'dummy-0': Tensor(shape=(1,), dtype=float32),
            'dummy-1': Tensor(shape=(1,), dtype=float32),
            'dummy-2': Tensor(shape=(1,), dtype=float32),
            'dummy-3': Tensor(shape=(1,), dtype=float32),
            'dummy-4': Tensor(shape=(1,), dtype=float32),
            'dummy-5': Tensor(shape=(1,), dtype=float32),
            'dummy-6': Tensor(shape=(1,), dtype=float32),
            'dummy-7': Tensor(shape=(1,), dtype=float32),
            'dummy-8': Tensor(shape=(1,), dtype=float32),
            'dummy-9': Tensor(shape=(1,), dtype=float32),
            'height': Tensor(shape=(1,), dtype=float32),
            'orientations': Tensor(shape=(14,), dtype=float32),
            'velocity': Tensor(shape=(9,), dtype=float32),
        }),
        'reward': Tensor(shape=(1,), dtype=float32),
    }),
})

기능 문서 :

특징	수업	모양	D타입
	풍모Dict
episode_return	텐서		float32
단계	데이터 세트
단계/액션	텐서	(6,)	float32
단계/할인	텐서	(1,)	float32
단계/is_first	텐서		부울
단계/is_last	텐서		부울
단계/is_terminal	텐서		부울
단계/관찰	풍모Dict
단계/관찰/dummy-0	텐서	(1,)	float32
단계/관찰/dummy-1	텐서	(1,)	float32
단계/관찰/더미-2	텐서	(1,)	float32
단계/관찰/더미-3	텐서	(1,)	float32
단계/관찰/더미-4	텐서	(1,)	float32
단계/관찰/dummy-5	텐서	(1,)	float32
단계/관찰/dummy-6	텐서	(1,)	float32
단계/관찰/dummy-7	텐서	(1,)	float32
단계/관찰/dummy-8	텐서	(1,)	float32
단계/관찰/dummy-9	텐서	(1,)	float32
계단/관찰/높이	텐서	(1,)	float32
단계/관찰/방향	텐서	(14,)	float32
단계/관찰/속도	텐서	(9,)	float32
단계/보상	텐서	(1,)	float32

예 ( tfds.as_dataframe ):

rlu_rwrl/humanoid_walk_combined_challenge_easy_1_percent

데이터 세트 크기 : 77.11 MiB
자동 캐시 ( 문서 ): 예
분할 :

나뉘다	예
`'train'`	200

기능 구조 :

FeaturesDict({
    'episode_return': float32,
    'steps': Dataset({
        'action': Tensor(shape=(21,), dtype=float32),
        'discount': Tensor(shape=(1,), dtype=float32),
        'is_first': bool,
        'is_last': bool,
        'is_terminal': bool,
        'observation': FeaturesDict({
            'com_velocity': Tensor(shape=(3,), dtype=float32),
            'dummy-0': Tensor(shape=(1,), dtype=float32),
            'dummy-1': Tensor(shape=(1,), dtype=float32),
            'dummy-2': Tensor(shape=(1,), dtype=float32),
            'dummy-3': Tensor(shape=(1,), dtype=float32),
            'dummy-4': Tensor(shape=(1,), dtype=float32),
            'dummy-5': Tensor(shape=(1,), dtype=float32),
            'dummy-6': Tensor(shape=(1,), dtype=float32),
            'dummy-7': Tensor(shape=(1,), dtype=float32),
            'dummy-8': Tensor(shape=(1,), dtype=float32),
            'dummy-9': Tensor(shape=(1,), dtype=float32),
            'extremities': Tensor(shape=(12,), dtype=float32),
            'head_height': Tensor(shape=(1,), dtype=float32),
            'joint_angles': Tensor(shape=(21,), dtype=float32),
            'torso_vertical': Tensor(shape=(3,), dtype=float32),
            'velocity': Tensor(shape=(27,), dtype=float32),
        }),
        'reward': Tensor(shape=(1,), dtype=float32),
    }),
})

기능 문서 :

특징	수업	모양	D타입
	풍모Dict
episode_return	텐서		float32
단계	데이터세트
단계/액션	텐서	(21,)	float32
단계/할인	텐서	(1,)	float32
단계/is_first	텐서		부울
단계/is_last	텐서		부울
단계/is_terminal	텐서		부울
단계/관찰	풍모Dict
단계/관찰/com_velocity	텐서	(삼,)	float32
단계/관찰/dummy-0	텐서	(1,)	float32
단계/관찰/dummy-1	텐서	(1,)	float32
단계/관찰/더미-2	텐서	(1,)	float32
단계/관찰/더미-3	텐서	(1,)	float32
단계/관찰/더미-4	텐서	(1,)	float32
단계/관찰/dummy-5	텐서	(1,)	float32
단계/관찰/dummy-6	텐서	(1,)	float32
단계/관찰/dummy-7	텐서	(1,)	float32
단계/관찰/dummy-8	텐서	(1,)	float32
단계/관찰/dummy-9	텐서	(1,)	float32
단계/관찰/사지	텐서	(12,)	float32
단계/관찰/head_height	텐서	(1,)	float32
단계/관측/joint_angles	텐서	(21,)	float32
단계/관찰/torso_vertical	텐서	(삼,)	float32
단계/관찰/속도	텐서	(27,)	float32
단계/보상	텐서	(1,)	float32

예 ( tfds.as_dataframe ):

rlu_rwrl/humanoid_walk_combined_challenge_easy_5_percent

데이터 세트 크기 : 385.54 MiB
자동 캐시 ( 문서 ): 아니요
분할 :

나뉘다	예
`'train'`	1,000

기능 구조 :

FeaturesDict({
    'episode_return': float32,
    'steps': Dataset({
        'action': Tensor(shape=(21,), dtype=float32),
        'discount': Tensor(shape=(1,), dtype=float32),
        'is_first': bool,
        'is_last': bool,
        'is_terminal': bool,
        'observation': FeaturesDict({
            'com_velocity': Tensor(shape=(3,), dtype=float32),
            'dummy-0': Tensor(shape=(1,), dtype=float32),
            'dummy-1': Tensor(shape=(1,), dtype=float32),
            'dummy-2': Tensor(shape=(1,), dtype=float32),
            'dummy-3': Tensor(shape=(1,), dtype=float32),
            'dummy-4': Tensor(shape=(1,), dtype=float32),
            'dummy-5': Tensor(shape=(1,), dtype=float32),
            'dummy-6': Tensor(shape=(1,), dtype=float32),
            'dummy-7': Tensor(shape=(1,), dtype=float32),
            'dummy-8': Tensor(shape=(1,), dtype=float32),
            'dummy-9': Tensor(shape=(1,), dtype=float32),
            'extremities': Tensor(shape=(12,), dtype=float32),
            'head_height': Tensor(shape=(1,), dtype=float32),
            'joint_angles': Tensor(shape=(21,), dtype=float32),
            'torso_vertical': Tensor(shape=(3,), dtype=float32),
            'velocity': Tensor(shape=(27,), dtype=float32),
        }),
        'reward': Tensor(shape=(1,), dtype=float32),
    }),
})

기능 문서 :

특징	수업	모양	D타입
	풍모Dict
episode_return	텐서		float32
단계	데이터세트
단계/액션	텐서	(21,)	float32
단계/할인	텐서	(1,)	float32
단계/is_first	텐서		부울
단계/is_last	텐서		부울
단계/is_terminal	텐서		부울
단계/관찰	풍모Dict
단계/관찰/com_velocity	텐서	(삼,)	float32
단계/관찰/dummy-0	텐서	(1,)	float32
단계/관찰/dummy-1	텐서	(1,)	float32
단계/관찰/더미-2	텐서	(1,)	float32
단계/관찰/더미-3	텐서	(1,)	float32
단계/관찰/더미-4	텐서	(1,)	float32
단계/관찰/dummy-5	텐서	(1,)	float32
단계/관찰/dummy-6	텐서	(1,)	float32
단계/관찰/dummy-7	텐서	(1,)	float32
단계/관찰/dummy-8	텐서	(1,)	float32
단계/관찰/dummy-9	텐서	(1,)	float32
단계/관찰/사지	텐서	(12,)	float32
단계/관찰/head_height	텐서	(1,)	float32
단계/관측/joint_angles	텐서	(21,)	float32
단계/관찰/torso_vertical	텐서	(삼,)	float32
단계/관찰/속도	텐서	(27,)	float32
단계/보상	텐서	(1,)	float32

예 ( tfds.as_dataframe ):

rlu_rwrl/humanoid_walk_combined_challenge_easy_20_percent

데이터세트 크기 : 1.51 GiB
자동 캐시 ( 문서 ): 아니요
분할 :

나뉘다	예
`'train'`	4,000

기능 구조 :

FeaturesDict({
    'episode_return': float32,
    'steps': Dataset({
        'action': Tensor(shape=(21,), dtype=float32),
        'discount': Tensor(shape=(1,), dtype=float32),
        'is_first': bool,
        'is_last': bool,
        'is_terminal': bool,
        'observation': FeaturesDict({
            'com_velocity': Tensor(shape=(3,), dtype=float32),
            'dummy-0': Tensor(shape=(1,), dtype=float32),
            'dummy-1': Tensor(shape=(1,), dtype=float32),
            'dummy-2': Tensor(shape=(1,), dtype=float32),
            'dummy-3': Tensor(shape=(1,), dtype=float32),
            'dummy-4': Tensor(shape=(1,), dtype=float32),
            'dummy-5': Tensor(shape=(1,), dtype=float32),
            'dummy-6': Tensor(shape=(1,), dtype=float32),
            'dummy-7': Tensor(shape=(1,), dtype=float32),
            'dummy-8': Tensor(shape=(1,), dtype=float32),
            'dummy-9': Tensor(shape=(1,), dtype=float32),
            'extremities': Tensor(shape=(12,), dtype=float32),
            'head_height': Tensor(shape=(1,), dtype=float32),
            'joint_angles': Tensor(shape=(21,), dtype=float32),
            'torso_vertical': Tensor(shape=(3,), dtype=float32),
            'velocity': Tensor(shape=(27,), dtype=float32),
        }),
        'reward': Tensor(shape=(1,), dtype=float32),
    }),
})

기능 문서 :

특징	수업	모양	D타입
	풍모Dict
episode_return	텐서		float32
단계	데이터 세트
단계/액션	텐서	(21,)	float32
단계/할인	텐서	(1,)	float32
단계/is_first	텐서		부울
단계/is_last	텐서		부울
단계/is_terminal	텐서		부울
단계/관찰	풍모Dict
단계/관찰/com_velocity	텐서	(삼,)	float32
단계/관찰/dummy-0	텐서	(1,)	float32
단계/관찰/dummy-1	텐서	(1,)	float32
단계/관찰/더미-2	텐서	(1,)	float32
단계/관찰/더미-3	텐서	(1,)	float32
단계/관찰/더미-4	텐서	(1,)	float32
단계/관찰/dummy-5	텐서	(1,)	float32
단계/관찰/dummy-6	텐서	(1,)	float32
단계/관찰/dummy-7	텐서	(1,)	float32
단계/관찰/dummy-8	텐서	(1,)	float32
단계/관찰/dummy-9	텐서	(1,)	float32
단계/관찰/사지	텐서	(12,)	float32
단계/관찰/head_height	텐서	(1,)	float32
단계/관측/joint_angles	텐서	(21,)	float32
단계/관찰/torso_vertical	텐서	(삼,)	float32
단계/관찰/속도	텐서	(27,)	float32
단계/보상	텐서	(1,)	float32

예 ( tfds.as_dataframe ):

rlu_rwrl/humanoid_walk_combined_challenge_easy_40_percent

데이터세트 크기 : 3.01 GiB
자동 캐시 ( 문서 ): 아니요
분할 :

나뉘다	예
`'train'`	8,000

기능 구조 :

FeaturesDict({
    'episode_return': float32,
    'steps': Dataset({
        'action': Tensor(shape=(21,), dtype=float32),
        'discount': Tensor(shape=(1,), dtype=float32),
        'is_first': bool,
        'is_last': bool,
        'is_terminal': bool,
        'observation': FeaturesDict({
            'com_velocity': Tensor(shape=(3,), dtype=float32),
            'dummy-0': Tensor(shape=(1,), dtype=float32),
            'dummy-1': Tensor(shape=(1,), dtype=float32),
            'dummy-2': Tensor(shape=(1,), dtype=float32),
            'dummy-3': Tensor(shape=(1,), dtype=float32),
            'dummy-4': Tensor(shape=(1,), dtype=float32),
            'dummy-5': Tensor(shape=(1,), dtype=float32),
            'dummy-6': Tensor(shape=(1,), dtype=float32),
            'dummy-7': Tensor(shape=(1,), dtype=float32),
            'dummy-8': Tensor(shape=(1,), dtype=float32),
            'dummy-9': Tensor(shape=(1,), dtype=float32),
            'extremities': Tensor(shape=(12,), dtype=float32),
            'head_height': Tensor(shape=(1,), dtype=float32),
            'joint_angles': Tensor(shape=(21,), dtype=float32),
            'torso_vertical': Tensor(shape=(3,), dtype=float32),
            'velocity': Tensor(shape=(27,), dtype=float32),
        }),
        'reward': Tensor(shape=(1,), dtype=float32),
    }),
})

기능 문서 :

특징	수업	모양	D타입
	풍모Dict
episode_return	텐서		float32
단계	데이터세트
단계/액션	텐서	(21,)	float32
단계/할인	텐서	(1,)	float32
단계/is_first	텐서		부울
단계/is_last	텐서		부울
단계/is_terminal	텐서		부울
단계/관찰	풍모Dict
단계/관찰/com_velocity	텐서	(삼,)	float32
단계/관찰/dummy-0	텐서	(1,)	float32
단계/관찰/dummy-1	텐서	(1,)	float32
단계/관찰/더미-2	텐서	(1,)	float32
단계/관찰/더미-3	텐서	(1,)	float32
단계/관찰/더미-4	텐서	(1,)	float32
단계/관찰/dummy-5	텐서	(1,)	float32
단계/관찰/dummy-6	텐서	(1,)	float32
단계/관찰/dummy-7	텐서	(1,)	float32
단계/관찰/dummy-8	텐서	(1,)	float32
단계/관찰/dummy-9	텐서	(1,)	float32
단계/관찰/사지	텐서	(12,)	float32
단계/관찰/head_height	텐서	(1,)	float32
단계/관측/joint_angles	텐서	(21,)	float32
단계/관찰/torso_vertical	텐서	(삼,)	float32
단계/관찰/속도	텐서	(27,)	float32
단계/보상	텐서	(1,)	float32

예 ( tfds.as_dataframe ):

rlu_rwrl/humanoid_walk_combined_challenge_easy_100_percent

데이터세트 크기 : 7.53 GiB
자동 캐시 ( 문서 ): 아니요
분할 :

나뉘다	예
`'train'`	20,000

기능 구조 :

FeaturesDict({
    'episode_return': float32,
    'steps': Dataset({
        'action': Tensor(shape=(21,), dtype=float32),
        'discount': Tensor(shape=(1,), dtype=float32),
        'is_first': bool,
        'is_last': bool,
        'is_terminal': bool,
        'observation': FeaturesDict({
            'com_velocity': Tensor(shape=(3,), dtype=float32),
            'dummy-0': Tensor(shape=(1,), dtype=float32),
            'dummy-1': Tensor(shape=(1,), dtype=float32),
            'dummy-2': Tensor(shape=(1,), dtype=float32),
            'dummy-3': Tensor(shape=(1,), dtype=float32),
            'dummy-4': Tensor(shape=(1,), dtype=float32),
            'dummy-5': Tensor(shape=(1,), dtype=float32),
            'dummy-6': Tensor(shape=(1,), dtype=float32),
            'dummy-7': Tensor(shape=(1,), dtype=float32),
            'dummy-8': Tensor(shape=(1,), dtype=float32),
            'dummy-9': Tensor(shape=(1,), dtype=float32),
            'extremities': Tensor(shape=(12,), dtype=float32),
            'head_height': Tensor(shape=(1,), dtype=float32),
            'joint_angles': Tensor(shape=(21,), dtype=float32),
            'torso_vertical': Tensor(shape=(3,), dtype=float32),
            'velocity': Tensor(shape=(27,), dtype=float32),
        }),
        'reward': Tensor(shape=(1,), dtype=float32),
    }),
})

기능 문서 :

특징	수업	모양	D타입
	풍모Dict
episode_return	텐서		float32
단계	데이터 세트
단계/액션	텐서	(21,)	float32
단계/할인	텐서	(1,)	float32
단계/is_first	텐서		부울
단계/is_last	텐서		부울
단계/is_terminal	텐서		부울
단계/관찰	풍모Dict
단계/관찰/com_velocity	텐서	(삼,)	float32
단계/관찰/dummy-0	텐서	(1,)	float32
단계/관찰/dummy-1	텐서	(1,)	float32
단계/관찰/더미-2	텐서	(1,)	float32
단계/관찰/더미-3	텐서	(1,)	float32
단계/관찰/더미-4	텐서	(1,)	float32
단계/관찰/dummy-5	텐서	(1,)	float32
단계/관찰/dummy-6	텐서	(1,)	float32
단계/관찰/dummy-7	텐서	(1,)	float32
단계/관찰/dummy-8	텐서	(1,)	float32
단계/관찰/dummy-9	텐서	(1,)	float32
단계/관찰/사지	텐서	(12,)	float32
단계/관찰/head_height	텐서	(1,)	float32
단계/관측/joint_angles	텐서	(21,)	float32
단계/관찰/torso_vertical	텐서	(삼,)	float32
단계/관찰/속도	텐서	(27,)	float32
단계/보상	텐서	(1,)	float32

예 ( tfds.as_dataframe ):