Create a transition from a trajectory or two adjacent trajectories.
View aliases
Main aliases
tf_agents.trajectories.to_transition(
trajectory: tf_agents.trajectories.Trajectory
,
next_trajectory: Optional[tf_agents.trajectories.Trajectory
] = None
) -> tf_agents.trajectories.Transition
time_steps.step_type = trajectory.step_type[:,:-1]
time_steps.observation = trajectory.observation[:,:-1]
next_time_steps.observation = trajectory.observation[:,1:]
next_time_steps. step_type = trajectory. next_step_type[:,:-1]
next_time_steps.reward = trajectory.reward[:,:-1]
next_time_steps. discount = trajectory. discount[:,:-1]
Notice that reward and discount for time_steps are undefined, therefore filled with zero.
Returns | |
---|---|
A tuple (time_steps, policy_steps, next_time_steps) . The reward and
discount fields of time_steps are filled with zeros because these
cannot be deduced (please do not use them).
|