View source on GitHub |
Create a Trajectory transitioning between StepTypes LAST
and FIRST
.
tf_agents.trajectories.boundary(
observation: tf_agents.typing.types.NestedSpecTensorOrArray
,
action: tf_agents.typing.types.NestedSpecTensorOrArray
,
policy_info: tf_agents.typing.types.NestedSpecTensorOrArray
,
reward: tf_agents.typing.types.NestedSpecTensorOrArray
,
discount: tf_agents.typing.types.SpecTensorOrArray
) -> tf_agents.trajectories.Trajectory
All inputs may be batched.
The input discount
is used to infer the outer shape of the inputs,
as it is always expected to be a singleton array with scalar inner shape.
Returns | |
---|---|
A Trajectory instance.
|