tf_agents.trajectories.termination

Returns a TimeStep with step_type set to StepType.LAST.

Main aliases

tf_agents.trajectories.time_step.termination

Args

observation: A NumPy array, tensor, or a nested dict, list, or tuple of arrays or tensors.
reward: A NumPy array, tensor, or a nested dict, list, or tuple of arrays or tensors.
outer_dims: (Optional) If provided, it is used to determine the batch dimensions. If not, the batch dimensions are inferred from the shape of reward.

Returns

A TimeStep.