tf_agents.trajectories.termination

Returns a `TimeStep` with `step_type` set to `StepType.LAST`.

Main aliases

tf_agents.trajectories.time_step.termination

tf_agents.trajectories.termination(
    observation: tf_agents.typing.types.NestedTensorOrArray,
    reward: tf_agents.typing.types.NestedTensorOrArray,
    outer_dims: Optional[tf_agents.typing.types.Shape] = None
) -> tf_agents.trajectories.TimeStep

Used in the notebooks

Used in the tutorials
Environments Tutorial on Multi Armed Bandits in TF-Agents

Args
`observation`	A NumPy array, tensor, or a nested dict, list or tuple of arrays or tensors.
`reward`	A NumPy array, tensor, or a nested dict, list or tuple of arrays or tensors.
`outer_dims`	(Optional) If provided, it is used to determine the batch dimensions. If not, the batch dimensions are inferred from the shape of `reward`.

Returns
A `TimeStep`.
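
The batch-dimension rule described under Args can be illustrated with a small, library-free sketch. Everything in it is a hypothetical stand-in, not the TF-Agents implementation: `StepType` and `TimeStep` mirror only the names and field order of the real classes, `termination_sketch`, `shape_of`, and `full` are invented for illustration, and nested Python lists take the place of arrays or tensors. The zero discount on the final step is an assumption of this sketch.

```python
from enum import IntEnum
from typing import Any, NamedTuple, Optional, Sequence, Tuple


class StepType(IntEnum):
    """Hypothetical stand-in for tf_agents.trajectories.StepType."""
    FIRST = 0
    MID = 1
    LAST = 2


class TimeStep(NamedTuple):
    """Hypothetical stand-in for tf_agents.trajectories.TimeStep."""
    step_type: Any
    reward: Any
    discount: Any
    observation: Any


def shape_of(value: Any) -> Tuple[int, ...]:
    """Return the shape of a scalar or a rectangular nested list."""
    if isinstance(value, (list, tuple)):
        return (len(value),) + (shape_of(value[0]) if value else ())
    return ()


def full(shape: Sequence[int], fill: Any) -> Any:
    """Build a nested list of the given shape; an empty shape gives a scalar."""
    if not shape:
        return fill
    return [full(shape[1:], fill) for _ in range(shape[0])]


def termination_sketch(
    observation: Any,
    reward: Any,
    outer_dims: Optional[Sequence[int]] = None,
) -> TimeStep:
    """Mimic the documented contract: every batch entry is marked LAST."""
    # Use outer_dims when given; otherwise infer the batch shape from reward.
    batch_shape = tuple(outer_dims) if outer_dims is not None else shape_of(reward)
    return TimeStep(
        step_type=full(batch_shape, StepType.LAST),
        reward=reward,
        discount=full(batch_shape, 0.0),  # assumption: zero discount on the last step
        observation=observation,
    )


# Unbatched: a scalar reward gives a scalar step_type.
single = termination_sketch(observation=[0.1, 0.2], reward=1.0)

# Batched: a reward of shape (2,) gives one LAST entry per batch element.
batched = termination_sketch(
    observation=[[0.1, 0.2], [0.3, 0.4]],
    reward=[1.0, 0.0],
)

# Explicit outer_dims overrides inference from the shape of reward.
explicit = termination_sketch(
    observation=[[0.1], [0.2], [0.3]],
    reward=[1.0, 1.0, 1.0],
    outer_dims=(3,),
)
```

With the real library, the equivalent call would be `tf_agents.trajectories.termination(observation, reward)` on NumPy arrays or tensors; the sketch only shows how `outer_dims` and the shape of `reward` decide the shape of `step_type`.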

Last updated 2024-04-26 UTC.
