tf_agents.agents.dqn.dqn_agent.DqnLossInfo

DqnLossInfo is stored in the extras field of the LossInfo instance.

tf_agents.agents.dqn.dqn_agent.DqnLossInfo(
    td_loss, td_error
)

Both td_loss and td_error have a validity mask applied to ensure that no loss or error is calculated for episode boundaries.

td_loss: The weighted TD loss (depends on the choice of loss metric and on any weights passed to the DQN loss function).

td_error: The unweighted TD errors, calculated as:

  td_error = td_targets - q_values

These can be used to update Prioritized Replay Buffer priorities.
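As a rough illustration of the formula and the priority update mentioned above, the following sketch uses plain Python lists in place of tensors. The `DqnLossInfo` tuple here is a local stand-in with the same two field names, not the TF-Agents class itself. The helper names, the squared-error loss metric, and the `|td_error| + epsilon` priority rule are all assumptions chosen for the example, not something this page specifies.

```python
from collections import namedtuple

# Local stand-in with the documented field names; the real class lives in
# tf_agents.agents.dqn.dqn_agent and holds tensors rather than lists.
DqnLossInfo = namedtuple("DqnLossInfo", ("td_loss", "td_error"))


def masked_td_errors(td_targets, q_values, valid_mask):
    """td_error = td_targets - q_values, zeroed where valid_mask is 0."""
    return [m * (t - q) for t, q, m in zip(td_targets, q_values, valid_mask)]


def priorities_from_td_error(td_error, epsilon=1e-6):
    """Hypothetical priority rule: absolute TD error plus a small constant."""
    return [abs(e) + epsilon for e in td_error]


td_targets = [1.0, 0.5, 2.0]
q_values = [0.5, 1.0, 2.0]
valid_mask = [1.0, 1.0, 0.0]  # last entry stands for an episode boundary

errors = masked_td_errors(td_targets, q_values, valid_mask)

# Assumed loss metric: mean of squared errors, with no extra weights.
info = DqnLossInfo(
    td_loss=sum(e * e for e in errors) / len(errors),
    td_error=errors,
)

priorities = priorities_from_td_error(info.td_error)
```

The priorities would then be handed to whatever prioritized replay buffer is in use; that call is left out because it depends on the buffer implementation.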

Note that, unlike td_loss, td_error may contain a time dimension when training in RNN mode. For td_loss, this axis is averaged out.
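The shape difference can be pictured with nested lists standing in for a `[batch, time]` tensor. This is only a sketch: the squared-error metric, the plain unmasked mean over time, and the max-over-time reduction used to get one priority per sequence are assumptions for illustration.

```python
def mean_over_time(per_step):
    """Average each row (one sequence) over its time axis."""
    return [sum(row) / len(row) for row in per_step]


# td_error keeps the time axis: 2 sequences, 2 time steps each.
td_error = [[0.5, -1.0], [0.0, 2.0]]

# Assumed loss metric: element-wise squared error.
per_step_loss = [[e * e for e in row] for row in td_error]

# The time axis is averaged out, leaving one loss value per sequence.
per_sequence_loss = mean_over_time(per_step_loss)

# Hypothetical reduction if one priority per sequence is needed.
per_sequence_priority = [max(abs(e) for e in row) for row in td_error]
```

Code that consumes td_error therefore has to handle the extra axis itself, for example by reducing over time before writing priorities back.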

Attributes
`td_loss`	A `namedtuple` alias for field number 0
`td_error`	A `namedtuple` alias for field number 1
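Because the attributes are namedtuple aliases, the fields can be read by name, by index, or by unpacking. The sketch below uses local stand-ins for both `LossInfo` and `DqnLossInfo` so it runs without TF-Agents installed; the values are made up for the example.

```python
from collections import namedtuple

# Local stand-ins mirroring the documented field layout.
LossInfo = namedtuple("LossInfo", ("loss", "extras"))
DqnLossInfo = namedtuple("DqnLossInfo", ("td_loss", "td_error"))

loss_info = LossInfo(
    loss=0.25,
    extras=DqnLossInfo(td_loss=0.25, td_error=[0.5, -0.5]),
)

extras = loss_info.extras

# Field number 0 is td_loss, field number 1 is td_error.
by_name = (extras.td_loss, extras.td_error)
by_index = (extras[0], extras[1])

# Tuple unpacking follows the same field order.
td_loss, td_error = extras
```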

Except as otherwise noted, the content of this page is licensed under the Creative Commons Attribution 4.0 License, and code samples are licensed under the Apache 2.0 License. For details, see the Google Developers Site Policies. Java is a registered trademark of Oracle and/or its affiliates.

Last updated 2024-04-26 UTC.
