![]() |
PPOLossInfo(policy_gradient_loss, value_estimation_loss, l2_regularization_loss, entropy_regularization_loss, kl_penalty_loss)
tf_agents.agents.ppo.ppo_agent.PPOLossInfo(
policy_gradient_loss, value_estimation_loss, l2_regularization_loss,
entropy_regularization_loss, kl_penalty_loss
)
Attributes | |
---|---|
policy_gradient_loss
|
|
value_estimation_loss
|
|
l2_regularization_loss
|
|
entropy_regularization_loss
|
|
kl_penalty_loss
|