
Module: tf_agents.bandits.metrics.tf_metrics

TF metrics for bandit algorithms.

Classes

class ConstraintViolationsMetric: Computes the violations of a certain constraint.

class DistanceFromGreedyMetric: Difference between the estimated reward of the chosen action and that of the best action.

class RegretMetric: Computes the regret with respect to a baseline.

class SuboptimalArmsMetric: Computes the number of suboptimal arms with respect to a baseline.
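
To make the one-line descriptions of RegretMetric and SuboptimalArmsMetric more concrete, here is a minimal pure-Python sketch of the quantities they suggest. It does not call tf_agents and is not the library's implementation. The function names `mean_regret` and `suboptimal_arm_count` are hypothetical, and the sketch assumes the baseline supplies, for each step, either the reward of the best action or the index of the best action.

```python
from typing import Sequence


def mean_regret(baseline_rewards: Sequence[float],
                received_rewards: Sequence[float]) -> float:
    """Average per-step regret: baseline reward minus received reward.

    Assumption: the baseline gives the reward the best action would have earned.
    """
    if len(baseline_rewards) != len(received_rewards):
        raise ValueError("baseline and received rewards must have equal length")
    if not baseline_rewards:
        raise ValueError("at least one step is required")
    diffs = [b - r for b, r in zip(baseline_rewards, received_rewards)]
    return sum(diffs) / len(diffs)


def suboptimal_arm_count(baseline_actions: Sequence[int],
                         chosen_actions: Sequence[int]) -> int:
    """Number of steps where the chosen arm differs from the baseline arm."""
    if len(baseline_actions) != len(chosen_actions):
        raise ValueError("baseline and chosen actions must have equal length")
    return sum(1 for b, c in zip(baseline_actions, chosen_actions) if b != c)


# Illustrative data only: three steps of a hypothetical bandit run.
regret = mean_regret([1.0, 1.0, 0.5], [1.0, 0.5, 0.5])
misses = suboptimal_arm_count([2, 0, 1], [2, 1, 1])
```

In this toy run the agent picks a suboptimal arm once, at the second step, and that is the only step that contributes to regret. For the actual constructor arguments and reduction behavior of these classes, consult the individual class pages.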