Module: tf_agents.bandits.metrics.tf_metrics

TensorFlow metrics for bandit algorithms.


class ConstraintViolationsMetric: Computes the violations of a certain constraint.

class DistanceFromGreedyMetric: Difference between the estimated reward of the chosen action and that of the best action.

class RegretMetric: Computes the regret with respect to a baseline.

class SuboptimalArmsMetric: Computes the number of suboptimal arms with respect to a baseline.
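
The quantities named in the class summaries above can be illustrated with a small, library-free sketch. This is not the tf_agents implementation: the function names, argument layouts, and the choice to average over a batch are all assumptions made for illustration, and the sample numbers are invented.

```python
from typing import Sequence


def mean_regret(baseline_rewards: Sequence[float],
                observed_rewards: Sequence[float]) -> float:
    """Average gap between the baseline reward and the reward received.

    Hypothetical helper: averaging over the batch is an assumption.
    """
    gaps = [b - r for b, r in zip(baseline_rewards, observed_rewards)]
    return sum(gaps) / len(gaps)


def suboptimal_arm_count(baseline_actions: Sequence[int],
                         chosen_actions: Sequence[int]) -> int:
    """Number of steps where the chosen arm differs from the baseline arm.

    Hypothetical helper: a plain count over the batch.
    """
    return sum(1 for b, c in zip(baseline_actions, chosen_actions) if b != c)


def mean_distance_from_greedy(estimated_rewards: Sequence[Sequence[float]],
                              chosen_actions: Sequence[int]) -> float:
    """Average gap between the best estimated reward and the chosen arm's.

    Each row of `estimated_rewards` holds one estimate per arm
    (hypothetical layout).
    """
    gaps = [max(row) - row[a]
            for row, a in zip(estimated_rewards, chosen_actions)]
    return sum(gaps) / len(gaps)


# Invented batch of four steps with two or three arms.
regret = mean_regret(
    baseline_rewards=[1.0, 1.0, 0.5, 2.0],
    observed_rewards=[1.0, 0.5, 0.5, 1.0],
)
suboptimal = suboptimal_arm_count(
    baseline_actions=[0, 2, 1, 1],
    chosen_actions=[0, 1, 1, 0],
)
distance = mean_distance_from_greedy(
    estimated_rewards=[[1.0, 0.5], [0.25, 0.75], [2.0, 2.0], [0.0, 0.5]],
    chosen_actions=[0, 0, 1, 0],
)
```

In this sketch the baseline plays the role of an oracle that knows the best reward or arm for each step. The metric classes listed above operate on TensorFlow tensors rather than Python lists; consult each class page for the actual constructor arguments and reduction behavior.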

absolute_import: Instance of __future__._Feature
division: Instance of __future__._Feature
print_function: Instance of __future__._Feature