View source on GitHub |
TF metrics for Bandits algorithms.
Classes
class ConstraintViolationsMetric
: Computes the violations of a certain constraint.
class DistanceFromGreedyMetric
: Difference between the estimated reward of the chosen and the best action.
class RegretMetric
: Computes the regret with respect to a baseline.
class SuboptimalArmsMetric
: Computes the number of suboptimal arms with respect to a baseline.
Other Members | |
---|---|
absolute_import |
Instance of __future__._Feature
|
division |
Instance of __future__._Feature
|
print_function |
Instance of __future__._Feature
|