View source on GitHub
|
TF metrics for Bandits algorithms.
Classes
class ConstraintViolationsMetric: Computes the violations of a certain constraint.
class DistanceFromGreedyMetric: Difference between the estimated reward of the chosen and the best action.
class RegretMetric: Computes the regret with respect to a baseline.
class SuboptimalArmsMetric: Computes the number of suboptimal arms with respect to a baseline.
Other Members | |
|---|---|
| absolute_import |
Instance of __future__._Feature
|
| division |
Instance of __future__._Feature
|
| print_function |
Instance of __future__._Feature
|
View source on GitHub