View source on GitHub |
Implements the EXP3 bandit algorithm.
Implementation based on
"Bandit Algorithms" Lattimore and Szepesvari, 2019 https://tor-lattimore.com/downloads/book/book.pdf
Classes
class Exp3Agent
: An agent implementing the EXP3 bandit algorithm.
Functions
selective_sum(...)
: Sums entries in values
, partitioned using partitions
.
Other Members | |
---|---|
absolute_import |
Instance of __future__._Feature
|
division |
Instance of __future__._Feature
|
print_function |
Instance of __future__._Feature
|