An agent that uses and trains a greedy reward prediction policy.
Classes
`class GreedyMultiObjectiveNeuralAgent`: A neural-network based bandit agent for multi-objective optimization.
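
To illustrate what "greedy" means in a multi-objective setting, here is a minimal plain-Python sketch. It does not use the TF-Agents API: the function `greedy_action`, the weighted-sum scalarization, and the example numbers are all assumptions made for illustration, and the actual agent may combine objectives differently. The idea shown is that a model predicts one reward per objective for each action, the per-objective rewards are collapsed into a single score, and the action with the highest score is chosen.

```python
from typing import Sequence


def greedy_action(predicted_rewards: Sequence[Sequence[float]],
                  weights: Sequence[float]) -> int:
    """Return the index of the action with the highest scalarized reward.

    predicted_rewards[i][j] is the (hypothetical) predicted reward of
    action i on objective j. A weighted sum is assumed as the scalarizer.
    """
    if not predicted_rewards:
        raise ValueError("predicted_rewards must contain at least one action")
    for rewards in predicted_rewards:
        if len(rewards) != len(weights):
            raise ValueError("each action needs one reward per objective weight")

    # Collapse each action's per-objective rewards into a single score.
    scores = [sum(w * r for w, r in zip(weights, rewards))
              for rewards in predicted_rewards]

    # Greedy choice: pick the best score, with no exploration.
    return max(range(len(scores)), key=scores.__getitem__)


# Hypothetical predictions for 3 actions on 2 objectives.
predicted = [[0.2, 0.9], [0.6, 0.7], [0.8, 0.1]]
balanced_choice = greedy_action(predicted, weights=[0.5, 0.5])
first_objective_only = greedy_action(predicted, weights=[1.0, 0.0])
```

With equal weights the sketch selects the action that balances both objectives; shifting all weight to the first objective changes the selection to the action that is best on that objective alone.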
Other Members | |
---|---|
absolute_import | Instance of `__future__._Feature` |
division | Instance of `__future__._Feature` |
print_function | Instance of `__future__._Feature` |