
Module: tf_agents.bandits.agents.linear_thompson_sampling_agent

Implements the Linear Thompson Sampling bandit algorithm.


Reference: "Thompson Sampling for Contextual Bandits with Linear Payoffs", Shipra Agrawal, Navin Goyal, ICML 2013. The algorithm actually implemented is Algorithm 3 from the supplementary material of the paper.
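To make the idea concrete, here is a minimal pure-Python sketch of linear Thompson sampling. It is not the code of `LinearThompsonSamplingAgent` and may differ in detail from Algorithm 3 of the paper's supplementary material. It assumes one independent linear model per arm, an identity prior on each precision matrix, and a scale parameter `v` for the posterior covariance. The class `LinearThompsonSketch` and its helpers are hypothetical names introduced only for illustration. Each arm keeps a precision matrix `A` and a reward-weighted feature sum `b`; at decision time a parameter vector is sampled from a Gaussian with mean `A^{-1} b` and covariance `v^2 A^{-1}`, and the arm with the highest sampled payoff is played.

```python
import math
import random


def cholesky(a):
    """Return lower-triangular L with L L^T == a (a symmetric positive definite)."""
    n = len(a)
    l = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(i + 1):
            s = a[i][j] - sum(l[i][k] * l[j][k] for k in range(j))
            l[i][j] = math.sqrt(s) if i == j else s / l[j][j]
    return l


def solve_lower(l, b):
    """Solve L y = b by forward substitution."""
    y = [0.0] * len(b)
    for i in range(len(b)):
        y[i] = (b[i] - sum(l[i][k] * y[k] for k in range(i))) / l[i][i]
    return y


def solve_lower_transposed(l, y):
    """Solve L^T x = y by back substitution."""
    n = len(y)
    x = [0.0] * n
    for i in reversed(range(n)):
        x[i] = (y[i] - sum(l[k][i] * x[k] for k in range(i + 1, n))) / l[i][i]
    return x


class LinearThompsonSketch:
    """Illustrative per-arm linear Thompson sampling (hypothetical helper class)."""

    def __init__(self, num_arms, dim, v=1.0, seed=0):
        self.v = v  # assumed scale of the posterior covariance
        self.rng = random.Random(seed)
        # Per-arm precision matrix A (identity prior) and weighted feature sum b.
        self.a = [[[1.0 if i == j else 0.0 for j in range(dim)]
                   for i in range(dim)] for _ in range(num_arms)]
        self.b = [[0.0] * dim for _ in range(num_arms)]

    def sample_theta(self, arm):
        """Draw theta ~ N(A^{-1} b, v^2 A^{-1}) using the Cholesky factor of A."""
        l = cholesky(self.a[arm])
        mean = solve_lower_transposed(l, solve_lower(l, self.b[arm]))
        z = [self.rng.gauss(0.0, 1.0) for _ in mean]
        # L^{-T} z has covariance L^{-T} L^{-1} = A^{-1}.
        noise = solve_lower_transposed(l, z)
        return [m + self.v * n for m, n in zip(mean, noise)]

    def choose(self, x):
        """Sample one theta per arm and return the arm with the best sampled payoff."""
        scores = [sum(t * xi for t, xi in zip(self.sample_theta(arm), x))
                  for arm in range(len(self.a))]
        return max(range(len(scores)), key=scores.__getitem__)

    def update(self, arm, x, reward):
        """Rank-one update: A += x x^T and b += reward * x for the played arm."""
        for i, xi in enumerate(x):
            self.b[arm][i] += reward * xi
            for j, xj in enumerate(x):
                self.a[arm][i][j] += xi * xj


if __name__ == "__main__":
    agent = LinearThompsonSketch(num_arms=2, dim=2, v=0.5, seed=0)
    env_rng = random.Random(1)
    true_theta = [[1.0, 0.0], [0.0, 1.0]]  # hypothetical ground truth per arm
    for _ in range(200):
        x = [env_rng.random(), env_rng.random()]
        arm = agent.choose(x)
        reward = sum(t * xi for t, xi in zip(true_theta[arm], x))
        reward += env_rng.gauss(0.0, 0.1)  # noisy linear payoff
        agent.update(arm, x, reward)
    print([round(t, 2) for t in agent.sample_theta(0)])
```

Sampling through the Cholesky factor of `A` avoids forming `A^{-1}` explicitly. Setting `v=0` reduces the sketch to greedy selection on the ridge-regression estimate, which is a convenient way to check the update rule.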


class LinearThompsonSamplingAgent: Linear Thompson Sampling Agent.

absolute_import Instance of __future__._Feature
division Instance of __future__._Feature
print_function Instance of __future__._Feature