Module: tf_agents.bandits.policies.boltzmann_reward_prediction_policy

Policy for reward prediction and Boltzmann exploration.

Classes

class BoltzmannRewardPredictionPolicy: Class to build Reward Prediction Policies with Boltzmann exploration.
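
Boltzmann exploration generally means sampling an action from a softmax distribution over the predicted rewards, with a temperature controlling how greedy the choice is. The block below is a minimal pure-Python sketch of that general technique, not the library's implementation; the helper names `boltzmann_probabilities` and `sample_action`, the `temperature` argument, and the example reward values are assumptions made for illustration.

```python
import math
import random


def boltzmann_probabilities(predicted_rewards, temperature=1.0):
    """Return softmax(predicted_rewards / temperature) as a list of floats."""
    if temperature <= 0:
        raise ValueError("temperature must be positive")
    scaled = [r / temperature for r in predicted_rewards]
    # Subtract the maximum before exponentiating for numerical stability.
    max_scaled = max(scaled)
    exps = [math.exp(s - max_scaled) for s in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def sample_action(predicted_rewards, temperature=1.0, rng=random):
    """Sample an action index in proportion to its Boltzmann probability."""
    probs = boltzmann_probabilities(predicted_rewards, temperature)
    return rng.choices(range(len(probs)), weights=probs, k=1)[0]


# Hypothetical per-action reward predictions (illustrative values only).
rewards = [1.0, 2.0, 3.0]
print(boltzmann_probabilities(rewards, temperature=1.0))
print(boltzmann_probabilities(rewards, temperature=0.1))
print(sample_action(rewards, temperature=1.0))
```

In this sketch a lower temperature concentrates probability on the action with the highest predicted reward, while a higher temperature moves the distribution toward uniform and so explores more.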

Other Members

absolute_import: Instance of __future__._Feature
division: Instance of __future__._Feature
print_function: Instance of __future__._Feature