Module: tf_agents.bandits.policies.mixture_policy

A policy class that chooses from a set of policies to get the actions from.

This mixture policy takes a list of policies and will randomly choose one of them for every observation. The distribution is defined by the mixture_distribution.

Classes

class MixturePolicy: A policy that chooses from a set of policies to decide the action.

MIXTURE_AGENT_ID 'mixture_agent_id'
SUBPOLICY_INFO 'subpolicy_info'
absolute_import Instance of __future__._Feature
division Instance of __future__._Feature
print_function Instance of __future__._Feature