|View source on GitHub|
A policy class that chooses from a set of policies to get the actions from.
This mixture policy takes a list of policies and will randomly choose one of
them for every observation. The distribution is defined by the
class MixturePolicy: A policy that chooses from a set of policies to decide the action.