View source on GitHub |
Policy implementation that generates greedy actions from another policy.
Classes
class DeterministicWithLogProb
: Thin wrapper around Deterministic that supports taking log_prob.
class GreedyPolicy
: Returns greedy samples of a given policy.
Other Members | |
---|---|
absolute_import |
Instance of __future__._Feature
|
division |
Instance of __future__._Feature
|
print_function |
Instance of __future__._Feature
|