View source on GitHub
  
 | 
Policy Step.
Classes
class CommonFields: Strings which can be used for querying returned PolicyStep.info field.
class PolicyInfo: PolicyInfo(log_probability,)
class PolicyStep: Returned with every call to policy.action() and policy.distribution().
Functions
get_log_probability(...): Gets the CommonFields.LOG_PROBABILITY from info depending on type.
set_log_probability(...): Sets the CommonFields.LOG_PROBABILITY on info to be log_probability.
Type Aliases
Other Members | |
|---|---|
| absolute_import | 
Instance of __future__._Feature
 | 
| division | 
Instance of __future__._Feature
 | 
| print_function | 
Instance of __future__._Feature
 | 
    View source on GitHub