It can be used to take an input of batched observations and outputs
([batch_size, num_actions, num_atoms], network's state).
The first element of the output is a batch of logits based on the distribution
called C51 from Bellemare et al., 2017 (https://arxiv.org/abs/1707.06887). The
logits are used to compute approximate probability distributions for Q-values
for each potential action, by computing the probabilities at the 51 points
(called atoms) in np.linspace(-10.0, 10.0, 51).
A tensor_spec.TensorSpec specifying the observation
A tensor_spec.BoundedTensorSpec representing the actions.
The number of atoms to use in our approximate probability
distributions. Defaults to 51 to produce C51.
(Optional.) A nest of tf.keras.layers.Layer
representing preprocessing for the different observations. All of these
layers must not be already built. For more details see the documentation
(Optional.) A keras layer that takes a flat list
of tensors and combines them. Good options include tf.keras.layers.Add
and tf.keras.layers.Concatenate(axis=-1). This layer must not be
already built. For more details see the documentation of
Optional list of convolution layer parameters for
observations, where each item is a length-three tuple indicating
(num_units, kernel_size, stride).
Optional list of fully connected parameters for
observations, where each item is the number of units in the layer.
Activation function, e.g. tf.nn.relu or tf.nn.leaky_relu.
A string representing the name of the network.
action_spec is not a BoundedTensorSpec.
Returns the spec of the input to the network of type InputSpec.
Get the list of all (nested) sub-layers used in this Network.
(Optional). Override or provide an input tensor spec
when creating variables.
Other arguments to network.call(), e.g. training=True.
Output specs - a nested spec calculated from the outputs (excluding any
batch dimensions). If any of the output elements is a tfp Distribution,
the associated spec entry returned is a DistributionSpec.
If no input_tensor_spec is provided, and the network did
not provide one during construction.