tfp.substrates.jax.distributions.TransformedDistribution

A Transformed Distribution.

Inherits From: Distribution

View aliases

Main aliases

tfp.experimental.substrates.jax.distributions.TransformedDistribution

tfp.substrates.jax.distributions.TransformedDistribution(
    distribution,
    bijector,
    kwargs_split_fn=_default_kwargs_split_fn,
    validate_args=False,
    parameters=None,
    name=None
)

A TransformedDistribution models p(y) given a base distribution p(x), and a deterministic, invertible, differentiable transform, Y = g(X). The transform is typically an instance of the Bijector class and the base distribution is typically an instance of the Distribution class.

A Bijector is expected to implement the following functions:

forward,
inverse,
inverse_log_det_jacobian.

The semantics of these functions are outlined in the Bijector documentation.

We now describe how a TransformedDistribution alters the input/outputs of a Distribution associated with a random variable (rv) X.

Write cdf(Y=y) for an absolutely continuous cumulative distribution function of random variable Y; write the probability density function pdf(Y=y) := d^k / (dy_1,...,dy_k) cdf(Y=y) for its derivative wrt to Y evaluated at y. Assume that Y = g(X) where g is a deterministic diffeomorphism, i.e., a non-random, continuous, differentiable, and invertible function. Write the inverse of g as X = g^{-1}(Y) and (J o g)(x) for the Jacobian of g evaluated at x.

A TransformedDistribution implements the following operations:
- sample Mathematically: Y = g(X) Programmatically: bijector.forward(distribution.sample(...))
- log_prob Mathematically: (log o pdf)(Y=y) = (log o pdf o g^{-1})(y) + (log o abs o det o J o g^{-1})(y) Programmatically: (distribution.log_prob(bijector.inverse(y)) + bijector.inverse_log_det_jacobian(y))
- log_cdf Mathematically: (log o cdf)(Y=y) = (log o cdf o g^{-1})(y) Programmatically: distribution.log_cdf(bijector.inverse(x))
- and similarly for: cdf, prob, log_survival_function, survival_function.
Kullback-Leibler divergence is also well defined for TransformedDistribution instances that have matching bijectors. Bijector matching is performed via the Bijector.eq method, e.g., td1.bijector == td2.bijector, as part of the KL calculation. If the underlying bijectors do not match, a NotImplementedError is raised when calling kl_divergence. This is the same behavior as calling kl_divergence when two distributions do not have a registered KL divergence.

Note: Due to the current constraints imposed on bijector equality testing, kl_divergence may behave differently in eager mode computation vs. traced computation. For example, if a TD Bijector's parameters are Tensor objects, and are themselves derived from e.g. a Variable, some stateful operation, or from an argument to a tf.function then Bijector equality cannot be known during the call to kl_divergence and the bijectors are assumed unequal. In this case, calling kl_divergence may raise an exception in graph / tf.function mode, but work just fine in eager / numpy mode.

A simple example constructing a Log-Normal distribution from a Normal distribution:
```
tfd = tfp.distributions
tfb = tfp.bijectors
log_normal = tfd.TransformedDistribution(
  distribution=tfd.Normal(loc=0., scale=1.),
  bijector=tfb.Exp(),
  name='LogNormalTransformedDistribution')
```
A LogNormal made from callables:
```
tfd = tfp.distributions
tfb = tfp.bijectors
log_normal = tfd.TransformedDistribution(
  distribution=tfd.Normal(loc=0., scale=1.),
  bijector=tfb.Inline(
    forward_fn=tf.exp,
    inverse_fn=tf.log,
    inverse_log_det_jacobian_fn=(
      lambda y: -tf.reduce_sum(tf.log(y), axis=-1)),
  name='LogNormalTransformedDistribution')
```
Another example constructing a Normal from a StandardNormal:
```
tfd = tfp.distributions
tfb = tfp.bijectors
normal = tfd.TransformedDistribution(
  distribution=tfd.Normal(loc=0., scale=1.),
  bijector=tfb.Shift(shift=-1.)(tfb.Scale(scale=2.)),
  name='NormalTransformedDistribution')
```
A TransformedDistribution's batch_shape is derived by broadcasting the batch shapes of the base distribution and the bijector. The base distribution is then itself implicitly lifted to the broadcast batch shape. For example, in
```
tfd = tfp.distributions
tfb = tfp.bijectors
batch_normal = tfd.TransformedDistribution(
  distribution=tfd.Normal(loc=0., scale=1.),
  bijector=tfb.Shift(shift=[-1., 0., 1.]),
  name='BatchNormalTransformedDistribution')
```
the base distribution has batch shape [], and the bijector applied to this distribution contributes a batch shape of [3] (obtained as bijector.experimental_batch_shape( x_event_ndims=tf.rank(distribution.event_shape)), yielding the broadcast shape batch_normal.batch_shape == [3]. Although sampling from the base distribution would ordinarily return just a single value, calling batch_normal.sample() will return a Tensor of 3 independent values, just as if the base distribution had explicitly followed the broadcast batch shape.

The event_shape of a TransformedDistribution is the forward_event_shape of the bijector applied to the event_shape of the base distribution.

tfd.Sample or tfd.Independent may be used to add extra IID dimensions to the event_shape of the base distribution before the bijector operates on it. The following example demonstrates how to construct a multivariate Normal as a TransformedDistribution, by adding a rank-1 IID dimension to the event_shape of a standard Normal and applying tfb.ScaleMatvecTriL.
```
tfd = tfp.distributions
tfb = tfp.bijectors
# We will create two MVNs with batch_shape = event_shape = 2.
mean = [[-1., 0],      # batch:0
        [0., 1]]       # batch:1
chol_cov = [[[1., 0],
             [0, 1]],  # batch:0
            [[1, 0],
             [2, 2]]]  # batch:1
mvn1 = tfd.TransformedDistribution(
    distribution=tfd.Sample(
        tfd.Normal(loc=[0., 0], scale=1.),  # base_dist.batch_shape == [2]
        sample_shape=[2])                   # base_dist.event_shape == [2]
    bijector=tfb.Shift(shift=mean)(tfb.ScaleMatvecTriL(scale_tril=chol_cov)))
mvn2 = ds.MultivariateNormalTriL(loc=mean, scale_tril=chol_cov)
# mvn1.log_prob(x) == mvn2.log_prob(x)
```

If both distribution and bijector are CompositeTensors, then the resulting TransformedDistribution instance is a CompositeTensor as well. Otherwise, a non-CompositeTensor _TransformedDistribution instance is created instead. Distribution subclasses that inherit from TransformedDistribution will also inherit from CompositeTensor.

Args
`distribution`	The base distribution instance to transform. Typically an instance of `Distribution`.
`bijector`	The object responsible for calculating the transformation. Typically an instance of `Bijector`.
`kwargs_split_fn`	Python `callable` which takes a kwargs `dict` and returns a tuple of kwargs `dict`s for each of the `distribution` and `bijector` parameters respectively. Default value: `_default_kwargs_split_fn` (i.e., `lambda kwargs: (kwargs.get('distribution_kwargs', {}), kwargs.get('bijector_kwargs', {}))`)
`validate_args`	Python `bool`, default `False`. When `True` distribution parameters are checked for validity despite possibly degrading runtime performance. When `False` invalid inputs may silently render incorrect outputs.
`parameters`	Locals dict captured by subclass constructor, to be used for copy/slice re-instantiation operations.
`name`	Python `str` name prefixed to Ops created by this class. Default: `bijector.name + distribution.name`.

Attributes
`allow_nan_stats`	Python `bool` describing behavior when a stat is undefined. Stats return +/- infinity when it makes sense. E.g., the variance of a Cauchy distribution is infinity. However, sometimes the statistic is undefined, e.g., if a distribution's pdf does not achieve a maximum within the support of the distribution, the mode is undefined. If the mean is undefined, then by definition the variance is undefined. E.g. the mean for Student's T for df = 1 is undefined (no clear way to say it is either + or - infinity), so the variance = E[(X - mean)**2] is also undefined.
`batch_shape`	Shape of a single sample from a single event index as a `TensorShape`. May be partially defined or unknown. The batch dimensions are indexes into independent, non-identical parameterizations of this distribution.
`bijector`	Function transforming x => y.
`distribution`	Base distribution, p(x).
`dtype`	The `DType` of `Tensor`s handled by this `Distribution`.
`event_shape`	Shape of a single sample from a single batch as a `TensorShape`. May be partially defined or unknown.
`experimental_is_sharded`
`experimental_shard_axis_names`	The list or structure of lists of active shard axis names.
`name`	Name prepended to all ops created by this `Distribution`.
`parameters`	Dictionary of parameters used to instantiate this `Distribution`.
`reparameterization_type`	Describes how samples from the distribution are reparameterized. Currently this is one of the static instances `tfd.FULLY_REPARAMETERIZED` or `tfd.NOT_REPARAMETERIZED`.
`trainable_variables`
`validate_args`	Python `bool` indicating possibly expensive checks are enabled.
`variables`

Args
`value`	`float` or `double` `Tensor`.
`name`	Python `str` prepended to names of ops created by this function.
`**kwargs`	Named arguments forwarded to subclass implementation.

Args
`other`	`tfp.distributions.Distribution` instance.
`name`	Python `str` prepended to names of ops created by this function.

Args
`*args`	Passed to implementation `_default_event_space_bijector`.
`**kwargs`	Passed to implementation `_default_event_space_bijector`.

Args
`value`	a `Tensor` valid sample from this distribution family.
`sample_ndims`	Positive `int` Tensor number of leftmost dimensions of `value` that index i.i.d. samples. Default value: `1`.
`validate_args`	Python `bool`, default `False`. When `True`, distribution parameters are checked for validity despite possibly degrading runtime performance. When `False`, invalid inputs may silently render incorrect outputs. Default value: `False`.
`**init_kwargs`	Additional keyword arguments passed through to `cls.__init__`. These take precedence in case of collision with the fitted parameters; for example, `tfd.Normal.experimental_fit([1., 1.], scale=20.)` returns a Normal distribution with `scale=20.` rather than the maximum likelihood parameter `scale=0.`.

Args
`value`	`float` or `double` `Tensor`.
`backward_compat`	`bool` specifying whether to fall back to returning `FullSpace` as the tangent space, and representing R^n with the standard basis.
`**kwargs`	Named arguments forwarded to subclass implementation.

Returns
`log_prob`	a `Tensor` representing the log probability density, of shape `sample_shape(x) + self.batch_shape` with values of type `self.dtype`.
`tangent_space`	a `TangentSpace` object (by default `FullSpace`) representing the tangent space to the manifold at `value`.

Args
`sample_shape`	integer `Tensor` desired shape of samples to draw. Default value: `()`.
`seed`	PRNG seed; see `tfp.random.sanitize_seed` for details. Default value: `None`.
`name`	name to give to the op. Default value: `'sample_and_log_prob'`.
`**kwargs`	Named arguments forwarded to subclass implementation.

Returns
`samples`	a `Tensor`, or structure of `Tensor`s, with prepended dimensions `sample_shape`.
`log_prob`	a `Tensor` of shape `sample_shape(x) + self.batch_shape` with values of type `self.dtype`.

Args
`sample_shape`	`Tensor` or python list/tuple. Desired shape of a call to `sample()`.
`name`	name to prepend ops with.

Args
`dtype`	Optional float `dtype` to assume for continuous-valued parameters. Some constraining bijectors require advance knowledge of the dtype because certain constants (e.g., `tfb.Softplus.low`) must be instantiated with the same dtype as the values to be transformed.
`num_classes`	Optional `int` `Tensor` number of classes to assume when inferring the shape of parameters for categorical-like distributions. Otherwise ignored.

Args
`sample_shape`	0D or 1D `int32` `Tensor`. Shape of the generated samples.
`seed`	PRNG seed; see `tfp.random.sanitize_seed` for details.
`name`	name to give to the op.
`**kwargs`	Named arguments forwarded to subclass implementation.

tfp.substrates.jax.distributions.TransformedDistribution Stay organized with collections Save and categorize content based on your preferences.

View aliases

Args

Attributes

Methods

batch_shape_tensor

cdf

copy

covariance

cross_entropy

entropy

event_shape_tensor

experimental_default_event_space_bijector

experimental_fit

experimental_local_measure

experimental_sample_and_log_prob

is_scalar_batch

is_scalar_event

kl_divergence

log_cdf

log_prob

log_survival_function

mean

mode

param_shapes

param_static_shapes

parameter_properties

prob

quantile

sample

stddev

survival_function

unnormalized_log_prob

variance

__getitem__

__iter__

tfp.substrates.jax.distributions.TransformedDistribution

`batch_shape_tensor`

`cdf`

`copy`

`covariance`

`cross_entropy`

`entropy`

`event_shape_tensor`

`experimental_default_event_space_bijector`

`experimental_fit`

`experimental_local_measure`

`experimental_sample_and_log_prob`

`is_scalar_batch`

`is_scalar_event`

`kl_divergence`

`log_cdf`

`log_prob`

`log_survival_function`

`mean`

`mode`

`param_shapes`

`param_static_shapes`

`parameter_properties`

`prob`

`quantile`

`sample`

`stddev`

`survival_function`

`unnormalized_log_prob`

`variance`

`getitem`

`iter`