Thanks for tuning in to Google I/O. View all sessions on demandWatch on demand

Scales the input by a trainable scalar weight.

This is useful for applying ReZero to layers, which improves convergence speed. This implements the paper: ReZero is All You Need: Fast Convergence at Large Depth. (

initializer A str of initializer for the scalar weight.
regularizer A tf.keras.regularizers.Regularizer for the scalar weight.
**kwargs Additional keyword arguments to be passed to this layer.



View source

Calls the layer with the given inputs.