tf.raw_ops.SparseApplyProximalGradientDescent
Sparse update '*var' as FOBOS algorithm with fixed learning rate.
View aliases
Compat aliases for migration
See
Migration guide for
more details.
tf.compat.v1.raw_ops.SparseApplyProximalGradientDescent
tf.raw_ops.SparseApplyProximalGradientDescent(
var, alpha, l1, l2, grad, indices, use_locking=False, name=None
)
That is for rows we have grad for, we update var as follows:
proxv=var−alpha∗grad
var=sign(proxv)/(1+alpha∗l2)∗max|proxv|−alpha∗l1,0
Args |
var
|
A mutable Tensor . Must be one of the following types: float32 , float64 , int32 , uint8 , int16 , int8 , complex64 , int64 , qint8 , quint8 , qint32 , bfloat16 , qint16 , quint16 , uint16 , complex128 , half , uint32 , uint64 .
Should be from a Variable().
|
alpha
|
A Tensor . Must have the same type as var .
Scaling factor. Must be a scalar.
|
l1
|
A Tensor . Must have the same type as var .
L1 regularization. Must be a scalar.
|
l2
|
A Tensor . Must have the same type as var .
L2 regularization. Must be a scalar.
|
grad
|
A Tensor . Must have the same type as var . The gradient.
|
indices
|
A Tensor . Must be one of the following types: int32 , int64 .
A vector of indices into the first dimension of var and accum.
|
use_locking
|
An optional bool . Defaults to False .
If True, the subtraction will be protected by a lock;
otherwise the behavior is undefined, but may exhibit less contention.
|
name
|
A name for the operation (optional).
|
Returns |
A mutable Tensor . Has the same type as var .
|
Except as otherwise noted, the content of this page is licensed under the Creative Commons Attribution 4.0 License, and code samples are licensed under the Apache 2.0 License. For details, see the Google Developers Site Policies. Java is a registered trademark of Oracle and/or its affiliates. Some content is licensed under the numpy license.
Last updated 2024-04-26 UTC.
[null,null,["Last updated 2024-04-26 UTC."],[],[]]