Update relevant entries in 'var' and 'accum' according to the momentum scheme.
tf.raw_ops.SparseApplyMomentum(
    var, accum, lr, grad, indices, momentum, use_locking=False, use_nesterov=False,
    name=None
)
Set use_nesterov = True if you want to use Nesterov momentum.
That is for rows we have grad for, we update var and accum as follows:
 $$accum = accum * momentum + grad$$ 
 $$var -= lr * accum$$ 
| Args | |
|---|---|
| var | A mutable Tensor. Must be one of the following types:float32,float64,int32,uint8,int16,int8,complex64,int64,qint8,quint8,qint32,bfloat16,uint16,complex128,half,uint32,uint64.
Should be from a Variable(). | 
| accum | A mutable Tensor. Must have the same type asvar.
Should be from a Variable(). | 
| lr | A Tensor. Must have the same type asvar.
Learning rate. Must be a scalar. | 
| grad | A Tensor. Must have the same type asvar. The gradient. | 
| indices | A Tensor. Must be one of the following types:int32,int64.
A vector of indices into the first dimension of var and accum. | 
| momentum | A Tensor. Must have the same type asvar.
Momentum. Must be a scalar. | 
| use_locking | An optional bool. Defaults toFalse.
IfTrue, updating of the var and accum tensors will be protected
by a lock; otherwise the behavior is undefined, but may exhibit less
contention. | 
| use_nesterov | An optional bool. Defaults toFalse.
IfTrue, the tensor passed to compute grad will be
var - lr * momentum * accum, so in the end, the var you get is actually
var - lr * momentum * accum. | 
| name | A name for the operation (optional). | 
| Returns | |
|---|---|
| A mutable Tensor. Has the same type asvar. |