Outputs a tensor containing the reduction across all input tensors.
tf.raw_ops.NcclAllReduce(
input, reduction, num_devices, shared_name, name=None
)
Outputs a tensor containing the reduction across all input tensors passed to ops within the same `shared_name`.
The graph should be constructed so if one op runs with shared_name value c,
then num_devices ops will run with shared_name value c. Failure to do so
will cause the graph execution to fail to complete.
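The rendezvous requirement above can be pictured with a plain-Python analogy. This is a minimal sketch, not TensorFlow code: `run_participants` is a hypothetical helper, and `threading.Barrier` stands in for the group of ops that share one `shared_name`. The timeout exists only so the example terminates; the text above only says that execution fails to complete, not that a timeout applies.

```python
import threading


def run_participants(num_devices, num_started, timeout=0.5):
    """Start `num_started` workers that all wait on a barrier sized for
    `num_devices` parties, and return how many got past the barrier.

    Hypothetical helper for illustration only; it does not call TensorFlow.
    """
    barrier = threading.Barrier(num_devices)  # plays the role of one shared_name
    completed = []
    lock = threading.Lock()

    def participant(rank):
        try:
            # Each worker blocks until num_devices workers have arrived.
            barrier.wait(timeout=timeout)
        except threading.BrokenBarrierError:
            # The group never filled up, so nobody completes.
            return
        with lock:
            completed.append(rank)

    threads = [
        threading.Thread(target=participant, args=(rank,))
        for rank in range(num_started)
    ]
    for t in threads:
        t.start()
    for t in threads:
        t.join()
    return len(completed)


if __name__ == "__main__":
    print(run_participants(num_devices=3, num_started=3))  # all three complete
    print(run_participants(num_devices=3, num_started=2, timeout=0.2))  # none complete
```

In the analogy, starting fewer workers than the barrier expects leaves every worker stuck until the illustrative timeout fires, which mirrors a graph in which fewer than `num_devices` ops run with a given `shared_name`.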
input: The input to the reduction.
data: The value of the reduction across all num_devices devices (the op's output).
reduction: The reduction operation to perform.
num_devices: The number of devices participating in this reduction.
shared_name: Identifier that is shared between ops of the same reduction.
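To make the relationship between `input`, `reduction`, `num_devices`, and `data` concrete, here is a small emulation in plain Python. It is a sketch under stated assumptions: `emulate_all_reduce` is a hypothetical helper (not part of TensorFlow), each device's input is modelled as a flat list of numbers, and the reduction is assumed to be applied elementwise over same-length inputs.

```python
from functools import reduce
import operator

# The four reduction names accepted by the op, mapped to binary Python functions.
REDUCERS = {
    "min": min,
    "max": max,
    "prod": operator.mul,
    "sum": operator.add,
}


def emulate_all_reduce(inputs, reduction):
    """Emulate an all-reduce over `inputs`, one flat list per device.

    Hypothetical helper for illustration only. Assumes an elementwise
    reduction over inputs of equal length. Returns one `data` list per
    device; every device receives the same values.
    """
    if reduction not in REDUCERS:
        raise ValueError(f"unsupported reduction: {reduction!r}")
    if not inputs:
        raise ValueError("at least one input is required")
    if len({len(values) for values in inputs}) != 1:
        raise ValueError("all inputs must have the same length")

    op = REDUCERS[reduction]
    # zip(*inputs) groups the i-th element from every device together.
    data = [reduce(op, column) for column in zip(*inputs)]
    # num_devices is implied by len(inputs); each device gets its own copy.
    return [list(data) for _ in inputs]


if __name__ == "__main__":
    per_device = [[1, 2], [3, 4], [5, 6]]  # num_devices = 3
    print(emulate_all_reduce(per_device, "sum"))   # [[9, 12], [9, 12], [9, 12]]
    print(emulate_all_reduce(per_device, "prod"))  # [[15, 48], [15, 48], [15, 48]]
```

The real op receives one `input` per op instance rather than a list of all inputs; the list form here is only a convenience for showing the result each participant would see.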
| Args | |
|---|---|
| `input` | A `Tensor`. Must be one of the following types: `half`, `float32`, `float64`, `int32`, `int64`. |
| `reduction` | A `string` from: `"min"`, `"max"`, `"prod"`, `"sum"`. |
| `num_devices` | An `int`. |
| `shared_name` | A `string`. |
| `name` | A name for the operation (optional). |
| Returns | |
|---|---|
| A `Tensor`. Has the same type as `input`. | |