tf.raw_ops.NcclAllReduce

Outputs a tensor containing the reduction across all input tensors.

Outputs a tensor containing the reduction across all input tensors passed to ops within the same `shared_name.

The graph should be constructed so if one op runs with shared_name value c, then num_devices ops will run with shared_name value c. Failure to do so will cause the graph execution to fail to complete.

input: the input to the reduction data: the value of the reduction across all num_devices devices. reduction: the reduction operation to perform. num_devices: The number of devices participating in this reduction. shared_name: Identifier that shared between ops of the same reduction.

input A Tensor. Must be one of the following types: half, float32, float64, int32, int64.
reduction A string from: "min", "max", "prod", "sum".
num_devices An int.
shared_name A string.
name A name for the operation (optional).

A Tensor. Has the same type as input.