Public API for tf.quantization namespace.
Functions
dequantize(...): Dequantize the 'input' tensor into a float or bfloat16 Tensor.
fake_quant_with_min_max_args(...): Fake-quantize the 'inputs' tensor, type float to 'outputs' tensor of same shape and type.
fake_quant_with_min_max_args_gradient(...): Compute gradients for a FakeQuantWithMinMaxArgs operation.
fake_quant_with_min_max_vars(...): Fake-quantize the 'inputs' tensor of type float via global float scalars
fake_quant_with_min_max_vars_gradient(...): Compute gradients for a FakeQuantWithMinMaxVars operation.
fake_quant_with_min_max_vars_per_channel(...): Fake-quantize the 'inputs' tensor of type float via per-channel floats
fake_quant_with_min_max_vars_per_channel_gradient(...): Compute gradients for a FakeQuantWithMinMaxVarsPerChannel operation.
quantize(...): Quantize the 'input' tensor of type float to 'output' tensor of type 'T'.
quantize_and_dequantize(...): Quantizes then dequantizes a tensor. (deprecated)
quantize_and_dequantize_v2(...): Quantizes then dequantizes a tensor.
quantized_concat(...): Concatenates quantized tensors along one dimension.