tf.raw_ops.QuantizedMatMulWithBiasAndRequantize
tf.raw_ops.QuantizedMatMulWithBiasAndRequantize(
    a,
    b,
    bias,
    min_a,
    max_a,
    min_b,
    max_b,
    min_freezed_output,
    max_freezed_output,
    Toutput=tf.dtypes.quint8,
    transpose_a=False,
    transpose_b=False,
    input_quant_mode='MIN_FIRST',
    name=None
)
Args
a | A Tensor. Must be one of the following types: qint8, quint8, qint32, qint16, quint16.
b | A Tensor. Must be one of the following types: qint8, quint8, qint32, qint16, quint16.
bias | A Tensor. Must be one of the following types: float32, qint32.
min_a | A Tensor of type float32.
max_a | A Tensor of type float32.
min_b | A Tensor of type float32.
max_b | A Tensor of type float32.
min_freezed_output | A Tensor of type float32.
max_freezed_output | A Tensor of type float32.
Toutput | An optional tf.DType from: tf.qint8, tf.quint8, tf.qint32, tf.qint16, tf.quint16. Defaults to tf.quint8.
transpose_a | An optional bool. Defaults to False.
transpose_b | An optional bool. Defaults to False.
input_quant_mode | An optional string from: "MIN_FIRST", "SCALED". Defaults to "MIN_FIRST".
name | A name for the operation (optional).
Returns
A tuple of Tensor objects (out, min_out, max_out).
out | A Tensor of type Toutput.
min_out | A Tensor of type float32.
max_out | A Tensor of type float32.
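The argument list above suggests a pipeline of dequantize, multiply, add bias, and requantize into a fixed output range. The following is a dependency-free Python sketch of that idea, not the op's actual kernel. It assumes a simple affine mapping between floats and 8-bit levels, ignores the difference between the "MIN_FIRST" and "SCALED" modes, treats bias as plain floats, and assumes the returned range simply echoes min_freezed_output and max_freezed_output. The helper names (quantize, dequantize, reference_qmatmul_bias_requantize) are hypothetical and are not part of TensorFlow.

```python
def quantize(values, lo, hi, levels=256):
    """Map floats in [lo, hi] to integer levels 0..levels-1 (assumed affine scheme)."""
    scale = (hi - lo) / (levels - 1)
    return [
        [min(levels - 1, max(0, round((v - lo) / scale))) for v in row]
        for row in values
    ]


def dequantize(q, lo, hi, levels=256):
    """Inverse of quantize: map integer levels back to floats in [lo, hi]."""
    scale = (hi - lo) / (levels - 1)
    return [[lo + x * scale for x in row] for row in q]


def transpose(m):
    """Transpose a matrix stored as a list of rows."""
    return [list(col) for col in zip(*m)]


def reference_qmatmul_bias_requantize(
    a_q, b_q, bias, min_a, max_a, min_b, max_b,
    min_freezed_output, max_freezed_output,
    transpose_a=False, transpose_b=False,
):
    """Float reference: dequantize inputs, matmul, add bias, requantize.

    Hypothetical helper for illustration only; the real op may differ in
    rounding, accumulation type, and quantization mode handling.
    """
    a = dequantize(a_q, min_a, max_a)
    b = dequantize(b_q, min_b, max_b)
    if transpose_a:
        a = transpose(a)
    if transpose_b:
        b = transpose(b)
    cols_b = transpose(b)
    # One bias value per output column, added after the dot product.
    acc = [
        [sum(x * y for x, y in zip(row, col)) + bias[j]
         for j, col in enumerate(cols_b)]
        for row in a
    ]
    out = quantize(acc, min_freezed_output, max_freezed_output)
    # Assumption: the output range is the frozen range passed in.
    return out, min_freezed_output, max_freezed_output


# Ranges of [0, 255] make each 8-bit level equal to its float value.
a_q = quantize([[1.0, 2.0], [3.0, 4.0]], 0.0, 255.0)
b_q = quantize([[1.0, 2.0], [0.0, 1.0]], 0.0, 255.0)
bias = [1.0, 2.0]

out, min_out, max_out = reference_qmatmul_bias_requantize(
    a_q, b_q, bias, 0.0, 255.0, 0.0, 255.0, 0.0, 255.0
)
print(out, min_out, max_out)
```

Because the output range is fixed ahead of time, any accumulated value outside [min_freezed_output, max_freezed_output] is clamped to the nearest end of the range in this sketch, which is the main design trade-off of requantizing to a frozen range.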
Except as otherwise noted, the content of this page is licensed under the Creative Commons Attribution 4.0 License, and code samples are licensed under the Apache 2.0 License. For details, see the Google Developers Site Policies. Java is a registered trademark of Oracle and/or its affiliates. Some content is licensed under the numpy license.
Last updated 2024-01-23 UTC.