tff.analytics.heavy_hitters.iblt.decode_iblt_tf
Decode a IBLT sketch.
tff.analytics.heavy_hitters.iblt.decode_iblt_tf(
iblt: tf.Tensor,
capacity: int,
string_max_bytes: int,
*,
encoding: tff.analytics.heavy_hitters.iblt.CharacterEncoding
= tff.analytics.heavy_hitters.iblt.CharacterEncoding.UTF8
,
seed: int = 0,
repetitions: int = DEFAULT_REPETITIONS,
hash_family: Optional[str] = None,
hash_family_params: Optional[dict[str, Union[int, float]]] = None,
field_size: int = DEFAULT_FIELD_SIZE
) -> tuple[tf.Tensor, tf.Tensor, tf.Tensor]
This function wraps IbltDecoder
to decode iblt
and returns frequencies
of decoded strings.
Args |
iblt
|
Tensor representing the IBLT computed by the IbltEncoder.
|
capacity
|
Number of distinct strings that we expect to be inserted.
|
string_max_bytes
|
Maximum length of a string in bytes that can be inserted.
|
encoding
|
The character encoding of the string data to decode. For
non-character binary data or strings with unknown encoding, specify
CharacterEncoding.UNKNOWN .
|
seed
|
Integer seed for hash functions. Defaults to 0.
|
repetitions
|
Number of repetitions in IBLT data structure (must be >= 3).
Defaults to 3.
|
hash_family
|
A str specifying the hash family to use to construct IBLT.
Options include coupled or random, default is chosen based on capacity.
|
hash_family_params
|
An optional dict of parameters that the hash family
hasher expects. Defaults are chosen based on capacity.
|
field_size
|
The field size for all values in IBLT. Defaults to 2**31 - 1.
|
Returns |
(out_strings, out_counts, num_not_decoded) where out_strings is
a tf.Tensor containing all the decoded strings, out_counts is a
tf.Tensor containing the counts of each string and num_not_decoded is
a tf.Tensor with the number of items not decoded in the IBLT.
|
Except as otherwise noted, the content of this page is licensed under the Creative Commons Attribution 4.0 License, and code samples are licensed under the Apache 2.0 License. For details, see the Google Developers Site Policies. Java is a registered trademark of Oracle and/or its affiliates.
Last updated 2024-09-20 UTC.
[null,null,["Last updated 2024-09-20 UTC."],[],[]]