Gets unique elements and their counts from the input dataset.
tff.analytics.data_processing.get_unique_elements_with_counts(
dataset: tf.data.Dataset, string_max_bytes: Optional[int] = None
) -> tuple[tf.Tensor, tf.Tensor]
This method returns a tuple (elements, counts), where elements holds the unique elements in the dataset and counts holds the number of times each one appears.
The input dataset must yield batched rank-1 tensors. This function reads each coordinate of the tensor as an individual element and caps the total number of elements to return.
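
The counting behaviour described above can be illustrated with a minimal pure-Python sketch. It is not the library's implementation: the helper name `unique_elements_with_counts` is hypothetical, plain lists of byte strings stand in for batched rank-1 `tf.string` tensors, and no cap on the number of returned elements is modelled.

```python
from collections import Counter


def unique_elements_with_counts(batches):
    """Hypothetical stand-in: count every string across all rank-1 batches."""
    counter = Counter()
    for batch in batches:
        # Each coordinate of a batch is treated as one individual element.
        counter.update(batch)
    elements = list(counter.keys())
    counts = [counter[e] for e in elements]
    return elements, counts


# Two "batches", each a rank-1 collection of byte strings.
batches = [[b"a", b"b", b"a"], [b"c", b"a"]]
elements, counts = unique_elements_with_counts(batches)
```

Here `elements` and `counts` are parallel sequences, so `dict(zip(elements, counts))` pairs each unique string with its count. The sketch makes no claim about the order in which the real function returns elements.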
Args | |
---|---|
dataset | A tf.data.Dataset to get elements from. Element type must be tf.string. |
string_max_bytes | The maximum length (in bytes) of strings in the dataset. Strings longer than string_max_bytes will be truncated. Defaults to None, which means there is no limit on the string length. |
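
As a rough illustration of the string_max_bytes argument, the sketch below truncates each string to a byte budget before counting. It is a pure-Python approximation under stated assumptions, not the library's code: the helper `truncate_and_count` is hypothetical, strings are assumed to be UTF-8 encoded, and truncation is assumed to happen before counting, so strings that share a prefix collapse into one element.

```python
from collections import Counter


def truncate_and_count(strings, string_max_bytes=None):
    """Hypothetical helper: truncate to a byte limit, then count uniques."""
    counter = Counter()
    for s in strings:
        raw = s.encode("utf-8")
        if string_max_bytes is not None:
            # Slice by bytes, not characters (assumed to mirror the argument).
            raw = raw[:string_max_bytes]
        counter[raw] += 1
    return dict(counter)


# "hello" and "help" share the 3-byte prefix b"hel"; "é" takes 2 bytes in UTF-8.
result = truncate_and_count(["hello", "help", "héllo"], string_max_bytes=3)
unlimited = truncate_and_count(["hello", "help"])
```

With a 3-byte limit, `"hello"` and `"help"` are counted as the same element, while `"héllo"` keeps only `"hé"` because `é` occupies two bytes. With the default of None, every string is counted unmodified.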