Join TensorFlow at Google I/O, May 11-12 Register now


Compute data statistics for the input pandas DataFrame.

This is a utility method for users with in-memory data represented as a pandas DataFrame.

dataframe Input pandas DataFrame.
stats_options tfdv.StatsOptions for generating data statistics.
n_jobs Number of processes to run (defaults to 1). If -1 is provided, uses the same number of processes as the number of CPU cores.

A DatasetFeatureStatisticsList proto.