Help protect the Great Barrier Reef with TensorFlow on Kaggle Join Challenge


Creates a beam pipeline yielding TFDS examples.

Each dataset shard will be processed in parallel.


builder = tfds.builder('my_dataset')

_ = (
    | tfds.beam.ReadFromTFDS(builder, split='train')
    | beam.Map(tfds.as_numpy)
    | ...

Use tfds.as_numpy to convert each examples from tf.Tensor to numpy.

pipeline beam pipeline (automatically set)
builder Dataset builder to load
split Split name to load (e.g. train+test, train)
**as_dataset_kwargs Arguments forwarded to builder.as_dataset.

The PCollection containing the TFDS examples.