tf.data.experimental.TFRecordWriter
Stay organized with collections
Save and categorize content based on your preferences.
Writes data to a TFRecord file.
tf.data.experimental.TFRecordWriter(
filename, compression_type=None
)
To write a dataset
to a single TFRecord file:
dataset = ... # dataset to be written
writer = tf.data.experimental.TFRecordWriter(PATH)
writer.write(dataset)
To shard a dataset
across multiple TFRecord files:
dataset = ... # dataset to be written
def reduce_func(key, dataset):
filename = tf.strings.join([PATH_PREFIX, tf.strings.as_string(key)])
writer = tf.data.experimental.TFRecordWriter(filename)
writer.write(dataset.map(lambda _, x: x))
return tf.data.Dataset.from_tensors(filename)
dataset = dataset.enumerate()
dataset = dataset.apply(tf.data.experimental.group_by_window(
lambda i, _: i % NUM_SHARDS, reduce_func, tf.int64.max
))
Methods
write
View source
write(
dataset
)
Returns a tf.Operation
to write a dataset to a file.
Returns |
A tf.Operation that, when run, writes contents of dataset to a file.
|
Except as otherwise noted, the content of this page is licensed under the Creative Commons Attribution 4.0 License, and code samples are licensed under the Apache 2.0 License. For details, see the Google Developers Site Policies. Java is a registered trademark of Oracle and/or its affiliates.
Last updated 2020-10-01 UTC.
[null,null,["Last updated 2020-10-01 UTC."],[],[],null,["# tf.data.experimental.TFRecordWriter\n\n\u003cbr /\u003e\n\n|------------------------------------------------------------------------------|-------------------------------------------------------------------------------------------------------------------------------------------|\n| [TensorFlow 2 version](/api_docs/python/tf/data/experimental/TFRecordWriter) | [View source on GitHub](https://github.com/tensorflow/tensorflow/blob/v1.15.0/tensorflow/python/data/experimental/ops/writers.py#L30-L87) |\n\nWrites data to a TFRecord file.\n\n#### View aliases\n\n\n**Compat aliases for migration**\n\nSee\n[Migration guide](https://www.tensorflow.org/guide/migrate) for\nmore details.\n\n[`tf.compat.v1.data.experimental.TFRecordWriter`](/api_docs/python/tf/data/experimental/TFRecordWriter), \\`tf.compat.v2.data.experimental.TFRecordWriter\\`\n\n\u003cbr /\u003e\n\n tf.data.experimental.TFRecordWriter(\n filename, compression_type=None\n )\n\nTo write a `dataset` to a single TFRecord file: \n\n dataset = ... # dataset to be written\n writer = tf.data.experimental.TFRecordWriter(PATH)\n writer.write(dataset)\n\nTo shard a `dataset` across multiple TFRecord files: \n\n dataset = ... # dataset to be written\n\n def reduce_func(key, dataset):\n filename = tf.strings.join([PATH_PREFIX, tf.strings.as_string(key)])\n writer = tf.data.experimental.TFRecordWriter(filename)\n writer.write(dataset.map(lambda _, x: x))\n return tf.data.Dataset.from_tensors(filename)\n\n dataset = dataset.enumerate()\n dataset = dataset.apply(tf.data.experimental.group_by_window(\n lambda i, _: i % NUM_SHARDS, reduce_func, tf.int64.max\n ))\n\nMethods\n-------\n\n### `write`\n\n[View source](https://github.com/tensorflow/tensorflow/blob/v1.15.0/tensorflow/python/data/experimental/ops/writers.py#L68-L87) \n\n write(\n dataset\n )\n\nReturns a [`tf.Operation`](../../../tf/Operation) to write a dataset to a file.\n\n\u003cbr /\u003e\n\n\u003cbr /\u003e\n\n\u003cbr /\u003e\n\n| Args ||\n|-----------|--------------------------------------------------------------------------------------------|\n| `dataset` | a [`tf.data.Dataset`](../../../tf/data/Dataset) whose elements are to be written to a file |\n\n\u003cbr /\u003e\n\n\u003cbr /\u003e\n\n\u003cbr /\u003e\n\n\u003cbr /\u003e\n\n| Returns ||\n|---|---|\n| A [`tf.Operation`](../../../tf/Operation) that, when run, writes contents of `dataset` to a file. ||\n\n\u003cbr /\u003e"]]