View source on GitHub
|
Wraps proto.SplitInfo with an additional property.
tfds.core.SplitInfo(
name: str,
shard_lengths: List[int],
num_bytes: int,
filename_template: Optional[naming.ShardedFileTemplate] = None,
statistics: statistics_pb2.DatasetFeatureStatistics = dataclasses.field(default_factory=statistics_pb2.DatasetFeatureStatistics)
)
| Attributes | |
|---|---|
| `name` | Name of the split (e.g. `train`, `test`, ...). |
| `shard_lengths` | List with one entry per shard (file), giving the number of examples in that shard. |
| `filename_template` | The template used to create sharded filenames. |
| `num_examples` | Total number of examples (`sum(shard_lengths)`). |
| `num_shards` | Number of files (`len(shard_lengths)`). |
| `num_bytes` | Size of the files (in bytes). |
| `statistics` | Additional statistics of the split. |
| `file_instructions` | Returns the list of `dict(filename, take, skip)`. This allows for creating your own `tf.data.Dataset`. |
| `filenames` | Returns the list of filenames. |
| `filepaths` | All the paths for all the files that are part of this split. |
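The relationships between `shard_lengths`, `num_examples`, `num_shards`, and the `filename`/`take`/`skip` entries can be illustrated with a small pure-Python sketch. It does not import TFDS: `SplitSketch`, `sketch_file_instructions`, and the filename pattern are hypothetical stand-ins invented for illustration, and the real library's conventions (for example, how a fully read shard is encoded) may differ.

```python
from dataclasses import dataclass
from typing import Dict, List


@dataclass(frozen=True)
class SplitSketch:
    """Hypothetical stand-in for SplitInfo; not the TFDS class."""
    name: str
    shard_lengths: List[int]
    num_bytes: int

    @property
    def num_examples(self) -> int:
        # Documented as sum(shard_lengths).
        return sum(self.shard_lengths)

    @property
    def num_shards(self) -> int:
        # Documented as len(shard_lengths).
        return len(self.shard_lengths)


def sketch_file_instructions(
    split: SplitSketch, start: int, stop: int
) -> List[Dict[str, object]]:
    """Maps the example range [start, stop) onto per-shard skip/take counts.

    The filename pattern below is made up for illustration only.
    """
    instructions = []
    offset = 0  # Global index of the first example in the current shard.
    for shard_index, length in enumerate(split.shard_lengths):
        shard_start, shard_stop = offset, offset + length
        offset = shard_stop
        # Overlap between the requested range and this shard.
        lo, hi = max(start, shard_start), min(stop, shard_stop)
        if lo >= hi:
            continue  # No examples requested from this shard.
        instructions.append({
            "filename": f"{split.name}-{shard_index:05d}-of-{split.num_shards:05d}",
            "skip": lo - shard_start,
            "take": hi - lo,
        })
    return instructions


split = SplitSketch(name="train", shard_lengths=[100, 100, 50], num_bytes=1_000_000)
print(split.num_examples, split.num_shards)
for instruction in sketch_file_instructions(split, start=150, stop=220):
    print(instruction)
```

In this sketch the first shard is skipped entirely, the second contributes its last 50 examples, and the third contributes its first 20.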
Methods
from_proto
@classmethod
from_proto(
proto: proto_lib.SplitInfo,
filename_template: naming.ShardedFileTemplate
) -> 'SplitInfo'
Returns a SplitInfo class instance from a SplitInfo proto.
replace
replace(
**kwargs
) -> 'SplitInfo'
Returns a copy of the SplitInfo with updated attributes.
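The copy-with-updates behaviour described for `replace` resembles the standard library's `dataclasses.replace`. The sketch below uses a hypothetical stand-in dataclass (`SplitCopySketch`, not the TFDS class) and assumes, without confirmation from the source, that `SplitInfo.replace` behaves similarly: a new instance is returned and fields that are not named keep their previous values.

```python
import dataclasses
from typing import List


@dataclasses.dataclass(frozen=True)
class SplitCopySketch:
    """Hypothetical stand-in for SplitInfo; not the TFDS class."""
    name: str
    shard_lengths: List[int]
    num_bytes: int


original = SplitCopySketch(name="train", shard_lengths=[100, 100], num_bytes=2048)

# dataclasses.replace builds a new instance; unspecified fields are carried over.
renamed = dataclasses.replace(original, name="validation")

print(original.name, renamed.name)
print(renamed.shard_lengths == original.shard_lengths)
```

The original object is left untouched, which is why the stand-in is declared with `frozen=True`.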
to_proto
to_proto() -> proto_lib.SplitInfo
| Class Variables | |
|---|---|
| `filename_template` | `None` |