Python _AutoShardDatasetV1の例

プログラミング言語: Python

名前空間/パッケージ名: tensorflow.python.data.experimental.ops.distribute

メソッド/関数: _AutoShardDatasetV1

hotexamples.comのコード掲載数: 2

Python _AutoShardDatasetV1 - 2件のコード例が見つかりました。すべてオープンソースプロジェクトから抽出されたPythonのtensorflow.python.data.experimental.ops.distribute._AutoShardDatasetV1の実例で、最も評価が高いものを厳選しています。コード例の評価を行っていただくことで、より質の高いコード例が表示されるようになります。

コード例 #1

ファイルを表示

ファイル: input_ops.py プロジェクト: zjwangmin/tensorflow

def auto_shard_dataset(dataset, num_shards, index, num_replicas_in_sync=None):
    """Shard the input pipeline by sharding the underlying list of files.

  Args:
    dataset: A `tf.data.Dataset` instance, typically the result of a bunch of
      dataset transformations.
    num_shards: A `tf.int64` scalar `tf.Tensor`, representing the number of
        shards operating in parallel. Same usage as in `tf.data.Dataset.shard`.
    index: A `tf.int64` scalar `tf.Tensor`, representing the worker index.
      Same usage as in `tf.data.Dataset.shard`.
    num_replicas_in_sync: An integer representing the total number of replicas
      across all workers. This is used in the rewrite when sharding by data.

  Returns:
    A modified `Dataset` obtained by updating the pipeline sharded by the
    files. The input dataset will be returned if we cannot automatically
    determine a good way to shard the input dataset.
  """
    if (dataset.options().experimental_distribute.auto_shard_policy !=
            AutoShardPolicy.OFF):
        if num_replicas_in_sync is None:
            num_replicas_in_sync = 1
        if isinstance(dataset, dataset_ops.DatasetV1):
            return distribute._AutoShardDatasetV1(dataset, num_shards, index,
                                                  num_replicas_in_sync)
        else:
            return distribute._AutoShardDataset(dataset, num_shards, index,
                                                num_replicas_in_sync)
    else:
        return dataset

コード例 #2

ファイルを表示

def auto_shard_dataset(dataset, num_shards, index):
  """Shard the input pipeline by sharding the underlying list of files.

  Args:
    dataset: A `tf.data.Dataset` instance, typically the result of a bunch of
      dataset transformations.
    num_shards: A `tf.int64` scalar `tf.Tensor`, representing the number of
        shards operating in parallel. Same usage as in `tf.data.Dataset.shard`.
    index: A `tf.int64` scalar `tf.Tensor`, representing the worker index.
      Same usage as in `tf.data.Dataset.shard`.

  Returns:
    A modified `Dataset` obtained by updating the pipeline sharded by the
    files. The input dataset will be returned if we cannot automatically
    determine a good way to shard the input dataset.
  """
  if isinstance(dataset, dataset_ops.DatasetV1):
    return distribute._AutoShardDatasetV1(dataset, num_shards, index)
  else:
    return distribute._AutoShardDataset(dataset, num_shards, index)