Python reinit_nested_vars Examples

Programming language: Python

Namespace/package name: agents.ppo.utility

Method/function: reinit_nested_vars

Examples at hotexamples.com: 3

Python reinit_nested_vars - 3 examples found. These are the top-rated real-world Python examples of agents.ppo.utility.reinit_nested_vars extracted from open source projects.

Example #1

  def begin_episode(self, agent_indices):
    """Reset the recurrent states and stored episode.

    Args:
      agent_indices: 1D tensor of batch indices for agents starting an episode.

    Returns:
      Summary tensor.
    """
    with tf.name_scope('begin_episode/'):
      reset_state = utility.reinit_nested_vars(self._last_state, agent_indices)
      reset_buffer = self._episodes.clear(agent_indices)
      with tf.control_dependencies([reset_state, reset_buffer]):
        return tf.constant('')
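
The helper's implementation is not shown in these examples. Judging only from the call site above, it appears to reset the rows of each (possibly nested) state variable that belong to the agents in agent_indices. The following is a minimal plain-Python sketch of that idea, not the library's code: the name reinit_nested_rows, the use of tuples as nesting containers, and the list-of-rows layout are all assumptions made for illustration.

```python
# Plain-Python sketch (no TensorFlow). Assumptions for illustration only:
# tuples are nesting containers, and each leaf "variable" is a list of
# per-agent rows (one row per batch index).

def reinit_nested_rows(variables, indices=None):
    """Zero the selected rows of every leaf in a nested tuple, in place."""
    if isinstance(variables, tuple):
        # Recurse into nested containers.
        for variable in variables:
            reinit_nested_rows(variable, indices)
        return
    # Leaf: reset all rows when no indices are given, else only the listed ones.
    rows = range(len(variables)) if indices is None else indices
    for i in rows:
        variables[i] = [0.0] * len(variables[i])


# Hypothetical recurrent state for a batch of three agents.
last_state = (
    [[1.0, 2.0], [3.0, 4.0], [5.0, 6.0]],
    ([[7.0], [8.0], [9.0]],),
)
reinit_nested_rows(last_state, indices=[0, 2])  # agents 0 and 2 start over
print(last_state)
```

After the call, the rows for agents 0 and 2 are zeroed in every leaf, while agent 1's rows are untouched. In the TensorFlow examples the reset is an op rather than an in-place mutation, which is why it is placed under tf.control_dependencies before the summary tensor is returned.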

Example #2

    def begin_episode(self, agent_indices):
        """Reset the recurrent states and stored episode.

        Args:
          agent_indices: 1D tensor of batch indices for agents starting an episode.

        Returns:
          Summary tensor.
        """
        with tf.name_scope('begin_episode/'):
            reset_state = utility.reinit_nested_vars(self._last_state,
                                                     agent_indices)
            # At the start of an episode, first clear the rows of self._episodes
            # selected by agent_indices, because self._episodes can store only one
            # episode per environment.
            reset_buffer = self._episodes.clear(agent_indices)
            with tf.control_dependencies([reset_state, reset_buffer]):
                return tf.constant('')

Example #3

File: algorithm.py Project: plexzhang/agents

    def begin_episode(self, agent_indices):
        """Reset the recurrent states and stored episode.

        Args:
          agent_indices: Tensor containing current batch indices.

        Returns:
          Summary tensor.
        """
        with tf.name_scope('begin_episode/'):
            if self._last_state is None:
                reset_state = tf.no_op()
            else:
                reset_state = utility.reinit_nested_vars(
                    self._last_state, agent_indices)
            reset_buffer = self._episodes.clear(agent_indices)
            with tf.control_dependencies([reset_state, reset_buffer]):
                return tf.constant('')