def _action(self, time_step, policy_state, seed: Optional[types.Seed] = None):
    """Forwards a (possibly unbatched) py time_step to the wrapped TF policy.

    Args:
      time_step: A (nested) time step, batched or not depending on
        `self._batch_time_steps`.
      policy_state: Policy state fed straight through to the underlying
        policy (kept in its native Tensor form; see note on the return).
      seed: Optional seed forwarded to the policy's action function.

    Returns:
      A `PolicyStep`; `action` and `info` are unbatched back to arrays when
      `self._batch_time_steps` is set, while `state` is left untouched.
    """
    if seed is not None and self._use_tf_function:
        logging.warning(
            'Using `seed` may force a retrace for each call to `action`.')

    if self._batch_time_steps:
        time_step = nest_utils.batch_nested_array(time_step)
    # Convert eagerly so the tf.function always sees Tensors; feeding raw
    # numpy arrays would trigger a retrace on every call.
    time_step = tf.nest.map_structure(tf.convert_to_tensor, time_step)

    # Only pass `seed` through when one was supplied, so the no-seed call
    # signature stays identical to the original.
    seed_kwargs = {} if seed is None else {'seed': seed}
    step = self._policy_action_fn(time_step, policy_state, **seed_kwargs)

    if self._batch_time_steps:
        # We intentionally do not convert the `state` so it is outputted as
        # the underlying policy generated it (i.e. in the form of a Tensor)
        # which is not necessarily compatible with a py-policy. However, we
        # do so since the `state` is fed back to the policy. So if it was
        # converted, it'd be required to convert back to the original form
        # before calling the method `action` of the policy again in the next
        # step. If one wants to store the `state` e.g. in replay buffer,
        # then we suggest placing it into the `info` field.
        return step._replace(
            action=nest_utils.unbatch_nested_tensors_to_arrays(step.action),
            info=nest_utils.unbatch_nested_tensors_to_arrays(step.info))
    return step
def _action(self, time_step, policy_state):
    """Batches a py time_step, runs the TF policy, and unbatches the result.

    Args:
      time_step: An unbatched (nested) py time step.
      policy_state: Policy state passed through unmodified to the underlying
        policy (kept as Tensors; see note below).

    Returns:
      A `PolicyStep` whose `action` and `info` have been unbatched back to
      arrays; `state` is returned exactly as the policy produced it.
    """
    batched = nest_utils.batch_nested_array(time_step)
    # Convert eagerly so the tf.function always sees Tensors; feeding raw
    # numpy arrays would trigger a retrace on every call.
    batched = tf.nest.map_structure(tf.convert_to_tensor, batched)
    step = self._policy_action_fn(batched, policy_state)
    unbatched_action = nest_utils.unbatch_nested_tensors_to_arrays(step.action)
    unbatched_info = nest_utils.unbatch_nested_tensors_to_arrays(step.info)
    # We intentionally do not convert the `state` so it is outputted as the
    # underlying policy generated it (i.e. in the form of a Tensor) which is
    # not necessarily compatible with a py-policy. However, we do so since
    # the `state` is fed back to the policy. So if it was converted, it'd be
    # required to convert back to the original form before calling the
    # method `action` of the policy again in the next step. If one wants to
    # store the `state` e.g. in replay buffer, then we suggest placing it
    # into the `info` field.
    return step._replace(action=unbatched_action, info=unbatched_info)