Python multinomial examples

Programming language: Python

Namespace/package name: haiku

Method/function: multinomial

Examples on hotexamples.com: 2

Python multinomial - 2 examples found. These are the top-rated real-world Python examples of haiku.multinomial, extracted from open source projects.

Example #1

File: impala_lite.py Project: shyamalschandra/haiku

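  def step(
      self,
      params: hk.Params,
      rng: jnp.ndarray,
      timestep: dm_env.TimeStep,
  ) -> Tuple[jnp.ndarray, jnp.ndarray]:
    """Steps on a single observation."""
    # Add a leading batch axis of size 1 to every field of the timestep.
    # jax.tree_util.tree_map is used because the jax.tree_map alias is
    # deprecated in recent JAX releases.
    timestep = jax.tree_util.tree_map(lambda t: jnp.expand_dims(t, 0), timestep)
    logits, _ = self._net(params, timestep)
    # Remove the batch axis again.
    logits = jnp.squeeze(logits, axis=0)
    # Draw one action index from the distribution defined by the logits.
    action = hk.multinomial(rng, logits, num_samples=1)
    # Drop the trailing num_samples axis of size 1.
    action = jnp.squeeze(action, axis=-1)
    return action, logits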
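
The example above samples an action index from unnormalized log-probabilities (logits) and then drops the sample axis. The following is a minimal, library-free sketch of that idea, not Haiku's implementation: the helper `sample_from_logits` and the example logits are hypothetical, and Python's `random.Random` stands in for the explicit `rng` argument.

```python
import math
import random
from typing import List, Sequence


def sample_from_logits(
    rng: random.Random, logits: Sequence[float], num_samples: int
) -> List[int]:
    """Draws category indices with probability proportional to exp(logit)."""
    # Subtract the largest logit before exponentiating for numerical stability.
    max_logit = max(logits)
    weights = [math.exp(logit - max_logit) for logit in logits]
    # random.choices normalizes the weights internally.
    return rng.choices(range(len(logits)), weights=weights, k=num_samples)


# Hypothetical inputs: an explicitly seeded generator and three categories,
# where category 1 has effectively zero probability.
rng = random.Random(0)
logits = [0.0, -1e9, 0.0]

samples = sample_from_logits(rng, logits, num_samples=1000)
# With num_samples=1 the result has one element; taking it plays the role
# of squeezing the sample axis in the example above.
action = sample_from_logits(rng, logits, num_samples=1)[0]
```

Passing the generator explicitly, rather than relying on global random state, mirrors how the example threads `rng` through `step`.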

Example #2

    def step(
        self,
        rng_key,
        params: hk.Params,
        timestep: dm_env.TimeStep,
        state: Nest,
    ) -> Tuple[AgentOutput, Nest]:
        """For a given single-step, unbatched timestep, output the chosen action."""
        # Pad timestep, state to be [T, B, ...] and [B, ...] respectively.
        # jax.tree_util.tree_map replaces the deprecated jax.tree_map alias.
        timestep = jax.tree_util.tree_map(lambda t: t[None, None, ...], timestep)
        state = jax.tree_util.tree_map(lambda t: t[None, ...], state)

        net_out, next_state = self._apply_fn(params, timestep, state)
        # Remove the padding from above.
        net_out = jax.tree_util.tree_map(lambda t: jnp.squeeze(t, axis=(0, 1)), net_out)
        next_state = jax.tree_util.tree_map(lambda t: jnp.squeeze(t, axis=0), next_state)
        # Sample an action and return.
        action = hk.multinomial(rng_key, net_out.policy_logits, num_samples=1)
        action = jnp.squeeze(action, axis=-1)
        return AgentOutput(net_out.policy_logits, net_out.value,
                           action), next_state
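
The padding and unpadding steps in the example above can be illustrated without JAX. The sketch below is a toy under stated assumptions: nested lists stand in for arrays, the helpers `tree_map`, `add_leading_axes`, and `drop_leading_axes` are hypothetical stand-ins for JAX's tree mapping, `t[None, None, ...]`, and `jnp.squeeze`, and the timestep fields are made up for illustration.

```python
from typing import Any, Callable


def tree_map(fn: Callable[[Any], Any], tree: Any) -> Any:
    """Applies fn to every leaf of a nested dict (toy stand-in for JAX tree mapping)."""
    if isinstance(tree, dict):
        return {key: tree_map(fn, value) for key, value in tree.items()}
    return fn(tree)


def add_leading_axes(leaf: Any, count: int) -> Any:
    """Wraps a leaf in `count` singleton lists, like t[None, ...] on an array."""
    for _ in range(count):
        leaf = [leaf]
    return leaf


def drop_leading_axes(leaf: Any, count: int) -> Any:
    """Removes `count` singleton leading axes, like squeezing those axes."""
    for _ in range(count):
        if len(leaf) != 1:
            raise ValueError("expected a singleton leading axis")
        leaf = leaf[0]
    return leaf


# Hypothetical unbatched timestep with two fields.
timestep = {"observation": [0.1, 0.2, 0.3], "reward": 1.0}

# Pad every field to [T=1, B=1, ...], as a batched network would expect.
padded = tree_map(lambda t: add_leading_axes(t, 2), timestep)

# Remove the two singleton axes again.
restored = tree_map(lambda t: drop_leading_axes(t, 2), padded)
```

In the example above the unpadding is applied to the network output rather than to the timestep itself; the round trip here only shows that the two operations are inverses of each other.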