def step(
    self,
    params: hk.Params,
    rng: jnp.ndarray,
    timestep: dm_env.TimeStep,
) -> Tuple[jnp.ndarray, jnp.ndarray]:
  """Steps on a single observation."""
  # Add a leading batch dimension of size 1 so the network sees [B, ...].
  timestep = jax.tree_map(lambda t: jnp.expand_dims(t, 0), timestep)
  logits, _ = self._net(params, timestep)
  # Drop the batch dimension and sample a single action from the policy.
  logits = jnp.squeeze(logits, axis=0)
  action = hk.multinomial(rng, logits, num_samples=1)
  action = jnp.squeeze(action, axis=-1)
  return action, logits
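A minimal usage sketch for the feedforward step above, assuming a surrounding actor that owns an agent instance, a dm_env environment, the current params, and an RNG key (none of these names appear in the original listing). The method is jitted once and a fresh key is split off for every action:

import jax

def run_episode(agent, env, params, rng):
  """Hypothetical actor loop around the feedforward step sketched above."""
  step_fn = jax.jit(agent.step)  # compile once, reuse on every step
  timestep = env.reset()
  episode_return = 0.0
  while not timestep.last():
    rng, step_rng = jax.random.split(rng)
    # Signature matches the listing: (params, rng, timestep) -> (action, logits).
    action, _ = step_fn(params, step_rng, timestep)
    timestep = env.step(int(action))
    episode_return += timestep.reward
  return episode_return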
def step(
    self,
    rng_key,
    params: hk.Params,
    timestep: dm_env.TimeStep,
    state: Nest,
) -> Tuple[AgentOutput, Nest]:
  """For a given single-step, unbatched timestep, output the chosen action."""
  # Pad timestep, state to be [T, B, ...] and [B, ...] respectively.
  timestep = jax.tree_map(lambda t: t[None, None, ...], timestep)
  state = jax.tree_map(lambda t: t[None, ...], state)

  net_out, next_state = self._apply_fn(params, timestep, state)

  # Remove the padding from above.
  net_out = jax.tree_map(lambda t: jnp.squeeze(t, axis=(0, 1)), net_out)
  next_state = jax.tree_map(lambda t: jnp.squeeze(t, axis=0), next_state)

  # Sample an action and return.
  action = hk.multinomial(rng_key, net_out.policy_logits, num_samples=1)
  action = jnp.squeeze(action, axis=-1)
  return AgentOutput(net_out.policy_logits, net_out.value, action), next_state
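The recurrent variant differs in that the actor must thread the unbatched recurrent state returned by step back into the next call. A hedged sketch along the same lines, assuming the caller supplies an initial recurrent state and that AgentOutput exposes the sampled action under a field named action (the listing constructs the tuple positionally, so the exact field name is an assumption):

import jax

def run_recurrent_episode(agent, env, params, rng, initial_state):
  """Hypothetical actor loop that threads the recurrent state between steps."""
  step_fn = jax.jit(agent.step)
  timestep = env.reset()
  state = initial_state  # unbatched recurrent state, e.g. an LSTM carry
  trajectory = []
  while not timestep.last():
    rng, step_rng = jax.random.split(rng)
    # Signature matches the listing: (rng_key, params, timestep, state).
    agent_out, state = step_fn(step_rng, params, timestep, state)
    # Assumes the AgentOutput namedtuple names its last field 'action'.
    timestep = env.step(int(agent_out.action))
    trajectory.append((agent_out, timestep))
  return trajectory

Collecting the per-step AgentOutput (policy logits, value, action) alongside the environment transitions is what a learner would later consume; the final recurrent state is simply dropped here.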