示例#1
0
def categorical_kl_divergence(p_logits: Array,
                              q_logits: Array,
                              temperature: float = 1.) -> Array:
    """KL divergence between two categorical distributions given by logits.

    Deprecated shim: delegates to `distrax.Softmax.kl_divergence`.

    Args:
      p_logits: unnormalized logits for the first distribution.
      q_logits: unnormalized logits for the second distribution.
      temperature: softmax temperature applied to both distributions (default 1).

    Returns:
      The KL divergence KL(p || q) between the two softmax distributions.
    """
    warnings.warn(
        "Rlax categorical_kl_divergence will be deprecated. "
        "Please use distrax.Softmax.kl_divergence instead.",
        PendingDeprecationWarning,
        stacklevel=2)
    p_dist = distrax.Softmax(p_logits, temperature)
    q_dist = distrax.Softmax(q_logits, temperature)
    return p_dist.kl_divergence(q_dist)
示例#2
0
 def entropy_fn(logits: Array):
     """Softmax entropy, clipped at `entropy_clip` times log(num_actions)."""
     raw_entropy = distrax.Softmax(logits, temperature).entropy()
     # log(n) is the entropy of a uniform distribution over n actions.
     entropy_cap = entropy_clip * jnp.log(logits.shape[-1])
     return jnp.minimum(raw_entropy, entropy_cap)
示例#3
0
 def logprob_fn(sample: Array, logits: Array, action_spec=None):
     """Log-probability of `sample` under the softmax distribution on `logits`."""
     del action_spec  # Unused; kept so all policy fns share one signature.
     dist = distrax.Softmax(logits, temperature)
     return dist.log_prob(sample)
示例#4
0
 def probs_fn(logits: Array, action_spec=None):
     """Probabilities of the softmax distribution parameterized by `logits`."""
     del action_spec  # Unused; kept so all policy fns share one signature.
     dist = distrax.Softmax(logits, temperature)
     return dist.probs
示例#5
0
 def sample_fn(key: Array, logits: Array, action_spec=None):
     """Draw one sample from the softmax distribution using PRNG `key`."""
     del action_spec  # Unused; kept so all policy fns share one signature.
     dist = distrax.Softmax(logits, temperature)
     return dist.sample(seed=key)
示例#6
0
 def entropy_fn(logits: Array):
     """Entropy of the softmax distribution parameterized by `logits`."""
     dist = distrax.Softmax(logits, temperature)
     return dist.entropy()
示例#7
0
 def logprob_fn(sample: Array, logits: Array):
     """Log-probability of `sample` under the softmax distribution on `logits`."""
     dist = distrax.Softmax(logits, temperature)
     return dist.log_prob(sample)
示例#8
0
 def probs_fn(logits: Array):
     """Probabilities of the softmax distribution parameterized by `logits`."""
     dist = distrax.Softmax(logits, temperature)
     return dist.probs
示例#9
0
 def sample_fn(key: Array, logits: Array):
     """Draw one sample from the softmax distribution using PRNG `key`."""
     dist = distrax.Softmax(logits, temperature)
     return dist.sample(seed=key)
示例#10
0
 def entropy_fn(logits: Array):
     """Entropy of the softmax probs after epsilon-mixing with a uniform dist."""
     softmax_probs = distrax.Softmax(logits=logits, temperature=temperature).probs
     mixed_probs = _mix_with_uniform(softmax_probs, epsilon)
     return distrax.Categorical(probs=mixed_probs).entropy()
示例#11
0
 def log_prob_fn(sample: Array, logits: Array):
     """Log-prob of `sample` under the epsilon-mixed softmax distribution."""
     softmax_probs = distrax.Softmax(logits=logits, temperature=temperature).probs
     mixed_probs = _mix_with_uniform(softmax_probs, epsilon)
     return distrax.Categorical(probs=mixed_probs).log_prob(sample)
示例#12
0
 def sample_fn(key: Array, logits: Array):
     """Sample from the epsilon-mixed softmax distribution using PRNG `key`."""
     softmax_probs = distrax.Softmax(logits=logits, temperature=temperature).probs
     mixed_probs = _mix_with_uniform(softmax_probs, epsilon)
     return distrax.Categorical(probs=mixed_probs).sample(seed=key)