Python ApproximateKLDivergence Examples

Programming Language: Python

Namespace/Package Name: trax.rl.rl_layers

Method/Function: ApproximateKLDivergence

Examples at hotexamples.com: 2

Python ApproximateKLDivergence - 2 examples found. These are the top-rated real-world Python examples of trax.rl.rl_layers.ApproximateKLDivergence, extracted from open source projects.

Example #1

File: actor_critic_joint.py Project: srush/trax

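def f(dist_inputs, values, returns, actions, old_log_probs):
  # `self` is not a parameter here, so this function is presumably defined
  # inside a method. `values` and `returns` are not needed for this metric.
  del values, returns
  return rl_layers.ApproximateKLDivergence(
      dist_inputs,
      actions,
      old_log_probs,
      log_prob_fun=self._policy_dist.log_prob)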
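
The `log_prob_fun` argument in the example above is a callable that turns distribution inputs and actions into log-probabilities. As a rough illustration of what such a callable might compute, here is a minimal sketch for a categorical policy. It assumes the distribution inputs are unnormalized logits and that the action is an integer index; the name `categorical_log_prob` is hypothetical and is not part of trax.

```python
import math


def categorical_log_prob(logits, action):
    """Log-probability of `action` under softmax(logits).

    Hypothetical helper for illustration only; not a trax function.
    """
    # Subtract the maximum logit before exponentiating for numerical stability.
    max_logit = max(logits)
    log_normalizer = max_logit + math.log(
        sum(math.exp(x - max_logit) for x in logits))
    return logits[action] - log_normalizer


# With equal logits, every action has the same log-probability.
uniform = categorical_log_prob([0.0, 0.0], 0)
```

In the trax examples the equivalent role is played by `self._policy_dist.log_prob`, whose exact behavior depends on the configured policy distribution.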

Example #2

File: actor_critic_joint.py Project: hugochan/trax

def approximate_kl_divergence(self):
  """Approximate KL divergence layer."""
  return tl.Fn(
      lambda dist_inputs, actions, old_log_probs:
      rl_layers.ApproximateKLDivergence(
          dist_inputs,
          actions,
          old_log_probs,
          log_prob_fun=self._policy_dist.log_prob),
      n_out=1)
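
Both examples compare the log-probabilities of the sampled actions under the current policy with `old_log_probs` recorded under the previous policy. The sketch below shows one commonly used sample-based approximation of the KL divergence, half the mean squared difference of the log-probabilities. This is an assumption made for illustration: the exact formula inside `rl_layers.ApproximateKLDivergence` may differ, so check the trax source before relying on it. The function name `approximate_kl` is hypothetical.

```python
def approximate_kl(new_log_probs, old_log_probs):
    """Half the mean squared log-probability difference.

    Illustrative estimator only; not necessarily the formula trax uses.
    """
    if len(new_log_probs) != len(old_log_probs):
        raise ValueError("log-probability lists must have the same length")
    # Per-sample squared difference between new and old log-probabilities.
    squared_diffs = [
        (new - old) ** 2 for new, old in zip(new_log_probs, old_log_probs)
    ]
    return 0.5 * sum(squared_diffs) / len(squared_diffs)


# Identical policies give zero approximate divergence.
same = approximate_kl([-0.7, -1.2], [-0.7, -1.2])
```

A value near zero suggests the policy has barely moved since the actions were sampled, which is why this quantity is often logged as a training metric in PPO-style methods.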