Python ate_exploitability Examples

Programming language: Python

Namespace/package name: open_spiel.python.algorithms.adidas_utils.helpers.symmetric.exploitability

Method/function: ate_exploitability

Examples at hotexamples.com: 4

Python ate_exploitability - 4 examples found. These are the top-rated real-world Python examples of open_spiel.python.algorithms.adidas_utils.helpers.symmetric.exploitability.ate_exploitability, extracted from open source projects.

Example #1

File: ate.py Project: deepmind/open_spiel

    def exploitability(self, params, payoff_matrices):
        """Compute and return Tsallis entropy regularized exploitability.

        Args:
          params: tuple of params (dist, y), see ate.gradients
          payoff_matrices: (>=2 x A x A) np.array, payoffs for each joint action
        Returns:
          float, exploitability of current dist
        """
        # Here `exp` is the exploitability helper module, not a number.
        return exp.ate_exploitability(params, payoff_matrices, self.p)

Example #2

File: exploitability_test.py Project: deepmind/open_spiel

    def test_ate_exploitability_of_rand(self, payoff_tensor, p, seed=None):
        trials = 100
        random = np.random.RandomState(seed)
        num_strategies = payoff_tensor.shape[-1]
        # Draw random non-negative weights and normalize each row to sum to 1.
        dists = random.rand(trials, num_strategies)
        dists /= np.sum(dists, axis=1, keepdims=True)
        exploitable = []
        for dist in dists:
            exp = exploitability.ate_exploitability(dist, payoff_tensor, p)
            exploitable.append(exp > 0.)
        # Percentage of sampled strategies with strictly positive exploitability.
        perc = 100 * np.mean(exploitable)
        logging.info('rand strat exploitable rate out of %d is %f', trials,
                     perc)
        self.assertEqual(perc, 100., 'found rand strat that was nash')
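The sampling step in the test above (draw non-negative weights, then divide each row by its sum) can be sketched without NumPy. This is only an illustration: `random_distributions` is a hypothetical helper name, not part of open_spiel, and it uses the standard-library `random` module instead of `np.random.RandomState`.

```python
import random


def random_distributions(trials, num_strategies, seed=None):
    """Return `trials` random probability vectors of length `num_strategies`.

    Hypothetical helper for illustration; not an open_spiel function.
    """
    rng = random.Random(seed)
    dists = []
    for _ in range(trials):
        # rng.random() is in [0, 1), so 1 - rng.random() is in (0, 1]
        # and the row sum can never be zero.
        weights = [1.0 - rng.random() for _ in range(num_strategies)]
        total = sum(weights)
        dists.append([w / total for w in weights])
    return dists


dists = random_distributions(trials=100, num_strategies=3, seed=0)
```

Each row of `dists` is a mixed strategy over three actions, which is the kind of input the test passes to `ate_exploitability` one row at a time.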

Example #3

File: exploitability_test.py Project: deepmind/open_spiel

    def test_ate_exploitability_of_non_nash(self, payoff_tensor, p, dist, exp):
        # assumes symmetric games
        exp_pred = exploitability.ate_exploitability(dist, payoff_tensor, p)
        self.assertAlmostEqual(exp_pred,
                               exp,
                               msg='dist should have the given exploitability')

Example #4

File: exploitability_test.py Project: deepmind/open_spiel

    def test_ate_exploitability_of_nash(self, payoff_tensor, nash, p):
        # assumes symmetric games
        exp = exploitability.ate_exploitability(nash, payoff_tensor, p)
        self.assertGreaterEqual(
            0., exp, 'uniform nash should have zero exploitability')
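To make the idea behind the Nash and non-Nash tests concrete, here is a minimal sketch of the plain, unregularized exploitability of a mixed strategy in a two-player symmetric game: the best pure-strategy payoff against the strategy minus the payoff the strategy earns against itself. This is not the Tsallis-regularized `ate_exploitability` from open_spiel. `plain_exploitability` is a hypothetical name, and the rock-paper-scissors matrix is an assumed example rather than a fixture from the tests.

```python
def plain_exploitability(dist, payoff_matrix):
    """Best-response gain against `dist` in a symmetric two-player game.

    Hypothetical helper for illustration; not an open_spiel function.
    payoff_matrix[i][j] is the row player's payoff for playing i against j.
    """
    n = len(dist)
    # Expected payoff of each pure strategy against an opponent playing dist.
    pure_payoffs = [
        sum(payoff_matrix[i][j] * dist[j] for j in range(n)) for i in range(n)
    ]
    # Expected payoff of playing dist against itself.
    current = sum(dist[i] * pure_payoffs[i] for i in range(n))
    return max(pure_payoffs) - current


# Rock-paper-scissors payoffs for the row player (assumed example game).
rps = [[0, -1, 1],
       [1, 0, -1],
       [-1, 1, 0]]

uniform = [1 / 3, 1 / 3, 1 / 3]
always_rock = [1.0, 0.0, 0.0]

nash_gap = plain_exploitability(uniform, rps)
rock_gap = plain_exploitability(always_rock, rps)
```

In this sketch the uniform strategy, which is the Nash equilibrium of rock-paper-scissors, has a gap of 0, while always playing rock has a gap of 1 because paper beats it every time.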