Python child 예제들

프로그래밍 언어: Python

네임스페이스/패키지 이름: open_spiel.python.policy

메소드/함수: child

hotexamples.com에서의 예제들: 5

Python child - 5개의 예제가 발견되었습니다. 이것들은 오픈소스 프로젝트에서 추출된 Python의 open_spiel.python.policy.child에 대한 실세계 최고 등급의 예제들입니다. 예제들을 평가하여 예제의 품질 향상에 도움을 줄 수 있습니다.

예제 #1

파일 보기

 def test_child_function_expected_behavior_for_sim_game(self):
     """Test expected behavior of child on simultaneous games."""
     game = pyspiel.load_game("python_iterated_prisoners_dilemma")
     parameter_state = game.new_initial_state()
     actions = [1, 1]
     new_state = policy.child(parameter_state, actions)
     self.assertEqual(str(new_state), ("p0:D p1:D"))

예제 #2

파일 보기

 def test_child_function_expected_behavior_for_seq_game(self):
     """Test expected behavior of child on sequential games."""
     game = pyspiel.load_game("tic_tac_toe")
     initial_state = game.new_initial_state()
     action = 3
     new_state = policy.child(initial_state, action)
     self.assertNotEqual(new_state.history(), initial_state.history())
     expected_new_state = initial_state.child(action)
     self.assertNotEqual(new_state, expected_new_state)
     self.assertEqual(new_state.history(), expected_new_state.history())

예제 #3

파일 보기

 def decision_nodes(self, parent_state):
     """Yields a (state, cf_prob) pair for each descendant decision node."""
     if not parent_state.is_terminal():
         if (parent_state.current_player() == self._player_id
                 or parent_state.is_simultaneous_node()):
             yield (parent_state, 1.0)
         for action, p_action in self.transitions(parent_state):
             for state, p_state in self.decision_nodes(
                     openspiel_policy.child(parent_state, action)):
                 yield (state, p_state * p_action)

예제 #4

파일 보기

파일: expected_game_score.py 프로젝트: sarahperrin/open_spiel

def policy_value(state,
                 policies: Union[List[policy.Policy], policy.Policy],
                 probability_threshold: float = 0):
    """Returns the expected values for the state for players following `policies`.

  Computes the expected value of the`state` for each player, assuming player `i`
  follows the policy given in `policies[i]`.

  Args:
    state: A `pyspiel.State`.
    policies: A `list` of `policy.Policy` objects, one per player for sequential
      games, one policy for simulatenous games.
    probability_threshold: only sum over entries with prob greater than this
      (default: 0).

  Returns:
    A `numpy.array` containing the expected value for each player.
  """
    if state.is_terminal():
        return np.array(state.returns())
    else:
        return sum(prob * policy_value(policy.child(state, action), policies)
                   for action, prob in _transitions(state, policies)
                   if prob > probability_threshold)

예제 #5

파일 보기

 def test_child_function_failure_behavior_for_sim_game(self):
     """Test failure behavior of child on simultaneous games."""
     game = pyspiel.load_game("python_iterated_prisoners_dilemma")
     parameter_state = game.new_initial_state()
     with self.assertRaises(AssertionError):
         policy.child(parameter_state, 0)