Example #1
    def expand(parent: Node) -> Node:
        # Pick one of the parent's unexplored moves at random and build the
        # resulting child state on a clone, so the parent state stays intact.
        child_expansion_move = choice(tuple(parent.unexplored_moves))
        child_state = MancalaEnv.clone(parent.state)
        child_state.perform_move(child_expansion_move)
        child_node = Node(state=child_state,
                          move=child_expansion_move,
                          parent=parent)
        parent.put_child(child_node)
        # Give the fresh node a heuristic value (see _rave_expand below).
        MonteCarloTreePolicy._rave_expand(child_node)
        # go down the tree
        return child_node
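The expansion step above only touches a small part of the Node interface. Below is a minimal sketch of what that interface could look like, inferred from the calls in the snippet; the `children` attribute and the way `unexplored_moves` is initialised are assumptions for illustration, not the project's actual code.

    class NodeSketch:
        """Illustrative stand-in for the Node interface assumed above."""

        def __init__(self, state, move=None, parent=None):
            self.state = state      # a MancalaEnv position
            self.move = move        # the move that led here (None at the root)
            self.parent = parent
            self.children = []      # hypothetical container name
            self.value = 0.0        # filled in by _rave_expand in the snippet above
            # Moves that have not yet been turned into child nodes.
            self.unexplored_moves = set(state.get_legal_moves())

        def put_child(self, child):
            # Register the child and stop treating its move as unexplored.
            self.children.append(child)
            self.unexplored_moves.discard(child.move)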
Example #2
    def _rave_expand(parent: Node):
        # Initialise every hole with a huge negative score so that holes which
        # never get evaluated receive ~zero probability after the softmax.
        moves = [-1e80 for _ in range(parent.state.board.holes + 1)]
        for unexplored_move in parent.unexplored_moves.copy():
            child_state = MancalaEnv.clone(parent.state)
            child_state.perform_move(unexplored_move)
            moves[unexplored_move.index] = evaluation.get_score(
                state=child_state, parent_side=parent.state.side_to_move)

        # Softmax over the heuristic scores (shifted by the max for numerical
        # stability); the node's value becomes the best move's probability.
        moves_dist = np.asarray(moves, dtype=np.float64).flatten()
        exp = np.exp(moves_dist - np.max(moves_dist))
        dist = exp / np.sum(exp)
        parent.value = max(dist)
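The -1e80 placeholders rely on the softmax: after subtracting the maximum, their exponentials underflow to zero, so only evaluated moves receive probability mass. A small self-contained check using only NumPy (the hole scores here are made up for illustration):

    import numpy as np

    scores = np.array([-1e80, 3.0, 1.0, -1e80])   # two evaluated holes, two placeholders
    exp = np.exp(scores - np.max(scores))          # placeholder entries underflow to 0.0
    dist = exp / np.sum(exp)
    print(dist)                                    # ~[0.0, 0.881, 0.119, 0.0]; max(dist) ~ 0.88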
Example #3
    def expand(self, node: AlphaNode):
        # Tactical workaround for the pie move
        if Move(node.state.side_to_move, 0) in node.unexplored_moves:
            node.unexplored_moves.remove(Move(node.state.side_to_move, 0))

        # Query the network for a prior distribution over moves and a value,
        # then attach one child per legal move with its prior probability.
        dist, value = self.network.evaluate_state(node.state)
        for index, prior in enumerate(dist):
            expansion_move = Move(node.state.side_to_move, index + 1)
            if node.state.is_legal(expansion_move):
                child_state = MancalaEnv.clone(node.state)
                child_state.perform_move(expansion_move)
                child_node = AlphaNode(state=child_state,
                                       prior=prior,
                                       move=expansion_move,
                                       parent=node)
                node.put_child(child_node)
        # go down the tree
        return node_utils.select_child_with_maximum_action_value(node)
Example #4
File: mcts.py  Project: crisbodnar/KalahAI
    def search(self, state: MancalaEnv) -> Move:
        # Short-circuit when only one legal move is available
        if len(state.get_legal_moves()) == 1:
            return state.get_legal_moves()[0]

        game_state_root = Node(state=MancalaEnv.clone(state))
        start_time = datetime.datetime.utcnow()
        games_played = 0
        while datetime.datetime.utcnow() - start_time < self.calculation_time:
            node = self.tree_policy.select(game_state_root)        # selection & expansion
            final_state = self.default_policy.simulate(node)       # rollout to a terminal state
            self.rollout_policy.backpropagate(node, final_state)   # backpropagation
            # Debugging information
            games_played += 1
            logging.debug("%s; Game played %i" % (node, games_played))
        logging.debug("%s" % game_state_root)
        chosen_child = node_utils.select_robust_child(game_state_root)
        logging.info("Choosing: %s" % chosen_child)
        return chosen_child.move
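`node_utils.select_robust_child` is not shown on this page; in the MCTS literature the "robust child" is conventionally the most-visited child of the root. Here is a sketch under that assumption; the `children` and `visits` attribute names are hypothetical:

    def select_robust_child_sketch(root):
        # Robust-child rule: prefer the child explored most often, which is
        # usually more stable than picking the child with the highest value.
        return max(root.children, key=lambda child: child.visits)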
Example #5
    def _make_temp_child(parent: Node, move: Move) -> MancalaEnv:
        # Build a throwaway child state by cloning the parent's state and
        # applying the move, without adding a node to the tree.
        child_state = MancalaEnv.clone(parent.state)
        child_state.perform_move(move)
        return child_state
Example #6
    def __init__(self, state: MancalaEnv, action_taken: Move):
        # Store defensive copies so later changes to the live game don't leak in.
        self.state = MancalaEnv.clone(state)
        self.action_taken = Move.clone(action_taken)
Example #7
    def test_cloning_immutability(self):
        clone = MancalaEnv.clone(self.game)
        # Mutating the original game after cloning must not affect the clone.
        self.game.perform_move(Move(Side.SOUTH, 3))

        self.assertEqual(clone.board.get_seeds(Side.SOUTH, 3), 7)
        self.assertEqual(clone.side_to_move, Side.SOUTH)
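The test only passes if MancalaEnv.clone returns a fully independent copy. A minimal sketch of such a method, assuming the environment mainly needs its mutable board duplicated; the real class certainly carries more state, so this is illustration only:

    import copy

    class MancalaEnvSketch:
        """Illustrative stand-in; the real MancalaEnv carries more state."""

        def __init__(self, board, side_to_move):
            self.board = board
            self.side_to_move = side_to_move

        @classmethod
        def clone(cls, other):
            # Deep-copy the mutable board so later moves on the original
            # cannot reach the clone, which is exactly what the test asserts.
            return cls(copy.deepcopy(other.board), other.side_to_move)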