Python MancalaEnv.next_states Examples

Programming Language: Python

Namespace/Package Name: magent.mancala

Class/Type: MancalaEnv

Method/Function: next_states

Examples at hotexamples.com: 2

Python MancalaEnv.next_states - 2 examples found. These are the top rated real world Python examples of magent.mancala.MancalaEnv.next_states extracted from open source projects. You can rate examples to help us improve the quality of examples.

Frequently Used Methods

Show Hide

MancalaEnv(10)

clone(7)

get_legal_moves(4)

perform_move(3)

compute_end_game_reward(2)

get_action_mask_with_no_pie(2)

is_game_over(2)

is_legal(2)

next_states(2)

get_player_utility(1)

get_winner(1)

is_legal_action(1)

make_move(1)

Example #1

Show file

File: alphabeta.py Project: crisbodnar/KalahAI

    def _alpha_beta_search(game: MancalaEnv, alpha=-np.inf, beta=np.inf, depth=5):
        """Search game to determine best action; use alpha-beta pruning.
        This version cuts off search and uses an evaluation function."""
        if depth == 0 or game.is_game_over():
            return game.get_player_utility()

        if game.side_to_move == Side.SOUTH:
            v = -np.inf
            for (_, new_s) in game.next_states():
                v = max(v, AlphaBeta._alpha_beta_search(new_s, alpha, beta, depth - 1))
                alpha = max(alpha, v)
                # if beta <= alpha:
                #     break
        else:
            v = np.inf
            for (_, new_s) in game.next_states():
                v = min(v, AlphaBeta._alpha_beta_search(new_s, alpha, beta, depth - 1))
                beta = min(beta, v)
                # if beta <= alpha:
                #     break
        return v

Example #2

Show file

File: alphabeta.py Project: crisbodnar/KalahAI

    def search(self, game: MancalaEnv) -> Move:
        values = [(a, self._alpha_beta_search(game=state, depth=self.depth)) for a, state in game.next_states()]
        np.random.shuffle(values)

        if game.side_to_move == Side.SOUTH:
            action, _ = max(values, key=lambda x: x[1])
        else:
            action, _ = min(values, key=lambda x: x[1])
        return action