Python MancalaEnv.get_action_mask_with_no_pie示例

编程语言: Python

命名空间/包名称: magent.mancala

类/类型: MancalaEnv

方法/功能: get_action_mask_with_no_pie

hotexamples.com的示例: 2

Python MancalaEnv.get_action_mask_with_no_pie - 已找到2个示例。这些是从开源项目中提取的最受好评的magent.mancala.MancalaEnv.get_action_mask_with_no_pie现实Python示例。您可以评价示例，以帮助我们提高示例质量。

常用方法

显示隐藏

MancalaEnv(10)

clone(7)

get_legal_moves(4)

perform_move(3)

compute_end_game_reward(2)

get_action_mask_with_no_pie(2)

is_game_over(2)

is_legal(2)

next_states(2)

get_player_utility(1)

get_winner(1)

is_legal_action(1)

make_move(1)

示例#1

显示文件

    def evaluate_state(self, env: MancalaEnv) -> (float, float):
        flip_board = env.side_to_move == Side.NORTH
        state = env.board.get_board_image(flipped=flip_board)
        mask = env.get_action_mask_with_no_pie()
        dist, _, value = self.network.evaluate_move(state=state, mask=mask)

        return dist, float(value)

示例#2

显示文件

 def sample_state(self, env: MancalaEnv) -> (int, float):
     flip_board = env.side_to_move == Side.NORTH
     state = env.board.get_board_image(flipped=flip_board)
     mask = env.get_action_mask_with_no_pie()
     return self.network.sample(state=state, mask=mask)