Python Quoridor.all_legal_movesの例

プログラミング言語: Python

名前空間/パッケージ名: quoridor

クラス/型: Quoridor

メソッド/関数: all_legal_moves

hotexamples.comのコード掲載数: 2

Python Quoridor.all_legal_moves - 2件のコード例が見つかりました。すべてオープンソースプロジェクトから抽出されたPythonのquoridor.Quoridor.all_legal_movesの実例で、最も評価が高いものを厳選しています。コード例の評価を行っていただくことで、より質の高いコード例が表示されるようになります。

よく使われるメソッド

表示非表示

Quoridor(30)

placer_mur(4)

jouer_coup(3)

hash_key(3)

take_action(2)

all_legal_moves(2)

alter(2)

start_self_play(2)

valid_actions(1)

temp_move(1)

step(1)

start_test_play(1)

print_board(1)

pos2(1)

placer_mur_auto(1)

jouer_manuel_graph(1)

hori(1)

jouer_auto_graph(1)

jouer_auto_console(1)

__str__(1)

has_a_winner(1)

get_winner(1)

get_shortest_path(1)

exec_move(1)

current_player(1)

check_end(1)

afficher(1)

actions(1)

_self_loc(1)

_oppo_loc(1)

ver(1)

コード例 #1

ファイルを表示

ファイル: mcts.py プロジェクト: wrongu/QuoridorV2

 def __init__(self, game_state:Quoridor, policy_output, value_output):
     # _counts is the number of times we've taken some action *from this state*. Initialized to all zeros. Stored
     # as a torch tensor over all possible actions, to be later masked with the set of legal actions
     self._counts = torch.zeros(3, 9, 9)
     self._total_reward = torch.zeros(3, 9, 9)
     self._policy = policy_output
     self._value = value_output
     self._legal_mask = encode_actions_to_planes(game_state.all_legal_moves(), game_state.current_player)
     self._player = game_state.current_player
     self._key = game_state.hash_key()
     self._children = {}
     self.__flagged = False

コード例 #2

ファイルを表示

                              col) + "v"

    if temperature < 1e-6:
        # Do max operation instead of unstable low-temperature manipulations
        idx = torch.argmax(policy_planes)
    else:
        idx = torch.multinomial(policy_planes.flatten()**temperature,
                                num_samples=1)
    return _idx_to_action(idx.item())


if __name__ == '__main__':
    # mini test
    q = Quoridor()

    legal_moves = q.all_legal_moves(partial_check=False)
    print("INITIAL STATE LEGAL MOVES ({} of them):".format(len(legal_moves)))
    print(legal_moves)

    for mv in legal_moves:
        planes = encode_actions_to_planes(mv, q.current_player)
        print("=========== {} ============".format(mv))
        print(planes)
        mv2 = sample_action(planes, 0)
        print(mv2)
        assert mv2 == mv, "Failed to encode/decode {}".format(mv)

    # Test that just sampling random moves leads to some illegal moves getting selected (this is expected)
    random_actions, masked_random_actions = [''] * 100, [''] * 100
    legal_mask = encode_actions_to_planes(legal_moves, q.current_player)
    for i in range(100):