import numpy as np

def build_policy(self, action, flip):
    labels_n = len(ActionLabelsRed)
    move_lookup = {move: i for i, move in enumerate(ActionLabelsRed)}
    # One-hot policy vector: all mass on the chosen action.
    policy = np.zeros(labels_n)
    policy[move_lookup[action]] = 1
    # Mirror the vector when the move was made from black's perspective.
    if flip:
        policy = flip_policy(policy)
    return policy
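For context, `flip_policy` permutes the probability vector so that it stays aligned with the red-perspective label list after a side swap. The following is a minimal sketch, assuming `flip_move()` is a hypothetical helper that mirrors a single move string to the other side of the board; it is not the repo's actual implementation.

# --- Sketch of flip_policy (assumed helper; flip_move is hypothetical) ---
_move_to_index = {m: i for i, m in enumerate(ActionLabelsRed)}
_flipped_index = [_move_to_index[flip_move(m)] for m in ActionLabelsRed]

def flip_policy(policy):
    # Entry i takes the probability of the mirrored move, keeping the
    # vector indexed by ActionLabelsRed after the side swap.
    return np.asarray([policy[i] for i in _flipped_index])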
def action(self, env: CChessEnv) -> str:
    # Run the MCTS search; `value` is unused here since resignation is disabled.
    value = self.search_moves(env)
    policy = self.calc_policy(env)  # `calc_policy` does not flip the policy
    # Flip to the mover's perspective when it is black's turn.
    if not env.red_to_move:
        pol = flip_policy(policy)
    else:
        pol = policy
    # Sample a move from the temperature-adjusted distribution (no resign).
    my_action = int(np.random.choice(range(self.labels_n),
                                     p=self.apply_temperature(pol, env.num_halfmoves)))
    # my_action = np.argmax(self.apply_temperature(pol, env.num_halfmoves))
    # Store the unflipped policy; no further flipping is needed at training time.
    self.moves.append([env.observation, list(policy)])
    return self.labels[my_action]
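`apply_temperature` presumably follows the usual AlphaZero move-selection schedule: the temperature tau decays with the half-move count, and once it is near zero the choice becomes greedy. A minimal sketch, assuming a `tau_decay_rate` field on `self.play_config` (an assumed config name):

def apply_temperature(self, policy, turn):
    # Anneal tau toward 0 as the game progresses
    # (tau_decay_rate is an assumed knob, e.g. 0.99).
    tau = np.power(self.play_config.tau_decay_rate, turn + 1)
    if tau < 0.1:
        # Effectively zero temperature: deterministic argmax play.
        ret = np.zeros(self.labels_n)
        ret[np.argmax(policy)] = 1.0
        return ret
    # Sharpen the distribution by exponent 1/tau and renormalize so it
    # stays a valid probability vector for np.random.choice.
    ret = np.power(policy, 1.0 / tau)
    return ret / np.sum(ret)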