Esempi in Python per GoGame.get_batch_next_states

Linguaggio di programmazione: Python

Spazio dei nomi/nome del pacchetto: gym_go.gogame

Classe/tipologia: GoGame

Metodo/funzione: get_batch_next_states

Esempi su hotexamples.com: 2

GoGame.get_batch_next_states in Python: 2 esempi trovati. Questi sono i migliori esempi reali in Python per gym_go.gogame.GoGame.get_batch_next_states, estratti da progetti open source. Li puoi valutare, per aiutarci a migliorare la qualità dei nostri esempi.

Metodi utilizzati di frequente

Mostra Nascondi

get_game_ended(7)

get_init_board(6)

get_action_size(3)

get_areas(3)

get_canonical_form(3)

get_prev_player_passed(3)

get_turn(3)

get_batch_next_states(2)

get_children(2)

get_next_state(2)

GoGame(1)

get_next_states(1)

get_valid_moves(1)

get_winning(1)

str(1)

Esempio n. 1

Mostra file

    def step(self, action):
        '''
        Assumes the correct player is making a move. Black goes first.
        return observation, reward, done, info
        '''
        assert not self.done
        if isinstance(action, tuple) or isinstance(action, list) or isinstance(
                action, np.ndarray):
            assert 0 <= action[0] < self.size
            assert 0 <= action[1] < self.size
            action = self.size * action[0] + action[1]
        elif action is None:
            action = self.size**2

        actions = np.array([action])
        states, group_maps = GoGame.get_batch_next_states(
            self.state, actions, self.group_map)
        self.state, self.group_map = states[0], group_maps[0]
        self.done = GoGame.get_game_ended(self.state)
        return np.copy(
            self.state), self.get_reward(), self.done, self.get_info()

Esempio n. 2

Mostra file

    def step_batch(self, state, action):
        '''
        Assumes the correct player is making a move. Black goes first.
        return observation, reward, done, info
        But next step will not change the previous state
        '''
        assert not self.done
        if isinstance(action, tuple) or isinstance(action, list) or isinstance(
                action, np.ndarray):
            assert 0 <= action[0] < self.size
            assert 0 <= action[1] < self.size
            action = self.size * action[0] + action[1]
        elif action is None:
            action = self.size**2

        actions = np.array([action])
        next_states, next_group_maps = GoGame.get_batch_next_states(
            state, actions, self.group_map)
        next_state, next_group_map = next_states[0], next_group_maps[0]
        next_done = GoGame.get_game_ended(next_state)
        return np.copy(next_state), self.get_reward_batch(
            next_state, next_done), next_done, self.get_info_batch(next_state)