Exemplos de ActionHandler.set_legal_actions em Python

Linguagem de programação: Python

Espaço para nome / nome do pacote: reinforcepy.handlers

Classe / Tipo: ActionHandler

Método / Função: set_legal_actions

Exemplos em hotexamples.com: 3

ActionHandler.set_legal_actions em Python - 3 exemplos encontrados. Esses são os exemplos do mundo real mais bem avaliados de reinforcepy.handlers.ActionHandler.set_legal_actions em Python extraídos de projetos de código aberto. Você pode avaliar os exemplos para nos ajudar a melhorar a qualidade deles.

Métodos Frequentes

Exibir Ocultar

ActionHandler(5)

anneal_to(3)

get_action(3)

get_random(3)

action_vect_to_game_action(2)

set_legal_actions(2)

anneal(1)

curr_rand_val(1)

game_action_to_action_ind(1)

Métodos Frequentes

ActionHandler (5)

anneal_to (3)

get_action (3)

get_random (3)

action_vect_to_game_action (2)

set_legal_actions (2)

anneal (1)

curr_rand_val (1)

game_action_to_action_ind (1)

Exemplo n.º 1

0

Exibir arquivo

Arquivo: test_actionhandler.py Projeto: tonylibing/reinforcepy

def test_set_legal_actions(action_handler: ActionHandler): # test to make sure action raises error on matrix input with pytest.raises(AssertionError): action_handler.set_legal_actions([[0, 2, 4, 6]]) action_handler.set_legal_actions([0, 2, 4, 6]) assert action_handler.numActions == 4

Exemplo n.º 2

0

Exibir arquivo

Arquivo: test_actionhandler.py Projeto: Islandman93/reinforcepy

def test_set_legal_actions(action_handler: ActionHandler): # test to make sure action raises error on matrix input with pytest.raises(AssertionError): action_handler.set_legal_actions([[0, 2, 4, 6]]) action_handler.set_legal_actions([0, 2, 4, 6]) assert action_handler.numActions == 4

Exemplo n.º 3

0

Exibir arquivo

Arquivo: AsyncA3CLearner.py Projeto: pratikgadiya12/reinforcepy

class AsyncProcessA3CLearner(AsyncProcessClient): def __init__(self, num_actions, initial_cnn_values, cnn_partial, pipe, skip_frame=4, phi_length=4, async_update_step=5): super().__init__(pipe) # A3C doesn't have an EGreedy exploration policy so we set the random values to 0 self.action_handler = ActionHandler((0, 0, 2)) # initialize network self.cnn = cnn_partial() self.cnn.set_parameters(initial_cnn_values) self.frame_buffer = np.zeros((1, phi_length, 84, 84), dtype=np.float32) self.skip_frame = skip_frame self.phi_length = phi_length self.loss_list = list() self.async_update_step = async_update_step def add_state_to_buffer(self, state): self.frame_buffer[0, 0:self.phi_length-1] = self.frame_buffer[0, 1:self.phi_length] self.frame_buffer[0, self.phi_length-1] = state def frame_buffer_with(self, state): empty_buffer = np.zeros((1, self.phi_length, 84, 84), dtype=np.float32) empty_buffer[0, 0:self.phi_length-1] = self.frame_buffer[0, 1:self.phi_length] empty_buffer[0, self.phi_length-1] = state return empty_buffer def get_action(self, frame_buffer): return self.cnn.get_policy_output(frame_buffer)[0] def get_game_action(self, frame_buffer): action = self.get_action(frame_buffer) return self.action_handler.action_vect_to_game_action(action, random=False) def set_legal_actions(self, legal_actions): self.action_handler.set_legal_actions(legal_actions)