Exemplos de action_index em Python

Linguagem de programação: Python

Espaço para nome / nome do pacote: pypokerai.value_function

Método / Função: action_index

Exemplos em hotexamples.com: 3

action_index em Python - 3 exemplos encontrados. Esses são os exemplos do mundo real mais bem avaliados de pypokerai.value_function.action_index em Python extraídos de projetos de código aberto. Você pode avaliar os exemplos para nos ajudar a melhorar a qualidade deles.

Exemplo n.º 1

0

Exibir arquivo

def backup_on_minibatch(self, q_network, backup_minibatch): X = np.array([self.delegate.construct_features(state, action)[0] for state, action, target in backup_minibatch]) Y_info = [(action, target) for _state, action, target in backup_minibatch] Y = q_network.predict_on_batch(X) assert len(Y) == len(Y_info) for y, (action, target) in zip(Y, Y_info): y[action_index(action)] = target loss = q_network.train_on_batch(X, Y)

Exemplo n.º 2

0

Exibir arquivo

Arquivo: deep_sarsa_trial.py Projeto: ishikota/PyPokerAI

def backup_on_minibatch(self, backup_minibatch): X = np.array([self.delegate.construct_features(state, action)[0] for state, action, target in backup_minibatch]) Y_info = [(action, target) for _state, action, target in backup_minibatch] Y = self.delegate.model.predict_on_batch(X) assert len(Y) == len(Y_info) for y, (action, target) in zip(Y, Y_info): y[action_index(action)] = target loss = self.delegate.model.train_on_batch(X, Y) self.delegate.loss_history.append(loss) self.delegate.prediction_cache = (None, None)

Exemplo n.º 3

0

Exibir arquivo

def predict_value_by_network(self, network, state, action): X, action = self.delegate.construct_features(state, action) values = network.predict_on_batch(np.array([X]))[0].tolist() valur_for_action = values[action_index(action)] return valur_for_action