Exemplos de ReinforcementAgent.getLegalActions em Python

Linguagem de programação: Python

Espaço para nome / nome do pacote: learningAgents

Classe / Tipo: ReinforcementAgent

Método / Função: getLegalActions

Exemplos em hotexamples.com: 3

ReinforcementAgent.getLegalActions em Python - 3 exemplos encontrados. Esses são os exemplos do mundo real mais bem avaliados de learningAgents.ReinforcementAgent.getLegalActions em Python extraídos de projetos de código aberto. Você pode avaliar os exemplos para nos ajudar a melhorar a qualidade deles.

Métodos Frequentes

Exibir Ocultar

__init__(30)

final(6)

getLegalActions(3)

doAction(1)

registerInitialState(1)

startEpisode(1)

Métodos Frequentes

__init__ (30)

final (6)

getLegalActions (3)

doAction (1)

registerInitialState (1)

startEpisode (1)

Exemplo n.º 1

0

Exibir arquivo

Arquivo: qlearningAgents.py Projeto: lm5lm5/CS3243_Project_2

def computeValueFromQValues(self, state): """ Returns max_action Q(state,action) where the max is over legal actions. Note that if there are no legal actions, which is the case at the terminal state, you should return a value of 0.0. """ actions = ReinforcementAgent.getLegalActions(self, state) if len(actions) < 1: return 0.0 else: ## max value among all actions max = -float("inf") for action in actions: val = self.getQValue(state, action) if max < val: max = val return max

Exemplo n.º 2

0

Exibir arquivo

Arquivo: qlearningAgents.py Projeto: lm5lm5/CS3243_Project_2

def computeActionFromQValues(self, state): """ Compute the best action to take in a state. Note that if there are no legal actions, which is the case at the terminal state, you should return None. """ actions = ReinforcementAgent.getLegalActions(self, state) if len(actions) < 1: return None else: max = -float("inf") action = None for a in actions: val = self.getQValue(state, a) if max < val: max = val action = a return action

Exemplo n.º 3

0

Exibir arquivo

Arquivo: qlearningAgents.py Projeto: lm5lm5/CS3243_Project_2

def getAction(self, state): """ Compute the action to take in the current state. With probability self.epsilon, we should take a random action and take the best policy action otherwise. Note that if there are no legal actions, which is the case at the terminal state, you should choose None as the action. HINT: You might want to use util.flipCoin(prob) HINT: To pick randomly from a list, use random.choice(list) """ # Pick Action legalActions = ReinforcementAgent.getLegalActions(self, state) if len(legalActions) == 0: return None else: isRandom = util.flipCoin(self.epsilon) if isRandom: return random.choice(legalActions) else: return self.computeActionFromQValues(state)