Python Agent.max_action Examples

Programming language: Python

Namespace/package name: agents.Agent

Class/type: Agent

Method/function: max_action

Examples at hotexamples.com: 2

Python Agent.max_action - 2 examples found. These are the top-rated real-world Python examples of agents.Agent.Agent.max_action extracted from open source projects.

Example #1

File: Pursuit.py Project: mike-gimelfarb/mfpy

    def act(self, Q: Agent, task: Task, state):

        # set the number of actions of the current task, if not set
        if self.valid_actions == 0:
            self.valid_actions = task.valid_actions()

        # get the distribution over actions for the current state
        pref = self.preferences[state]

        # sample an action from the preference distribution
        action = np.random.choice(self.valid_actions, 1, p=pref)

        # get the greedy action according to Q
        greedy = Q.max_action(state)

        # update the preference distribution
        pref *= (1.0 - self.beta)
        pref[greedy] /= (1.0 - self.beta)
        pref[greedy] += self.beta * (1.0 - pref[greedy])

        return action
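
The three update lines in Example #1 are algebraically the same as moving the preference vector a fraction beta toward a one-hot vector on the greedy action, so the result still sums to 1. A minimal standalone sketch of that step follows; the function name `pursuit_update` and the sample values are assumptions made for illustration and are not part of mfpy.

```python
import numpy as np


def pursuit_update(pref: np.ndarray, greedy: int, beta: float) -> np.ndarray:
    """Return a new preference vector moved a fraction beta toward the greedy action.

    Hypothetical helper: equivalent to scaling every entry by (1 - beta)
    and then adding beta to the greedy entry.
    """
    target = np.zeros_like(pref)
    target[greedy] = 1.0
    return (1.0 - beta) * pref + beta * target


# Illustrative values only: four actions, uniform initial preferences.
pref = np.array([0.25, 0.25, 0.25, 0.25])
updated = pursuit_update(pref, greedy=2, beta=0.1)
```

Unlike the in-place update in the example, this sketch returns a new array and leaves its input unchanged, which makes the step easier to test in isolation.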

Example #2

    def epsilon_greedy(self, Q: Agent, task: Task, state, epsilon):
        if np.random.rand() <= epsilon:
            return random.randrange(task.valid_actions())
        else:
            return Q.max_action(state)
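
Neither example shows what `max_action` does internally. The sketch below assumes it returns the index of the largest Q-value for a state, which is the usual meaning of a greedy action. `TabularAgent`, its `q_table` attribute, and the standalone `epsilon_greedy` function are hypothetical stand-ins and not the actual `agents.Agent` implementation.

```python
import numpy as np


class TabularAgent:
    """Hypothetical stand-in for agents.Agent: stores Q-values in a 2-D table."""

    def __init__(self, q_table: np.ndarray):
        self.q_table = q_table  # assumed shape: (num_states, num_actions)

    def max_action(self, state: int) -> int:
        # Index of the largest Q-value in this state's row (first one on ties).
        return int(np.argmax(self.q_table[state]))


def epsilon_greedy(agent, num_actions, state, epsilon, rng):
    # Explore with probability epsilon, otherwise exploit the greedy action.
    if rng.random() < epsilon:
        return int(rng.integers(num_actions))
    return agent.max_action(state)


# Illustrative values only: two states, three actions.
rng = np.random.default_rng(0)
agent = TabularAgent(np.array([[0.1, 0.9, 0.3], [0.5, 0.2, 0.4]]))
greedy_choice = epsilon_greedy(agent, 3, state=0, epsilon=0.0, rng=rng)
```

With `epsilon=0.0` the exploration branch never fires, so the call always returns the greedy action for the given state. Passing an explicit `numpy.random.Generator` is a design choice of this sketch that keeps runs reproducible.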