Python HRL.augment_goals示例

编程语言: Python

命名空间/包名称: hrl

类/类型: HRL

方法/功能: augment_goals

hotexamples.com的示例: 1

Python HRL.augment_goals - 已找到1个示例。这些是从开源项目中提取的最受好评的hrl.HRL.augment_goals现实Python示例。您可以评价示例，以帮助我们提高示例质量。

常用方法

显示隐藏

HRL(6)

close(5)

checkEscape(3)

flip(3)

newTexture(3)

act(1)

augment_goals(1)

changeBackground(1)

readButton(1)

remember(1)

replay(1)

writeResultLine(1)

示例#1

显示文件

def run():
    env = gym.make('CartPole-v0')
    env._max_episode_steps = None

    state_size = env.observation_space.shape[0]
    action_size = env.action_space.n
    agent = HRL(state_size, action_size)

    horizon = 20

    for e in range(agent.episodes):
        state = torch.tensor(env.reset(), dtype=torch.float)
        goal = torch.tensor([0.0, 0.0, 0.0, 0.0], dtype=torch.float)
        score = 0

        # Rollout
        for t in range(horizon):
            score += 1
            env.render()
            action = agent.act(state, goal,
                               torch.tensor([horizon - t], dtype=torch.float))
            next_state, _, done, _ = env.step(action)
            agent.remember(state, action, next_state, None, None, done)
            agent.augment_goals(state, action, next_state, done)
            state = next_state
            if done:
                print("episode: {}/{}, score: {}, e: {:.2}".format(
                    e, agent.episodes, score, agent.epsilon))
                break

        # Perform optimization
        for _ in range(agent.n):
            agent.replay()