def markovDecision(layout, circle, n_episodes=50):
    """Run a random-policy agent on a Snakes-and-Ladders MDP.

    Builds a ``SnakesAndLadder`` environment from the given board and
    trains/updates a ``RandomAgent`` over ``n_episodes`` episodes.

    Parameters
    ----------
    layout : board layout, passed through to ``SnakesAndLadder``.
    circle : circular-board flag, passed through to ``SnakesAndLadder``.
    n_episodes : int, optional
        Number of episodes to run (default 50, the value previously
        hard-coded inside the function).

    NOTE(review): despite the name, this returns nothing — presumably it
    should return expected costs / a policy; confirm against callers.
    """
    env = SnakesAndLadder(layout, circle)
    agent = RandomAgent(env.action_space)
    for _ in range(n_episodes):
        state = env.reset()
        done = False
        while not done:
            action = agent.select_action(state)
            # Env follows a gym-like (state, reward, done) step contract.
            next_state, reward, done = env.step(action)
            agent.update(state, action, reward, next_state)
            state = next_state
# Iterated two-player game: a mostly-cooperating agent against a
# mostly-defecting one, played on `env` (defined elsewhere in this file).
possible_actions = [0, 1]  # Cooperate or Defect
cooperator = RandomAgent(possible_actions, p=0.9)
defector = RandomAgent(possible_actions, p=0.1)

# Stateless interactions (agents do not have memory).
s = None
n_iter = 1000
for i in range(n_iter):
    # A full episode:
    done = False
    while not done:
        # Agents decide.
        a0 = cooperator.act()
        a1 = defector.act()
        # World changes.
        new_s, (r0, r1), done, _ = env.step(([a0], [a1]))
        # Agents learn — each update sees (own action, opponent action)
        # and (own reward, opponent reward) from its own perspective.
        cooperator.update(s, (a0, a1), (r0, r1), new_s)
        defector.update(s, (a1, a0), (r1, r0), new_s)
        s = new_s
    # Rewards from the final step of the episode.
    print(r0, r1)
    # FIX: the original called env.reset() but discarded its return value,
    # so `s` carried the previous episode's terminal state into the next
    # episode's first update. reset() returns the initial state here
    # (see markovDecision: `state = env.reset()`).
    s = env.reset()