Python DQNAgent.act Examples

Programming Language: Python

Namespace/Package Name: rl.agents.dqn

Class/Type: DQNAgent

Method/Function: act

Examples at hotexamples.com: 1

Python DQNAgent.act - 1 examples found. These are the top rated real world Python examples of rl.agents.dqn.DQNAgent.act extracted from open source projects. You can rate examples to help us improve the quality of examples.

Frequently Used Methods

Show Hide

DQNAgent(30)

compile(30)

load_weights(30)

fit(30)

save_weights(30)

test(30)

forward(7)

processor(3)

target_model(3)

compute_batch_q_values(3)

compute_q_values(2)

test_policy(2)

backward(2)

training(2)

policy(2)

select_action(1)

save_model(1)

reset_states(1)

replay(1)

remember(1)

reload_memory(1)

reload(1)

model(1)

process_state_batch(1)

modelfile(1)

X(1)

memoryfile(1)

learning(1)

get_config(1)

enable_dueling_network(1)

cmopile(1)

act(1)

_build_model(1)

__init__(1)

Y(1)

update_target_model(1)

Example #1

Show file

    smodel.summary()
    # agent.load("cartpole-dqn.h5")
    done = False
    batch_size = 32

    for e in range(EPISODES):
        # reset state in the beginning of each game
        state = env.reset()
        state = np.reshape(state, [1, state_size]) # turn the state into a one dimensional matrix which is a vector
        # time represents each frame of the game
        # Our goal is to keep the pole upright as long as possible until score of 500
        # the more time the more score
        for time in range(700):
            env.render()
            # Decide action
            action = agent.act(state) # maximum action ; pass our vector state to our NN in which we have state_size neurons
            # Advance the game to the next frame based on the action.
            # Reward is 1 for every frame the pole survived
            next_state, reward, done, _ = env.step(action)
            reward = reward if not done else -10
            # we are turning our next_state into a one dimensional matrix which is a vector
            # to calculate the maximum future reward for next state ; cause our model input 
            # is a one dimensional matrix which is a vector in which in our case is 4 neurons
            next_state = np.reshape(next_state, [1, state_size])
            # Remember the previous state, action, reward, and done
            agent.remember(state, action, reward, next_state, done)
            # make next_state the new current state for the next frame.
            state = next_state
            # done becomes True when the game ends
            # ex) The agent drops the pole
            if done: