Example #1
 # Our goal is to keep the pole upright for as long as possible: every frame
 # the pole survives adds 1 to the score, and CartPole-v1 ends an episode at
 # a score of 500, so the loop bound of 700 is just a generous upper limit
 for time in range(700):
     env.render()
     # Decide on an action: pass the state vector (state_size values) through
     # the network and take the action with the highest predicted Q-value,
     # or a random action with probability epsilon
     action = agent.act(state)
     # Advance the game to the next frame based on the action.
     # Reward is 1 for every frame the pole survived
     next_state, reward, done, _ = env.step(action)
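     # penalize failure: override the reward with -10 whenever the episode
     # ends, so the agent learns to avoid states that let the pole fall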
     reward = reward if not done else -10
     # reshape next_state into a (1, state_size) row vector: the model expects
     # a batch dimension on its input, so a single observation becomes a batch
     # of size one before it is fed to the network
     next_state = np.reshape(next_state, [1, state_size])
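     # e.g. CartPole's observation [cart position, cart velocity, pole angle,
     # pole angular velocity] has shape (4,); after the reshape it is (1, 4)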
     # Remember the previous state, action, reward, and done
     agent.remember(state, action, reward, next_state, done)
     # make next_state the new current state for the next frame.
     state = next_state
     # done becomes True when the episode ends,
     # e.g. when the pole falls past the failure angle
     if done:
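         # episode over: sync the target network with the online network so
         # future TD targets are computed from the latest weights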
         agent.update_target_model()
         print("episode: {}/{}, score: {}, e: {:.2}"
               .format(e, EPISODES, time, agent.epsilon))
         break
     if len(agent.memory) > batch_size:
         # train the agent by replaying a random minibatch of stored experience
         agent.replay(batch_size)
         # to monitor training, have replay() return its loss and log it
         # every 10 timesteps, for example:
         # loss = agent.replay(batch_size)
         # if time % 10 == 0:
         #     print("episode: {}/{}, time: {}, loss: {:.4f}"
         #           .format(e, EPISODES, time, loss))
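
For context, this loop relies on an env, an agent, and a few names (state, state_size, batch_size, e, EPISODES) that are created before it runs. Below is a minimal sketch of that scaffolding, assuming the common Keras DQN pattern: a replay memory, an epsilon-greedy act(), a target network, and a replay() step that fits the online network toward one-step TD targets. The layer sizes and hyperparameters are illustrative guesses rather than values from the example, and the sketch assumes the classic Gym API where env.reset() returns the raw state and env.step() returns a 4-tuple.

 import random
 from collections import deque

 import gym
 import numpy as np
 from tensorflow.keras.models import Sequential
 from tensorflow.keras.layers import Dense
 from tensorflow.keras.optimizers import Adam

 EPISODES = 1000  # illustrative; any episode budget works

 class DQNAgent:
     def __init__(self, state_size, action_size):
         self.state_size = state_size
         self.action_size = action_size
         self.memory = deque(maxlen=2000)  # replay buffer of (s, a, r, s', done)
         self.gamma = 0.95                 # discount factor (illustrative)
         self.epsilon = 1.0                # exploration rate, decays over time
         self.epsilon_min = 0.01
         self.epsilon_decay = 0.995
         self.model = self._build_model()         # online network
         self.target_model = self._build_model()  # target network
         self.update_target_model()

     def _build_model(self):
         # small MLP mapping a state vector to one Q-value per action
         model = Sequential([
             Dense(24, input_dim=self.state_size, activation='relu'),
             Dense(24, activation='relu'),
             Dense(self.action_size, activation='linear'),
         ])
         model.compile(loss='mse', optimizer=Adam(learning_rate=0.001))
         return model

     def update_target_model(self):
         # copy the online network's weights into the target network
         self.target_model.set_weights(self.model.get_weights())

     def remember(self, state, action, reward, next_state, done):
         self.memory.append((state, action, reward, next_state, done))

     def act(self, state):
         # epsilon-greedy: explore with probability epsilon, otherwise take
         # the action with the highest predicted Q-value
         if np.random.rand() <= self.epsilon:
             return random.randrange(self.action_size)
         q_values = self.model.predict(state, verbose=0)
         return np.argmax(q_values[0])

     def replay(self, batch_size):
         # sample a random minibatch and fit the online network toward the
         # one-step TD target computed with the target network
         minibatch = random.sample(self.memory, batch_size)
         for state, action, reward, next_state, done in minibatch:
             target = reward
             if not done:
                 target = reward + self.gamma * np.amax(
                     self.target_model.predict(next_state, verbose=0)[0])
             target_f = self.model.predict(state, verbose=0)
             target_f[0][action] = target
             self.model.fit(state, target_f, epochs=1, verbose=0)
         if self.epsilon > self.epsilon_min:
             self.epsilon *= self.epsilon_decay

 env = gym.make('CartPole-v1')
 state_size = env.observation_space.shape[0]  # 4 observations for CartPole
 action_size = env.action_space.n             # 2 actions: push left or right
 agent = DQNAgent(state_size, action_size)
 batch_size = 32

 for e in range(EPISODES):
     state = env.reset()
     state = np.reshape(state, [1, state_size])
     # ... the per-frame loop from Example #1 runs here ...

With this scaffolding in place, the per-frame loop from Example #1 drops unchanged into the body of the episode loop, right after the initial reshape of state.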