# Tail of a DQNAgent(...) constructor call — the opening of the call is outside
# this chunk, so these keyword arguments are kept verbatim.
# Configures a dueling DQN ('avg' aggregation) with soft target-network updates
# (target_model_update=1e-3) and a 100-step warmup before learning starts.
          memory=memory, nb_steps_warmup=100, enable_dueling_network=True,
          dueling_type='avg', target_model_update=1e-3, policy=policy)

# Compile the agent with Adam (learning rate 1e-4), tracking mean absolute error.
# NOTE(review): `lr=` is the legacy Keras spelling; newer Keras uses
# `learning_rate=` — confirm against the Keras version this repo pins.
dqn.compile(Adam(lr=1e-4), metrics=['mae'])

# Okay, now it's time to learn something! We visualize the training here for show, but this
# slows down training quite a lot. You can always safely abort the training prematurely using
# Ctrl + C.
# NOTE(review): `learning` is a custom method (not stock keras-rl `fit`);
# `imitation_leaning_time` looks like a typo for "imitation_learning_time", but
# it must match the method's own signature — verify there before renaming.
# With imitation_leaning_time=0 and reinforcement_learning_time=1e10, the run is
# presumably pure reinforcement learning — confirm against the method's logic.
# nb_steps=5e6 is a float; the step-count comparison presumably tolerates it.
history = dqn.learning(env, Given_policy, policy_list, nb_steps=5e6,
                       visualize=False, log_interval=1000, verbose=2,
                       nb_max_episode_steps=1000, imitation_leaning_time=0,
                       reinforcement_learning_time=1e10)
# Persist the training history to a MATLAB .mat file under the run directory
# named "<ENV_NAME>-<nowtime>".
sio.savemat(ENV_NAME + '-' + nowtime + '/fit.mat', history.history)

# After training is done, we save the final weights.
dqn.save_weights(ENV_NAME + '-' + nowtime + '/fit-weights.h5f', overwrite=True)

# Finally, evaluate our algorithm for 10 episodes (with rendering enabled),
# capping each episode at 5000 steps.
history = dqn.test(env, nb_episodes=10, visualize=True, nb_max_episode_steps=5000)
# Persist the evaluation history alongside the training artifacts.
sio.savemat(ENV_NAME + '-' + nowtime + '/test.mat', history.history)