Python DQNAgent.loss Examples

Programming Language: Python

Namespace/Package Name: dqn

Class/Type: DQNAgent

Method/Function: loss

Examples at hotexamples.com: 1

Python DQNAgent.loss - 1 examples found. These are the top rated real world Python examples of dqn.DQNAgent.loss extracted from open source projects. You can rate examples to help us improve the quality of examples.

Frequently Used Methods

Show Hide

DQNAgent(30)

act(13)

load(11)

compile(8)

fit(5)

save(5)

train(5)

replay(5)

test(4)

save_weights(4)

remember(4)

get_action(4)

load_model(4)

actDeterministically(4)

epsilon(3)

save_model(3)

load_weights(3)

target_model(2)

observe(2)

start(2)

get_last_observations(2)

end(2)

train_one_episode(1)

train_model(1)

trainAgent(1)

train_only(1)

update_epoch(1)

update_replay_memory(1)

test_one_episode(1)

test_model(1)

update_target(1)

store_transition(1)

train_rnn(1)

testAgent(1)

update_target_model(1)

train_vae(1)

training(1)

restart_epoch(1)

store_experience(1)

load_state_dict(1)

__init__(1)

act_2(1)

append_sample(1)

backword(1)

fill_memory(1)

get_test_loss(1)

learn(1)

loss(1)

step(1)

parameters(1)

Example #1

Show file

File: longshort.py Project: Tuaman/CS229project

                                    price,
                                    action,
                                    reward,
                                    loss=agent.loss)
                        #print(action, reward)

                    state = next_state
                    if done:
                        print('start', env.start, 'previous', (cash, nown),
                              'current', tuple(env.holdings))
                        print("episode: {}/{}, score: {}, e: {:.5}".format(
                            e, EPISODES, time, agent.epsilon))
                        print('average_loss =', agent.loss / env.init['span'])
                        f.write(str(agent.loss) + '\n')
                        f.flush()
                        agent.loss = 0
                        if e % 2 == 0:
                            grapher.show(action_labels=env.action_labels,
                                         ep=e,
                                         t=time,
                                         e=agent.epsilon)
                            grapher.reset()
                            agent.save(save_string)
                        break
                    # if len(agent.memory) > batch_size:
                    #     agent.replay(batch_size)

                # Test
                if e % 2 == 0:
                    state = test_env.reset()
                    state = np.reshape(state, [1, state_size])