    # dqn.load_weights(checkpoint_weights_filename)
    # elif os.path.isfile(weights_filename):
    #     print("Loading previous weights...")
    #     dqn.load_weights(weights_filename)
    dqn.fit(env, callbacks=callbacks, nb_steps=20000000, log_interval=10000)

    # After training is done, we save the final weights one more time.
    dqn.save_weights(weights_filename, overwrite=True)

    # Finally, evaluate our algorithm for 10 episodes.
    dqn.test(env, nb_episodes=10, visualize=False)
elif args.mode == 'test':
    weights_filename = 'wts/dqn_{}_weights_12000000_phyran.h5f'.format(args.env_name)
    if args.weights:
        weights_filename = args.weights
    print(env.unwrapped.get_action_meanings())
    np.random.seed(None)
    env.seed(None)
    dqn.load_weights(weights_filename)
    dqn.training = False
    # Use a small epsilon in the test policy to avoid getting stuck.
    dqn.test_policy = EpsilonPhysicsPolicy(eps_phy=0.01, eps_ran=0.00)
    # Record a video of every test episode.
    env = gym.wrappers.Monitor(env, "records/", video_callable=lambda episode_id: True, force=True)
    dqn.test(env, nb_episodes=100, visualize=False)
    env.close()
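# Note: EpsilonPhysicsPolicy above is a project-specific test policy that is not
# defined in this snippet. The following is a rough, hypothetical sketch only
# (the class name TwoEpsilonTestPolicy, the heuristic_action argument, and the
# mixing logic are assumptions, not the actual implementation) of how a policy of
# that shape subclasses rl.policy.Policy and overrides select_action() in keras-rl:
import numpy as np
from rl.policy import Policy


class TwoEpsilonTestPolicy(Policy):
    """Mix a fixed heuristic action, a random action, and the greedy Q action."""

    def __init__(self, eps_phy=0.01, eps_ran=0.0, heuristic_action=0):
        super(TwoEpsilonTestPolicy, self).__init__()
        self.eps_phy = eps_phy                    # probability of the heuristic ("physics") action
        self.eps_ran = eps_ran                    # probability of a uniformly random action
        self.heuristic_action = heuristic_action  # placeholder index for the heuristic action

    def select_action(self, q_values):
        nb_actions = q_values.shape[0]
        roll = np.random.uniform()
        if roll < self.eps_phy:
            return self.heuristic_action
        if roll < self.eps_phy + self.eps_ran:
            return np.random.randint(0, nb_actions)
        return int(np.argmax(q_values))
# Such a policy would then be assigned to dqn.test_policy exactly as
# EpsilonPhysicsPolicy is assigned above.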
## Init RL agent
agent = DQNAgent(model=model, nb_actions=nb_actions, memory=memory,
                 nb_steps_warmup=1000, target_model_update=1e-2,
                 policy=policy, processor=MultiInputProcessor(2),
                 # enable_dueling_network=True, dueling_type='avg'
                 )
agent.compile(Adam(lr=1e-3), metrics=['mae'])

## Comment out this line if you want to start learning from scratch
agent.load_weights('{p}/dqn_{fn}_weights.h5f'.format(p=PATH, fn=ENV_NAME))

## Train or evaluate
if TRAIN:
    agent.training = True
    observation = market.reset()
    while True:
        try:
            # TODO: add callbacks?
            ## The agent selects an action
            # (candles=9(mb=>(2,4)?), tickers=4, trades=2)
            # TODO: actions for a multi-symbol market
            action = agent.forward(observation)
            ## Execute the action
            observation, reward, done, info = market.step([action])