Python DQN.DQN_Agent Examples

Programming language: Python

Class/type: DQN

Method/function: DQN_Agent

Examples at hotexamples.com: 2

Python DQN.DQN_Agent: 2 examples found. These are top-rated real-world Python examples of DQN.DQN_Agent, extracted from open source projects.

Example #1

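            # Pick an action with epsilon-greedy exploration and convert it to the
            # one-hot list of button states passed to game.make_action.
            action = action_to_one_hot_bool(agent.select_epsilon_greedy_action(state, epsilon))
            # Repeat the action one tic at a time, pausing so the game can be watched.
            reward = 0
            for _ in range(frame_repeat):
                sleep(sleep_time)
                # Sum the per-tic rewards so none of the repeated frames is dropped.
                reward += game.make_action(action, 1)
                if game.is_episode_finished():
                    break
            done = game.is_episode_finished()
            # next_state is not assigned anywhere in this excerpt.
            state = next_state
            reward_sum += reward
            if done:
                print('episode:', episode, 'sum_of_rewards_for_episode:', reward_sum)
                total_reward.append(reward_sum)
                break
    return agent, total_reward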

if __name__ == "__main__":
    version = 'DDQN'  # either 'DQN' or 'DDQN'
    config_file = 'deadly_corridor'
    checkpoint_file = './results/dqn/agent1/cp-0001.ckpt'
    frame_repeat = 12
    episodes = 5
    # Machine-specific path to the ViZDoom scenario files; adjust for your install.
    config_path = 'D:/Joe/Anaconda3/envs/tensorflow_env/Lib/site-packages/vizdoom/scenarios/' + config_file + '.cfg'
    game = initialize_vizdoom(config_path)
    action_size = game.get_available_buttons_size()
    print('action_size', action_size)
    state_size = np.array(game.get_state().screen_buffer.shape)
    # Both modules expose a DQN_Agent class that takes the same arguments here.
    if version == 'DQN':
        agent = DQN.DQN_Agent(state_size, action_size, checkpoint_file=checkpoint_file)
    elif version == 'DDQN':
        agent = Double_DQN.DQN_Agent(state_size, action_size, checkpoint_file=checkpoint_file)
    else:
        raise ValueError('unknown version: ' + version)
    _, total_reward = run_agent(agent, game, frame_repeat)
    game.close()
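
Example #1 calls agent.select_epsilon_greedy_action and action_to_one_hot_bool without showing either. The following is a minimal sketch of what such helpers might look like, not the project's actual code: the names epsilon_greedy and one_hot_bool, their signatures, and the use of a plain list of Q-values are all assumptions made for illustration.

```python
import random


def epsilon_greedy(q_values, epsilon, rng=random):
    """Hypothetical helper: pick a random action index with probability
    epsilon, otherwise the index of the largest Q-value."""
    if rng.random() < epsilon:
        return rng.randrange(len(q_values))
    return max(range(len(q_values)), key=lambda i: q_values[i])


def one_hot_bool(index, n_actions):
    """Hypothetical helper: turn an action index into a list of booleans
    with a single True, one entry per available button."""
    return [i == index for i in range(n_actions)]


if __name__ == "__main__":
    q_values = [0.1, 0.9, 0.3]
    greedy = epsilon_greedy(q_values, epsilon=0.0)  # epsilon 0 is always greedy
    print(one_hot_bool(greedy, len(q_values)))
```

With epsilon set to 0 the choice is always greedy, which is a convenient setting when replaying a trained checkpoint as Example #1 does.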

Example #2

File: train_doom.py Project: Joe-Withers/Double_DQN_Doom

    # Hyperparameters
    batch_size = 6
    learning_rate = 0.001
    folder = str(learning_rate)
    replay_buffer_size = 100000
    m = 256
    episodes = 50
    n_agents = 1
    # versions, config_files and frame_repeat are defined outside this excerpt.
    for version in versions:
        for config_file in config_files:
            config_path = 'D:/Joe/Anaconda3/envs/tensorflow_env/Lib/site-packages/vizdoom/scenarios/' + config_file + '.cfg'
            all_total_rewards = []
            for n in range(n_agents):
                game = initialize_vizdoom(config_path)
                action_size = game.get_available_buttons_size()
                print('action_size', action_size)
                state_size = np.array(game.get_state().screen_buffer.shape)
                # Each agent writes to its own checkpoint directory.
                checkpoint_file = './agent' + str(n) + '/cp-9999.ckpt'
                if version == 'DQN':
                    agent = DQN.DQN_Agent(state_size, action_size, batch_size=batch_size,
                                          learning_rate=learning_rate,
                                          replay_buffer_size=replay_buffer_size,
                                          checkpoint_file=checkpoint_file)
                elif version == 'DDQN':
                    agent = Double_DQN.DQN_Agent(state_size, action_size, batch_size=batch_size,
                                                 learning_rate=learning_rate,
                                                 replay_buffer_size=replay_buffer_size,
                                                 checkpoint_file=checkpoint_file)
                else:
                    raise ValueError('unknown version: ' + version)
                _, total_reward = train_agent(agent, game, frame_repeat)
                all_total_rewards.append(total_reward)
                game.close()

                # Save after every agent so partial results survive an interrupted run.
                all_total_rewards_to_save = np.array(all_total_rewards)
                np.save('./all_total_rewards_' + config_file + '_' + version + '.npy', all_total_rewards_to_save)
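
Both examples switch between DQN.DQN_Agent and Double_DQN.DQN_Agent, but neither excerpt shows how the two agents differ internally. As a rough illustration only, the sketch below contrasts the textbook bootstrap targets of DQN and Double DQN for a single transition. The function names, the list-based Q-values, and the discount factor gamma are assumptions for illustration; the project's own implementation may differ.

```python
def dqn_target(reward, done, gamma, next_q_target):
    """Standard DQN target: the target network both selects and
    evaluates the best next action."""
    if done:
        return reward
    return reward + gamma * max(next_q_target)


def double_dqn_target(reward, done, gamma, next_q_online, next_q_target):
    """Double DQN target: the online network selects the next action,
    the target network evaluates it."""
    if done:
        return reward
    best = max(range(len(next_q_online)), key=lambda i: next_q_online[i])
    return reward + gamma * next_q_target[best]


if __name__ == "__main__":
    next_q_online = [5.0, 1.0]  # online network prefers action 0
    next_q_target = [2.0, 4.0]  # target network prefers action 1
    print(dqn_target(1.0, False, 0.5, next_q_target))
    print(double_dqn_target(1.0, False, 0.5, next_q_online, next_q_target))
```

Decoupling action selection from action evaluation is the usual motivation for Double DQN, since taking the maximum over noisy estimates tends to overestimate values.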