Python DQNAgent 예제들

프로그래밍 언어: Python

네임스페이스/패키지 이름: EURUSDagent

클래스/타입: DQNAgent

hotexamples.com에서의 예제들: 2

Python DQNAgent - 2개의 예제가 발견되었습니다. 이것들은 오픈소스 프로젝트에서 추출된 Python의 EURUSDagent.DQNAgent에 대한 실세계 최고 등급의 예제들입니다. 예제들을 평가하여 예제의 품질 향상에 도움을 줄 수 있습니다.

자주 사용되는 메소드들

보기 숨기기

act(4)

DQNAgent(1)

load(1)

remember(1)

replay(1)

save(1)

update_target_model(1)

예제 #1

파일 보기

파일: train.py 프로젝트: sarachmax/My_Test_Model_1

def watch_result(episode, s_time, e_time, c_index, all_index, action, reward,
                 profit):
    print('-------------------- Check -------------------------')
    print('start time: ' + s_time)
    print('counter : ', c_index, '/', all_index, ' of episode : ', episode,
          '/', EPISODES)
    print('action : ', action)
    print('current profit : ', profit * MARGIN)
    print('reward (all profit): ', reward)
    print('end_time: ' + e_time)
    print('-------------------End Check -----------------------')


if __name__ == "__main__":

    agent = DQNAgent(state_size)
    #agent.load("agent_model.h5")
    num_index = all_index - state_size
    env = TrainEnvironment(X_train, num_index)
    batch_size = 32
    for e in range(EPISODES):
        state = env.reset()
        state = np.reshape(state, (1, state_size, 1))

        for t in range(end_index - start_index):
            start_time = str(datetime.datetime.now().time())
            action = agent.act(state)
            next_state, reward, done = env.step(action)
            next_state = np.reshape(next_state, (1, state_size, 1))
            agent.remember(state, action, reward, next_state, done)
            state = next_state

예제 #2

파일 보기

def watch_result(episode, s_time, e_time, c_index, all_index, action, reward,
                 profit):
    print('-------------------- Check -------------------------')
    print('start time: ' + s_time)
    print('counter : ', c_index, '/', all_index, ' of episode : ', episode,
          '/', EPISODES)
    print('action : ', action)
    print('current profit : ', profit * MARGIN)
    print('reward (all profit): ', reward)
    print('end_time: ' + e_time)
    print('-------------------End Check -----------------------')


if __name__ == "__main__":

    agent = DQNAgent(state_size)
    agent.load("agent_model.h5")
    num_index = all_index - state_size
    env = TrainEnvironment(X_train, num_index)
    batch_size = 3
    test_profit = []

    for e in range(EPISODES):
        state = env.reset()
        state = np.reshape(state, (1, state_size, 1))
        test_profit = []
        for t in range(end_index - start_index):
            start_time = str(datetime.datetime.now().time())
            action = agent.act(state, False)  # test
            next_state, reward, done = env.step(action)
            next_state = np.reshape(next_state, (1, state_size, 1))