Example #1
File: Main.py  Project: fA1sEr/ADRQN2
                # Choose an action, step the environment, and store the transition
                action = agent.act(state)
                img_state, reward, done = game.make_action(action)
                if not done:
                    state_new = img_state
                else:
                    state_new = None  # terminal transitions have no successor state
                agent.add_transition(state, action, reward, state_new, done)
                state = state_new

                # Periodically train from replay memory and sync the target network
                if learning_step % UPDATE_FREQUENCY == 0:
                    agent.learn_from_memory()
                if learning_step % COPY_FREQUENCY == 0:
                    updateTarget(targetOps, SESSION)

                if done:
                    print("Epoch %d Train Game %d get %.1f" % (epoch, games_cnt, game.get_total_reward()))
                    break
            if SAVE_MODEL and games_cnt % 10 == 0:
                saver.save(SESSION, model_savefile)
                print("Saving the network weigths to:", model_savefile)

        print("\nTesting...")

        test_scores = []
        for test_step in range(EPISODES_TO_TEST):
            game.reset()
            agent.reset_cell_state()
            while not game.is_terminared():
                state = game.get_state()
                action = agent.act(state, train=False)
                game.make_action(action)
Example #2
# Build the target-network update ops and restore the trained model
trainables = tf.trainable_variables()

targetOps = updateTargetGraph(trainables, TAU)

print("Loading model from: ", model_savefile)
saver.restore(SESSION, model_savefile)

##########################################
print("\nTesting...")

test_scores = []

for test_step in range(EPISODES_TO_TEST):
    # Play one evaluation episode with a greedy policy (train=False)
    game.reset()
    agent.reset_cell_state()
    while not game.is_terminared():
        state = game.get_state()
        action = agent.act(state, train=False)
        game.make_action(action)
    now_score = game.get_total_reward()
    saveScore(now_score)
    test_scores.append(now_score)

# Append summary statistics of the test scores to the reward log
test_scores = np.array(test_scores)
my_file = open(reward_savefile, 'a')  # Name and path of the reward text file
my_file.write("%.1f (±%.1f)  min:%.1f  max:%.1f\n" %
              (test_scores.mean(), test_scores.std(), test_scores.min(),
               test_scores.max()))
my_file.close()
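
Examples #1 and #2 call updateTargetGraph and updateTarget without showing their definitions. The sketch below is an assumption, not the project's code: it follows the common TF1 soft-update pattern and assumes the online network's variables are created before the target network's, so the first half of tf.trainable_variables() belongs to the online network and the second half to the target network.

import tensorflow as tf

def updateTargetGraph(tfVars, tau):
    # Build ops that move each target-network variable a fraction tau
    # toward its corresponding online-network variable (soft update).
    total_vars = len(tfVars)
    op_holder = []
    for idx, var in enumerate(tfVars[0:total_vars // 2]):
        target_var = tfVars[idx + total_vars // 2]
        op_holder.append(target_var.assign(
            tau * var.value() + (1.0 - tau) * target_var.value()))
    return op_holder

def updateTarget(op_holder, sess):
    # Run the prepared assign ops inside the given session.
    for op in op_holder:
        sess.run(op)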
Example #3
                img_state, reward, done = game.make_action(action)
                if not done:
                    state_new = img_state
                else:
                    state_new = None
                agent.add_transition(state, action, reward, state_new, done)
                state = state_new

                if learning_step % UPDATE_FREQUENCY == 0:
                    agent.learn_from_memory()
                if learning_step % COPY_FREQUENCY == 0:
                    updateTarget(targetOps, SESSION)

                if done:
                    print("Epoch %d Train Game %d get %.1f" %
                          (epoch, games_cnt, game.get_total_reward()))
                    break
            if SAVE_MODEL and games_cnt % 10 == 0:
                saver.save(SESSION, model_savefile)
                print("Saving the network weigths to:", model_savefile)

        print("\nTesting...")

        test_scores = []
        for test_step in range(EPISODES_TO_TEST):
            game.reset()
            agent.reset_cell_state()
            while not game.is_terminared():
                state = game.get_state()
                action = agent.act(state, train=False)
                game.make_action(action)
Example #4
            # Step the environment and store the resulting transition
            s, reward, d = game.make_action(action)
            done = game.is_terminared()
            if not done:
                state_new = preprocess(game.get_state())
            else:
                state_new = None  # terminal transitions have no successor state

            agent.add_transition(state, action, reward, state_new, done)
            state = state_new

            # Train from replay memory and sync the target network at the same interval
            if learning_step % UPDATE_FREQUENCY == 0:
                agent.learn_from_memory()
                updateTarget(targetOps, SESSION)

            if done:
                # Episode finished: record its score and start a fresh episode
                train_scores.append(game.get_total_reward())
                train_episodes_finished += 1
                game.reset()
                agent.reset_cell_state()
                state = preprocess(game.get_state())

        print("%d training episodes played." % train_episodes_finished)
        train_scores = np.array(train_scores)

        print(
            "Results: mean: %.1f±%.1f," %
            (train_scores.mean(), train_scores.std()),
            "min: %.1f," % train_scores.min(),
            "max: %.1f," % train_scores.max())

        print("\nTesting...")