def test_agent_solve_bit_flipping_game():
    """Train each value-based/policy-gradient agent on the Bit Flipping game and
    check that every agent's best score after episode 50 is non-negative."""
    agents_under_test = [PPO, DDQN, DQN_With_Fixed_Q_Targets,
                         DDQN_With_Prioritised_Experience_Replay, DQN, DQN_HER]
    trainer = Trainer(config, agents_under_test)
    results = trainer.train()
    for agent in agents_under_test:
        # results[name][0][1] appears to hold per-episode scores; the first 50
        # episodes are treated as warm-up and excluded — TODO confirm against Trainer.
        best_score = np.max(results[agent.agent_name][0][1][50:])
        assert best_score >= 0.0, "Failed for {} -- score {}".format(agent.agent_name, best_score)
def test_agents_can_play_games_of_different_dimensions():
    """Smoke-test that agents train and report results on environments of
    different action/state dimensionality: discrete (CartPole), continuous
    (MountainCarContinuous) and a grid world (Four Rooms).

    Fixes an ordering inconsistency in the original: the first section built the
    Trainer BEFORE assigning ``config.environment``, unlike the other two
    sections. The environment is now always set before the Trainer is built.
    """
    config.num_episodes_to_run = 10
    config.hyperparameters["DQN_Agents"]["batch_size"] = 3

    def _train_and_check(agents, environment):
        # Set the environment on the shared config first so the Trainer never
        # observes a stale environment at construction time.
        config.environment = environment
        results = Trainer(config, agents).train()
        for agent in agents:
            # Every agent that ran must have an entry in the results dict.
            assert agent.agent_name in results.keys()

    _train_and_check(
        [A2C, A3C, PPO, DDQN, DQN_With_Fixed_Q_Targets,
         DDQN_With_Prioritised_Experience_Replay, DQN],
        gym.make("CartPole-v0"))
    _train_and_check([SAC, TD3, PPO, DDPG], gym.make("MountainCarContinuous-v0"))
    _train_and_check(
        [DDQN, SNN_HRL],
        Four_Rooms_Environment(15, 15, stochastic_actions_probability=0.25,
                               random_start_user_place=True,
                               random_goal_place=False))
"batch_size": 128, "buffer_size": 100000, "epsilon": 1.0, "epsilon_decay_rate_denominator": 150, "discount_rate": 0.999, "alpha_prioritised_replay": 0.6, "beta_prioritised_replay": 0.1, "incremental_td_error": 1e-8, "update_every_n_steps": 15, "tau": 1e-2, "linear_hidden_units": [256, 256], "final_layer_activation": "softmax", # "y_range": (-1, 14), "batch_norm": False, "gradient_clipping_norm": 5, "HER_sample_proportion": 0.8, "learning_iterations": 1, "clip_rewards": False } } config.model = FCNN() if __name__== '__main__': AGENTS = [DQN, DRQN, ]#DDQN, Dueling_DDQN, DDQN_With_Prioritised_Experience_Replay] trainer = Trainer(config, AGENTS) trainer.train()