Python DeepQAgentParams.env 예제들

프로그래밍 언어: Python

네임스페이스/패키지 이름: graphrl.agents.deep_q_agent

클래스/타입: DeepQAgentParams

메소드/함수: env

hotexamples.com에서의 예제들: 2

Python DeepQAgentParams.env - 2개의 예제가 발견되었습니다. 이것들은 오픈소스 프로젝트에서 추출된 Python의 graphrl.agents.deep_q_agent.DeepQAgentParams.env에 대한 실세계 최고 등급의 예제들입니다. 예제들을 평가하여 예제의 품질 향상에 도움을 줄 수 있습니다.

자주 사용되는 메소드들

보기 숨기기

DeepQAgentParams(4)

make_agent(4)

online_q_net(4)

sacred_run(4)

target_q_net(4)

obs_filter(3)

env(2)

mode(2)

test_envs(2)

train_env(2)

예제 #1

파일 보기

def main(game, _seed, _run):
    torch.manual_seed(_seed)

    game = lower_under_to_upper(game) + 'NoFrameskip-v4'
    env = gym.make(game)
    env = wrap_deepmind(env)

    input_space = env.observation_space
    num_actions = env.action_space.n

    agent_params = DeepQAgentParams()
    add_params(params=agent_params, prefix='agent')
    add_params(params=agent_params.optimizer_params, prefix='opt')
    add_epsilon_params(params=agent_params)
    agent_params.obs_filter = AtariObservationFilter()

    input_space = agent_params.obs_filter.output_space(input_space)

    agent_params.sacred_run = _run
    agent_params.env = env
    agent_params.mode = 'train'

    online_q_net = build_net(input_shape=input_space.shape,
                             num_actions=num_actions)
    target_q_net = build_net(input_shape=input_space.shape,
                             num_actions=num_actions)
    agent_params.online_q_net = online_q_net
    agent_params.target_q_net = target_q_net

    agent = agent_params.make_agent()
    agent.run()

예제 #2

파일 보기

파일: train_dqn.py 프로젝트: varunkumar3618/sdfgsfgs

def main(_seed, _run):
    torch.manual_seed(_seed)

    env = build_env()
    input_shape = env.observation_space.shape
    num_actions = env.action_space.n

    agent_params = DeepQAgentParams()
    add_params(params=agent_params, prefix='agent')
    add_params(params=agent_params.optimizer_params, prefix='opt')
    add_epsilon_params(params=agent_params)

    agent_params.sacred_run = _run
    agent_params.env = env
    agent_params.mode = 'train'

    online_q_net = build_net(input_shape=input_shape, num_actions=num_actions)
    target_q_net = build_net(input_shape=input_shape, num_actions=num_actions)
    agent_params.online_q_net = online_q_net
    agent_params.target_q_net = target_q_net

    agent = agent_params.make_agent()
    agent.run()