Python DistributedD4PG 예제들

프로그래밍 언어: Python

네임스페이스/패키지 이름: acme.agents.tf.d4pg

메소드/함수: DistributedD4PG

hotexamples.com에서의 예제들: 4

Python DistributedD4PG - 4개의 예제가 발견되었습니다. 이것들은 오픈소스 프로젝트에서 추출된 Python의 acme.agents.tf.d4pg.DistributedD4PG에 대한 실세계 최고 등급의 예제들입니다. 예제들을 평가하여 예제의 품질 향상에 도움을 줄 수 있습니다.

예제 #1

파일 보기

파일: lp_d4pg.py 프로젝트: vishalbelsare/acme

def main(_):
    environment_factory = lp_utils.partial_kwargs(helpers.make_environment,
                                                  task=FLAGS.task)

    program = d4pg.DistributedD4PG(environment_factory=environment_factory,
                                   network_factory=lp_utils.partial_kwargs(
                                       helpers.make_networks),
                                   num_actors=2).build()

    lp.launch(program, xm_resources=lp_utils.make_xm_docker_resources(program))

예제 #2

파일 보기

def main(_):
    environment_factory = lp_utils.partial_kwargs(helpers.make_environment,
                                                  task=FLAGS.task)

    program = d4pg.DistributedD4PG(environment_factory=environment_factory,
                                   network_factory=lp_utils.partial_kwargs(
                                       helpers.make_networks),
                                   num_actors=2).build()

    lp.launch(program, lp.LaunchType.LOCAL_MULTI_PROCESSING)

예제 #3

파일 보기

def main(_):
    # Configure the environment factory with requested task.
    make_environment = functools.partial(helpers.make_environment,
                                         domain_name=_DOMAIN.value,
                                         task_name=_TASK.value)

    # Construct the program.
    program_builder = d4pg.DistributedD4PG(
        make_environment,
        make_networks,
        max_actor_steps=_MAX_ACTOR_STEPS.value,
        num_actors=4)

    # Launch experiment.
    lp.launch(programs=program_builder.build())

예제 #4

파일 보기

    def test_control_suite(self):
        """Tests that the agent can run on the control suite without crashing."""

        agent = d4pg.DistributedD4PG(
            environment_factory=lambda x: fakes.ContinuousEnvironment(bounded=
                                                                      True),
            network_factory=make_networks,
            num_actors=2,
            batch_size=32,
            min_replay_size=32,
            max_replay_size=1000,
        )
        program = agent.build()

        (learner_node, ) = program.groups['learner']
        learner_node.disable_run()

        lp.launch(program, launch_type='test_mt')

        learner: acme.Learner = learner_node.create_handle().dereference()

        for _ in range(5):
            learner.step()