Python build_ddpg_models Examples

Programming Language: Python

Namespace/Package Name: ray.rllib.agents.ddpg.ddpg_tf_policy

Method/Function: build_ddpg_models

Examples at hotexamples.com: 3

Python build_ddpg_models - 3 examples found. These are the top rated real world Python examples of ray.rllib.agents.ddpg.ddpg_tf_policy.build_ddpg_models extracted from open source projects. You can rate examples to help us improve the quality of examples.

Example #1

Show file

File: ddpg_torch_policy.py Project: zjureel/ray

def build_ddpg_models_and_action_dist(policy, obs_space, action_space, config):
    model = build_ddpg_models(policy, obs_space, action_space, config)
    # TODO(sven): Unify this once we generically support creating more than
    #  one Model per policy. Note: Device placement is done automatically
    #  already for `policy.model` (but not for the target model).
    device = (torch.device("cuda")
              if torch.cuda.is_available() else torch.device("cpu"))
    policy.target_model = policy.target_model.to(device)
    return model, TorchDeterministic

Example #2

Show file

File: ddpg_torch_policy.py Project: stefanbschneider/ray

def build_ddpg_models_and_action_dist(
        policy: Policy, obs_space: gym.spaces.Space,
        action_space: gym.spaces.Space,
        config: TrainerConfigDict) -> Tuple[ModelV2, ActionDistribution]:
    model = build_ddpg_models(policy, obs_space, action_space, config)

    if isinstance(action_space, Simplex):
        return model, TorchDirichlet
    else:
        return model, TorchDeterministic

Example #3

Show file

File: ddpg_torch_policy.py Project: yncxcw/ray

def build_ddpg_models_and_action_dist(
        policy: Policy, obs_space: gym.spaces.Space,
        action_space: gym.spaces.Space,
        config: TrainerConfigDict) -> Tuple[ModelV2, ActionDistribution]:
    model = build_ddpg_models(policy, obs_space, action_space, config)
    # TODO(sven): Unify this once we generically support creating more than
    #  one Model per policy. Note: Device placement is done automatically
    #  already for `policy.model` (but not for the target model).
    device = (torch.device("cuda")
              if torch.cuda.is_available() else torch.device("cpu"))
    policy.target_model = policy.target_model.to(device)

    if isinstance(action_space, Simplex):
        return model, TorchDirichlet
    else:
        return model, TorchDeterministic