Python TorchModelV2.parameters 예제들

프로그래밍 언어: Python

네임스페이스/패키지 이름: ray.rllib.models.torch.torch_modelv2

클래스/타입: TorchModelV2

메소드/함수: parameters

hotexamples.com에서의 예제들: 2

Python TorchModelV2.parameters - 2개의 예제가 발견되었습니다. 이것들은 오픈소스 프로젝트에서 추출된 Python의 ray.rllib.models.torch.torch_modelv2.TorchModelV2.parameters에 대한 실세계 최고 등급의 예제들입니다. 예제들을 평가하여 예제의 품질 향상에 도움을 줄 수 있습니다.

자주 사용되는 메소드들

보기 숨기기

__init__(17)

parameters(2)

decoder(1)

encoder(1)

imagine_ahead(1)

reward(1)

value(1)

value_function(1)

예제 #1

파일 보기

 def __init__(self, inputs: List[TensorType], model: TorchModelV2):
     # If inputs are not a torch Tensor, make them one and make sure they
     # are on the correct device.
     if not isinstance(inputs, torch.Tensor):
         inputs = torch.from_numpy(inputs)
         if isinstance(model, TorchModelV2):
             inputs = inputs.to(next(model.parameters()).device)
     super().__init__(inputs, model)
     # Store the last sample here.
     self.last_sample = None

예제 #2

파일 보기

    def __init__(self,
                 inputs: List[TensorType],
                 model: TorchModelV2,
                 low: float = -1.0,
                 high: float = 1.0):
        """Parameterizes the distribution via `inputs`.
        Args:
            low (float): The lowest possible sampling value
                (excluding this value).
            high (float): The highest possible sampling value
                (excluding this value).
        """
        super().__init__(inputs, model)

        assert low < high
        # Make sure high and low are torch tensors.
        self.low = torch.from_numpy(np.array(low))
        self.high = torch.from_numpy(np.array(high))
        # Place on correct device.
        if isinstance(model, TorchModelV2):
            device = next(model.parameters()).device
            self.low = self.low.to(device)
            self.high = self.high.to(device)

        mean, log_std = torch.chunk(self.inputs, 2, dim=-1)
        self._num_vars = mean.shape[1]
        assert log_std.shape[1] == self._num_vars
        # Clip `std` values (coming from NN) to reasonable values.
        self.log_std = torch.clamp(log_std, MIN_LOG_NN_OUTPUT,
                                   MAX_LOG_NN_OUTPUT)
        # Clip loc too, for numerical stability reasons.
        mean = torch.clamp(mean, -3, 3)
        std = torch.exp(self.log_std)
        self.distr = torch.distributions.normal.Normal(mean, std)
        assert len(self.distr.loc.shape) == 2
        assert len(self.distr.scale.shape) == 2