Example #1
    def loss_fcn(self, rollout_real: StepSequence,
                 rollout_sim: StepSequence) -> float:
        """
        Compute the discrepancy between two time series of observations using the given metric.
        Be sure to align and truncate the rollouts beforehand.

        :param rollout_real: (concatenated) real-world rollout containing the observations
        :param rollout_sim: (concatenated) simulated rollout containing the observations
        :return: discrepancy cost summed over the observation dimensions
        """
        if len(rollout_real) != len(rollout_sim):
            raise pyrado.ShapeErr(given=rollout_real,
                                  expected_match=rollout_sim)

        # Extract the observations
        real_obs = rollout_real.get_data_values("observations",
                                                truncate_last=True)
        sim_obs = rollout_sim.get_data_values("observations",
                                              truncate_last=True)

        # Filter the observations
        real_obs = gaussian_filter1d(real_obs, self.std_obs_filt, axis=0)
        sim_obs = gaussian_filter1d(sim_obs, self.std_obs_filt, axis=0)

        # Normalize the signals
        real_obs_norm = self.obs_normalizer.project_to(real_obs)
        sim_obs_norm = self.obs_normalizer.project_to(sim_obs)

        # Compute loss based on the error
        loss_per_obs_dim = self.metric(real_obs_norm - sim_obs_norm)
        assert len(loss_per_obs_dim) == real_obs.shape[1]
        assert all(loss_per_obs_dim >= 0)
        return sum(loss_per_obs_dim)
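
The method above relies on three attributes of the enclosing class: `std_obs_filt`, `obs_normalizer`, and `metric`. A minimal sketch of what they could look like, assuming a min-max normalizer and a squared-error metric; both classes and all concrete values below are illustrative assumptions, not Pyrado's actual implementation:

import numpy as np


class _MinMaxNormalizer:
    """Hypothetical normalizer exposing the project_to() interface used by loss_fcn()."""

    def __init__(self, bound_lo: np.ndarray, bound_up: np.ndarray):
        self.bound_lo, self.bound_up = bound_lo, bound_up

    def project_to(self, data: np.ndarray) -> np.ndarray:
        # Scale every observation dimension to the unit interval
        return (data - self.bound_lo) / (self.bound_up - self.bound_lo)


class _LossFcnOwnerSketch:
    """Hypothetical owner of loss_fcn(), shown only to make the assumed interface explicit."""

    def __init__(self, bound_lo: np.ndarray, bound_up: np.ndarray):
        self.std_obs_filt = 5  # std of the Gaussian filter in time steps (assumed value)
        self.obs_normalizer = _MinMaxNormalizer(bound_lo, bound_up)
        # Reduce the (num_steps x dim_obs) error signal to one non-negative value per obs dim
        self.metric = lambda err: np.sum(np.power(err, 2), axis=0)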
Example #2
def convert_step_sequence(traj: StepSequence):
    """
    Convert a `StepSequence` to a tensor which can be fed through a network.

    :param traj: a step sequence containing a trajectory
    :return: a tensor containing the trajectory
    """
    assert isinstance(traj, StepSequence)
    traj.torch()

    # Extract the data; the observations hold one more entry than the actions
    state = traj.get_data_values('observations')[:-1].double()
    next_state = traj.get_data_values('observations')[1:].double()
    action = traj.get_data_values('actions').narrow(
        0, 0, next_state.shape[0]).double()

    # Stack the states, next states, and actions column-wise
    traj = to.cat((state, next_state, action), 1).cpu().double()
    return traj
Example #3
def preprocess_rollout(rollout: StepSequence) -> StepSequence:
    """
    Extracts observations and actions from a `StepSequence` and packs them into a PyTorch tensor which can be fed
    through a network.

    :param rollout: a `StepSequence` instance containing a trajectory
    :return: a PyTorch tensor containing the trajectory
    """
    if not isinstance(rollout, StepSequence):
        raise pyrado.TypeErr(given=rollout, expected_type=StepSequence)

    # Convert data type
    rollout.torch(to.get_default_dtype())

    # Extract the data
    state = rollout.get_data_values("observations")[:-1]
    next_state = rollout.get_data_values("observations")[1:]
    action = rollout.get_data_values("actions").narrow(0, 0,
                                                       next_state.shape[0])

    # Stack the states, next states, and actions column-wise
    rollout = to.cat((state, next_state, action), 1)
    return rollout
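
For illustration, a hedged usage sketch of `preprocess_rollout()` (the same layout applies to `convert_step_sequence()` from Example #2). Assuming Pyrado's convention that a rollout of `T` steps stores `T + 1` observations (including the terminal one) and `T` actions, the returned tensor has `T` rows whose columns are laid out as `[s_t | s_{t+1} | a_t]`:

data = preprocess_rollout(ro)  # ro: a recorded StepSequence
dim_obs = ro.get_data_values("observations").shape[1]
dim_act = ro.get_data_values("actions").shape[1]
assert data.shape[1] == 2*dim_obs + dim_act

# Recover the column blocks, e.g. to train a dynamics model on (s_t, a_t) -> s_{t+1}
state, next_state, action = data.split([dim_obs, dim_obs, dim_act], dim=1)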
Example #4
    def evaluate(self,
                 rollout: StepSequence,
                 hidden_states_name: str = 'hidden_states') -> to.Tensor:
        """
        Re-evaluate the given rollout and return a differentiable action tensor.
        The default implementation simply calls `forward()`.

        :param rollout: recorded, complete rollout
        :param hidden_states_name: name of hidden states rollout entry, used for recurrent networks.
                                   Defaults to 'hidden_states'. Change for value functions.
        :return: actions with gradient data
        """
        # Set policy, i.e. PyTorch nn.Module, to evaluation mode
        self.eval()
        return self(rollout.get_data_values(
            'observations', truncate_last=True))  # all observations at once
Example #5
    def evaluate(self,
                 rollout: StepSequence,
                 hidden_states_name: str = 'hidden_states') -> to.Tensor:
        """
        Re-evaluate the given rollout and return a differentiable action tensor.
        The default implementation simply calls `forward()`.

        :param rollout: complete rollout
        :param hidden_states_name: name of hidden states rollout entry, used for recurrent networks.
                                   Defaults to 'hidden_states'. Change for value functions.
        :return: actions with gradient data
        """
        # Set policy, i.e. PyTorch nn.Module, to evaluation mode
        self.eval()

        res = self(rollout.get_data_values(
            'observations', truncate_last=True))  # all observations at once

        # Set policy, i.e. PyTorch nn.Module, back to training mode
        self.train()

        return res
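
A hedged usage sketch for both `evaluate()` variants above: the forward pass runs in evaluation mode, so modules like dropout act deterministically, while the returned actions still carry gradient information through the policy parameters. The recording helper and the objective below are placeholders:

ro = rollout(env, policy, eval=True)  # record a rollout, e.g. with Pyrado's sampling helper
acts = policy.evaluate(ro)            # one action per observation, terminal observation truncated
assert acts.requires_grad             # gradients flow back into the policy parameters

loss = -acts.sum()  # placeholder objective, only for illustration
loss.backward()

The second variant additionally restores training mode afterwards, which matters when the policy is updated right after the evaluation: a subsequent `forward()` call would otherwise still run with e.g. dropout disabled.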