import numpy as np

# Imports assume the keras-rl package layout.
from rl.core import Processor
from rl.util import WhiteningNormalizer


class WhiteningNormalizerProcessor(Processor):
    """Normalizes the observations to have zero mean and a standard deviation
    of one, i.e. it applies whitening to the inputs.

    This typically helps significantly with learning, especially if different
    dimensions are on different scales. However, it complicates training in the
    sense that you will have to store these weights alongside the policy if you
    intend to load it later. It is the responsibility of the user to do so.
    """
    def __init__(self):
        self.normalizer = None

    def process_state_batch(self, batch):
        # Lazily create the normalizer once the batch shape and dtype are known.
        if self.normalizer is None:
            self.normalizer = WhiteningNormalizer(shape=batch.shape[1:], dtype=batch.dtype)
        self.normalizer.update(batch)
        return self.normalizer.normalize(batch)

    def process_action(self, action):
        # Map the normalized action components back to their physical ranges.
        upper_action, delta_x_norm, acc_norm = action
        # delta_x_norm in [-1, 1] is rescaled and clipped to [10, 60].
        delta_x = np.clip((delta_x_norm + 1) / 2 * 50 + 10, 10, 60)
        # acc_norm is scaled and clipped to an acceleration in [-3, 3].
        acc = np.clip(acc_norm * 3, -3, 3)
        return upper_action, delta_x, acc

    @staticmethod
    def process_reward_batch(batch):
        # Scale rewards down to keep them in a small numeric range.
        return batch / 100
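The docstring above notes that the whitening statistics must be stored alongside the policy by the user. A minimal sketch of one way to do that with pickle follows; the `agent`, `processor`, and file names are assumptions for illustration, not part of the original code.

import pickle

def save_agent(agent, processor, weights_path='agent_weights.h5f',
               normalizer_path='normalizer.pkl'):
    # Hypothetical helper: keras-rl agents expose save_weights(); the
    # normalizer itself is persisted separately next to the weights.
    agent.save_weights(weights_path, overwrite=True)
    with open(normalizer_path, 'wb') as f:
        pickle.dump(processor.normalizer, f)

def load_normalizer(processor, normalizer_path='normalizer.pkl'):
    # Restore the previously saved statistics before reusing the processor.
    with open(normalizer_path, 'rb') as f:
        processor.normalizer = pickle.load(f)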
import numpy as np
from numpy.testing import assert_allclose

# Import path assumes the keras-rl package layout.
from rl.util import WhiteningNormalizer


def test_whitening_normalizer():
    x = np.random.normal(loc=.2, scale=2., size=(1000, 5))

    # Statistics accumulated over two updates must match those of the full data.
    normalizer = WhiteningNormalizer(shape=(5,))
    normalizer.update(x[:500])
    normalizer.update(x[500:])
    assert_allclose(normalizer.mean, np.mean(x, axis=0))
    assert_allclose(normalizer.std, np.std(x, axis=0))

    # Normalized data should have zero mean and unit standard deviation.
    x_norm = normalizer.normalize(x)
    assert_allclose(np.mean(x_norm, axis=0), np.zeros(5, dtype=normalizer.dtype), atol=1e-5)
    assert_allclose(np.std(x_norm, axis=0), np.ones(5, dtype=normalizer.dtype), atol=1e-5)

    # Denormalizing should recover the original data.
    x_denorm = normalizer.denormalize(x_norm)
    assert_allclose(x_denorm, x)