Example #1
# Excerpted from Optax (optax/_src/alias.py): it relies on the `base`, `combine`
# and `transform` submodules, the `ScalarOrSchedule` type alias, and the private
# `_scale_by_learning_rate` helper defined in the same module.
def adagrad(learning_rate: ScalarOrSchedule,
            initial_accumulator_value: float = 0.1,
            eps: float = 1e-7) -> base.GradientTransformation:
    """The Adagrad optimizer.

  Adagrad is an algorithm for gradient based optimisation that anneals the
  learning rate for each parameter during the course of training.

  WARNING: Adagrad's main limit is the monotonic accumulation of squared
  gradients in the denominator: since all terms are >0, the sum keeps growing
  during training and the learning rate eventually becomes vanishingly small.

  References:
    [Duchi et al, 2011](https://jmlr.org/papers/v12/duchi11a.html)

  Args:
    learning_rate: this is a fixed global scaling factor.
    initial_accumulator_value: initialisation for the accumulator.
    eps: a small constant applied to denominator inside of the square root
      (as in RMSProp) to avoid dividing by zero when rescaling.

  Returns:
    the corresponding `GradientTransformation`.
  """
    return combine.chain(
        transform.scale_by_rss(
            initial_accumulator_value=initial_accumulator_value, eps=eps),
        _scale_by_learning_rate(learning_rate),
    )
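
For context, the transformation built above is used through Optax's standard init/update API. The following is a minimal usage sketch assuming the public `optax.adagrad` alias; the toy quadratic loss, parameter shape and step count are illustrative choices, not part of the excerpt.

import jax
import jax.numpy as jnp
import optax

def loss_fn(params):
    # Toy quadratic objective with an arbitrary target vector.
    return jnp.sum((params - jnp.array([1.0, -2.0, 3.0])) ** 2)

params = jnp.zeros(3)
optimizer = optax.adagrad(learning_rate=0.1)  # the chain defined above
opt_state = optimizer.init(params)

for _ in range(100):
    grads = jax.grad(loss_fn)(params)
    # `update` rescales gradients using the accumulated squared-gradient statistics.
    updates, opt_state = optimizer.update(grads, opt_state)
    params = optax.apply_updates(params, updates)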
Example #2
# A variant of the same alias with a plain float learning rate, applied directly
# via `transform.scale`; `combine`, `transform` and `GradientTransformation`
# come from Optax as in Example #1.
def adagrad(learning_rate: float,
            initial_accumulator_value: float = 0.1,
            eps: float = 1e-7) -> GradientTransformation:
    return combine.chain(
        transform.scale_by_rss(
            initial_accumulator_value=initial_accumulator_value, eps=eps),
        transform.scale(-learning_rate),
    )
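
Both examples reduce to the same per-parameter update rule, which also makes the docstring's warning concrete: the accumulator of squared gradients never decreases, so the effective step size shrinks monotonically. Below is a simplified textbook-Adagrad sketch for illustration only; it is not the library's internal `scale_by_rss` code, and the function name and defaults are assumptions.

import jax.numpy as jnp

def adagrad_step(params, grads, sum_sq, lr=0.1, eps=1e-7):
    # Accumulate squared gradients; `sum_sq` can only grow over time.
    sum_sq = sum_sq + grads ** 2
    # Per-parameter step size: shrinks as the accumulator grows.
    step = lr / jnp.sqrt(sum_sq + eps)
    return params - step * grads, sum_sq

# `sum_sq` would start at the value of `initial_accumulator_value` above, e.g.:
# sum_sq = jnp.full_like(params, initial_accumulator_value)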