Ejemplos de stable_normalizer en Python

Lenguaje de programación: Python

Namespace/Package Name: helpers

Método / Función: stable_normalizer

Ejemplos en hotexamples.com: 3

Python stable_normalizer - 3 ejemplos encontrados. Estos son los ejemplos en Python del mundo real mejor valorados de helpers.stable_normalizer extraídos de proyectos de código abierto. Puedes valorar ejemplos para ayudarnos a mejorar la calidad de los ejemplos.

Ejemplo n.º 1

Mostrar archivo

Archivo: mcts.py Proyecto: amarildolikmeta/alphazero_singleplayer

 def return_results(self, temp):
     ''' Process the output at the root node '''
     counts = np.array([child_action.n for child_action in self.root.child_actions])
     Q = np.array([child_action.Q for child_action in self.root.child_actions])
     pi_target = stable_normalizer(counts, temp)
     V_target = np.sum((counts / np.sum(counts)) * Q)[None]
     return self.root.index.flatten(), pi_target, V_target

Ejemplo n.º 2

Mostrar archivo

Archivo: alphazerotest.py Proyecto: ReHoss/test_alphazero

 def return_results(self, temp):
     """ Process the output at the root node """
     counts = np.array(
         [child_action.n for child_action in self.root.child_actions])
     q = np.array(
         [child_action.q for child_action in self.root.child_actions])
     pi_target = stable_normalizer(counts, temp)
     v_target = np.sum((counts / np.sum(counts)) * q)[None]
     return self.root.index, pi_target, v_target

Ejemplo n.º 3

Mostrar archivo

Archivo: ol_uct.py Proyecto: amarildolikmeta/alphazero_singleplayer

 def return_results(self, temp, on_visits=False):
     """ Process the output at the root node """
     counts = np.array(
         [child_action.n for child_action in self.root.child_actions])
     Q = np.array(
         [child_action.Q for child_action in self.root.child_actions])
     if on_visits:
         pi_target = stable_normalizer(counts, temp)
     else:
         pi_target = max_Q(Q)
     V_target = np.sum((counts / np.sum(counts)) * Q)[None]
     return self.root_signature, pi_target, V_target