Python tf_masked_softmax Beispiele

Programmiersprache: Python

Namespace / Paketname: open_spiel.python.algorithms.masked_softmax

Methode / Funktion: tf_masked_softmax

Beispiele auf hotexamples.com: 3

Python tf_masked_softmax - 3 Beispiele gefunden. Dies sind die am besten bewerteten Python Beispiele für die open_spiel.python.algorithms.masked_softmax.tf_masked_softmax, die aus Open Source-Projekten extrahiert wurden. Sie können Beispiele bewerten, um die Qualität der Beispiele zu verbessern.

Beispiel #1

Datei anzeigen

Datei: masked_softmax_test.py Projekt: julianhartmann1/HCII

  def test_masked_softmax_on_all_invalid_moves(self):
    # If all actions are illegal, the behavior is undefined (it can be nan
    # or can be 0. We add this test to document this behavior and know if we
    # change it.
    np_logits = np.asarray([[
        [1.0, 1.0, 1.0],
        [0.0, 0.0, 0.0],
        [0.0, 0.0, 0.0],
    ]])
    logits = tf.Variable(np_logits, tf.float32)
    np_mask = np.asarray([[
        [1.0, 1.0, 1.0],
        [1.0, 0.0, 1.0],
        [0.0, 0.0, 0.0],
    ]])
    mask = tf.Variable(np_mask, tf.float32)

    expected = np.asarray([[
        [1 / 3, 1 / 3, 1 / 3],
        [1 / 2, 0.0, 1 / 2],
        [np.nan, np.nan, np.nan],
    ]])

    policy = masked_softmax.tf_masked_softmax(logits, mask)

    with tf.Session() as sess:
      sess.run(tf.global_variables_initializer())
      np_policy = sess.run(policy)
    np.testing.assert_array_almost_equal(expected, np_policy)

    # Numpy behaves similarly.
    np.testing.assert_array_almost_equal(
        expected, masked_softmax.np_masked_softmax(np_logits, np_mask))

Beispiel #2

Datei anzeigen

Datei: masked_softmax_test.py Projekt: julianhartmann1/HCII

  def test_tf_masked_softmax(self, np_logits, np_legal_actions, expected):
    logits = tf.Variable(np_logits, tf.float32)
    mask = tf.Variable(np_legal_actions, tf.float32)

    policy = masked_softmax.tf_masked_softmax(logits, mask)

    with tf.Session() as sess:
      sess.run(tf.global_variables_initializer())
      np_policy = sess.run(policy)

    np.testing.assert_array_almost_equal(expected, np_policy)

Beispiel #3

Datei anzeigen

 def masked_softmax(self, logits):
     """Safe masked softmax."""
     return masked_softmax.tf_masked_softmax(
         logits, self.tabular_policy.legal_actions_mask)