Example #1
0
 def update_policy(self, s, a, game):
     """Update this agent's policy at state ``s``.

     The expected payoff of each of our actions is obtained by weighting
     Q[s] with the opponent's estimated best-response policy at ``s``.
     If ``self.a_policy == 'softmax'`` the new policy is a softmax over
     those payoffs; otherwise it is greedy (uniform over the argmax
     actions).  ``a`` and ``game`` are unused here but kept for
     interface compatibility with callers.
     """
     # Expected payoff per action.  NOTE(review): assumes Q[s] is 2-D
     # with axis 1 indexing opponent actions -- confirm against callers.
     expected = np.sum(np.multiply(self.Q[s], self.opponent_best_pi[s]), 1)
     if self.a_policy == 'softmax':
         self.pi[s] = utils.softmax(expected)
     else:
         # Greedy selection.  The original code left ties unnormalized
         # (probabilities summing > 1); divide by the number of
         # maximizers so pi[s] is a valid distribution.
         greedy = (expected == np.max(expected)).astype(np.double)
         self.pi[s] = greedy / greedy.sum()
     # Deep-copy snapshots so later in-place updates don't rewrite history.
     self.pi_history.append(deepcopy(self.pi))
     self.opponent_best_pi_history.append(deepcopy(self.opponent_best_pi))
     print('opponent pi of {}: {}'.format(self.id_, self.opponent_best_pi))
Example #2
0
    def update_policy(self, s, a, game):
        """Recompute this agent's policy for state ``s``.

        The new policy is a softmax over the expected payoff of each of
        our actions, obtained by projecting Q[s] onto the opponent's
        current policy estimate at ``s``.  ``a`` and ``game`` are
        accepted for interface compatibility but not used.
        """
        # Expected payoff per action under the opponent's policy model.
        payoff = np.dot(self.Q[s], self.opponent_pi[s])
        self.pi[s] = utils.softmax(payoff)

        # Snapshot (deep-copied) both policies for later analysis.
        self.pi_history.append(deepcopy(self.pi))
        self.opponent_pi_history.append(deepcopy(self.opponent_pi))
        print('opponent pi of {}: {}'.format(self.id_, self.opponent_pi[s]))