Python softmax示例

编程语言: Python

命名空间/包名称: agents.rl.utils.functions

方法/功能: softmax

hotexamples.com的示例: 3

Python softmax - 已找到3个示例。这些是从开源项目中提取的最受好评的agents.rl.utils.functions.softmax现实Python示例。您可以评价示例，以帮助我们提高示例质量。

示例#1

显示文件

文件： A2CQPGAgent.py 项目： Malen11/ImpInf-Games-Agents

    def eval_step(self, state):

        batch = [state['obs']]
        ts = tf.convert_to_tensor(batch)

        logits, _ = self.bot.predict(ts)
        probs = softmax(logits, state['legal_actions'])[0]
        best_action = np.argmax(probs)
        return best_action, probs

示例#2

显示文件

    def eval_step(self, state):
        self.bot.lstm.add_data(state['obs'])
        batch = [self.bot.lstm.get_data()]
        ts = tf.convert_to_tensor(batch)

        logits = self.bot.predict_policy(ts)
        probs = softmax(logits, state['legal_actions'])[0]
        best_action = np.argmax(probs)
        return best_action, probs

示例#3

显示文件

    def get_action(self, state, legal_actions):

        batch = [state]
        ts = tf.convert_to_tensor(batch)

        logits = self.predict_policy(ts)
        probs = softmax(logits, legal_actions)[0]
        selected_action = np.random.choice(self.num_actions, p=probs)

        return selected_action