Example #1
 def policy(self, inputs, states):
     """
     We first compute the successor features and then use the goal vector to
     compute the Q values.
     """
     # Successor features: one feature vector per (state, action) pair.
     srs, states = self.sr(inputs, states)
     # Broadcast the goal vector across actions so it lines up with srs.
     goal = self.goal(inputs).unsqueeze(1).expand(-1, self.num_actions, -1)
     # Q(s, a) is the dot product of the goal vector and the successor features.
     q_value = torch.sum(torch.mul(goal, srs),
                         dim=-1).view(-1, self.num_actions)
     return dict(action=comf.q_categorical(q_value)), states
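A minimal, self-contained sketch of the shape arithmetic behind this Q-value computation; `batch_size`, `num_actions`, and `feature_dim` are illustrative assumptions, not values taken from the original code.

 import torch

 # Hypothetical sizes for illustration only.
 batch_size, num_actions, feature_dim = 4, 6, 16

 # Successor features: one feature vector per (state, action) pair.
 srs = torch.randn(batch_size, num_actions, feature_dim)

 # Goal vector: one per state, broadcast across actions to match srs.
 goal = torch.randn(batch_size, feature_dim)
 goal = goal.unsqueeze(1).expand(-1, num_actions, -1)

 # Q(s, a) = <goal(s), sr(s, a)>: multiply elementwise, sum over features.
 q_value = torch.sum(torch.mul(goal, srs), dim=-1).view(-1, num_actions)
 print(q_value.shape)  # torch.Size([4, 6])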
Example #2
 def policy(self, inputs, states):
     # The value network already outputs Q values; read them out directly.
     values, states = self.value(inputs, states)
     q_value = values["q_value"]
     return dict(action=comf.q_categorical(q_value)), states
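The exact behavior of `comf.q_categorical` comes from the surrounding framework and is not shown here; a common stand-in, sketched below purely as an assumption, is to sample an action index from a softmax over the Q values.

 import torch
 import torch.nn.functional as F

 def q_categorical_sketch(q_value, temperature=1.0):
     """Hypothetical stand-in for comf.q_categorical: sample one action per
     row from a softmax over the Q values (assumed behavior, not the
     framework's actual implementation)."""
     probs = F.softmax(q_value / temperature, dim=-1)
     return torch.multinomial(probs, num_samples=1)

 q_value = torch.randn(4, 6)          # hypothetical batch of Q values
 action = q_categorical_sketch(q_value)
 print(action.shape)                  # torch.Size([4, 1])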