Python calc_q_value_logits Exemples

Langage de programmation: Python

Espace de nommage/Pack: slm_lab.lib.math_util

Méthode/Fonction: calc_q_value_logits

Exemples au hotexamples.com: 3

Python calc_q_value_logits - 3 exemples trouvés. Ce sont les exemples réels les mieux notés de slm_lab.lib.math_util.calc_q_value_logits extraits de projets open source. Vous pouvez noter les exemples pour nous aider à en améliorer la qualité.

Exemple #1

0

Afficher le fichier

Fichier : mlp.py Projet : wilson1yan/SLM-Lab

def forward(self, x): '''The feedforward step''' x = self.model_body(x) state_value = self.v(x) raw_advantages = self.adv(x) out = math_util.calc_q_value_logits(state_value, raw_advantages) return out

Exemple #2

0

Afficher le fichier

def forward(self, x): '''The feedforward step''' x = self.conv_model(x) x = x.view(x.size(0), -1) # to (batch_size, -1) if hasattr(self, 'fc_model'): x = self.fc_model(x) state_value = self.v(x) raw_advantages = self.adv(x) out = math_util.calc_q_value_logits(state_value, raw_advantages) return out

Exemple #3

0

Afficher le fichier

Fichier : test_math_util.py Projet : c-w-m/slm-lab

def test_calc_q_value_logits(): state_value = torch.tensor([[1.], [2.], [3.]]) advantages = torch.tensor([[0., 1.], [1., 1.], [1., 0.]]) result = torch.tensor([[0.5, 1.5], [2.0, 2.0], [3.5, 2.5]]) out = math_util.calc_q_value_logits(state_value, advantages) assert torch.allclose(out, result)