def _save_snapshot(self, policy: TFPolicy) -> None:
    """
    Saves a snapshot of the policy's current weights and its ELO rating,
    overwriting the oldest snapshot once the window is full.
    :param policy: Policy whose weights should be snapshotted.
    """
    weights = policy.get_weights()
    try:
        self.policy_snapshots[self.snapshot_counter] = weights
    except IndexError:
        # Window not yet full: grow the snapshot list instead of overwriting.
        self.policy_snapshots.append(weights)
    self.policy_elos[self.snapshot_counter] = self.current_elo
    self.snapshot_counter = (self.snapshot_counter + 1) % self.window
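For context, the bookkeeping above is a fixed-size circular window: snapshots are appended until `self.window` slots exist, after which the counter wraps around and the oldest snapshot is overwritten. A minimal standalone sketch of that pattern (the window size and the integer stand-ins for weights are illustrative values, not anything taken from the trainer):

# Minimal sketch of the circular snapshot window (illustrative values only).
window = 3
policy_snapshots = []          # grows until it holds `window` entries
policy_elos = [0.0] * window   # ELO recorded alongside each snapshot
snapshot_counter = 0

for step, weights in enumerate([10, 20, 30, 40, 50]):
    try:
        policy_snapshots[snapshot_counter] = weights   # overwrite oldest slot
    except IndexError:
        policy_snapshots.append(weights)               # window not yet full
    policy_elos[snapshot_counter] = 1200.0 + step
    snapshot_counter = (snapshot_counter + 1) % window

print(policy_snapshots)  # [40, 50, 30]: slots 0 and 1 were overwritten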
def add_policy(self, name_behavior_id: str, policy: TFPolicy) -> None:
    # for saving/swapping snapshots
    policy.init_load_weights()
    self.policies[name_behavior_id] = policy
    # First policy encountered
    if not self.learning_behavior_name:
        weights = policy.get_weights()
        self.current_policy_snapshot = weights
        self._save_snapshot(policy)
        self.trainer.add_policy(name_behavior_id, policy)
        self.learning_behavior_name = name_behavior_id
def add_policy(self, name_behavior_id: str, policy: TFPolicy) -> None:
    """
    Adds policy to trainer. The first policy added is registered with the
    wrapped trainer, and the learning behavior name is set to name_behavior_id.
    :param name_behavior_id: Behavior ID that the policy should belong to.
    :param policy: Policy to associate with name_behavior_id.
    """
    self.policies[name_behavior_id] = policy
    policy.create_tf_graph()
    # First policy encountered
    if not self.learning_behavior_name:
        weights = policy.get_weights()
        self.current_policy_snapshot = weights
        self.trainer.add_policy(name_behavior_id, policy)
        self._save_snapshot(policy)  # Need to save after trainer initializes policy
        self.learning_behavior_name = name_behavior_id
    else:
        # for saving/swapping snapshots
        policy.init_load_weights()
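The key difference between the two versions is ordering: the revised add_policy builds the TF graph and registers the policy with the wrapped trainer before taking the first snapshot, since the trainer is what initializes the policy's weights, and it defers init_load_weights() to the non-learning policies. Both versions share the "first behavior registered becomes the learning behavior" gate; a minimal sketch of just that gating (the class and names below are illustrative stand-ins, not the trainer's API):

from typing import Dict, Optional

class FirstPolicyGate:
    """Illustrative stand-in: the first behavior registered is treated as
    the learning behavior; later behaviors are only tracked for swapping."""

    def __init__(self) -> None:
        self.policies: Dict[str, object] = {}
        self.learning_behavior_name: Optional[str] = None

    def add_policy(self, name_behavior_id: str, policy: object) -> None:
        self.policies[name_behavior_id] = policy
        if not self.learning_behavior_name:
            # First policy encountered: mark this behavior as the learner.
            self.learning_behavior_name = name_behavior_id

gate = FirstPolicyGate()
gate.add_policy("behavior_a", object())
gate.add_policy("behavior_b", object())
print(gate.learning_behavior_name)  # behavior_a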