Python Action.sample Examples

Programming language: Python

Namespace/package name: overcooked_ai_py.mdp.actions

Class/type: Action

Method/function: sample

Examples on hotexamples.com: 4

Python Action.sample - 4 examples found. These are the top-rated real-world Python examples of overcooked_ai_py.mdp.actions.Action.sample, extracted from open source projects.

Frequently used methods

sample(4)

move_in_direction(2)

remove_indices_and_renormalize(1)

Example #1

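    # Excerpt from an agent class; assumes
    # `from overcooked_ai_py.mdp.actions import Action`.
    def actions(self, states, agent_indices):
        # One probability vector over actions per input state.
        action_probs_n = self.policy.multi_state_policy(states, agent_indices)
        actions_and_infos_n = []
        for action_probs in action_probs_n:
            # Draw one action from this state's distribution.
            action = Action.sample(action_probs)
            actions_and_infos_n.append((action, {"action_probs": action_probs}))
        return actions_and_infos_n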
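
The call `Action.sample(action_probs)` above draws one action according to a probability vector. The following is a minimal standard-library sketch of that idea, not the overcooked_ai_py implementation; `ALL_ACTIONS` (with its string labels) and `sample_action` are hypothetical names introduced only for illustration.

```python
import random

# Hypothetical stand-in for the library's action list (labels are illustrative).
ALL_ACTIONS = ["north", "south", "east", "west", "stay", "interact"]


def sample_action(action_probs, rng=random):
    """Draw one action; action_probs[i] is the probability of ALL_ACTIONS[i]."""
    if len(action_probs) != len(ALL_ACTIONS):
        raise ValueError("need exactly one probability per action")
    if abs(sum(action_probs) - 1.0) > 1e-9:
        raise ValueError("probabilities must sum to 1")
    # random.choices performs weighted sampling with replacement; k=1 gives one draw.
    return rng.choices(ALL_ACTIONS, weights=action_probs, k=1)[0]
```

Passing a vector with all mass on one index, such as `[0, 0, 0, 0, 1, 0]`, always returns the action at that index.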

Example #2

File: agent.py Project: ying-wen/overcooked_ai

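    # Excerpt from an agent class; assumes `import numpy as np` and
    # `from overcooked_ai_py.mdp.actions import Action`.
    def action(self, state):
        action_probs = np.zeros(Action.NUM_ACTIONS)
        # Motion actions are always legal; INTERACT is added only if enabled.
        legal_actions = list(Action.MOTION_ACTIONS)
        if self.interact:
            legal_actions.append(Action.INTERACT)
        legal_actions_indices = np.array([Action.ACTION_TO_INDEX[motion_a] for motion_a in legal_actions])
        # Uniform probability over the legal actions, zero elsewhere.
        action_probs[legal_actions_indices] = 1 / len(legal_actions_indices)
        return Action.sample(action_probs), {"action_probs": action_probs}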

Example #3

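    # Excerpt from an agent class; assumes `import numpy as np` and
    # `from overcooked_ai_py.mdp.actions import Action`.
    def action(self, state):
        action_probs = np.zeros(Action.NUM_ACTIONS)
        # Default to motion actions; use every action if all_actions is set.
        legal_actions = list(Action.MOTION_ACTIONS)
        if self.all_actions:
            legal_actions = Action.ALL_ACTIONS
        legal_actions_indices = np.array([Action.ACTION_TO_INDEX[motion_a] for motion_a in legal_actions])
        # Uniform probability over the legal actions, zero elsewhere.
        action_probs[legal_actions_indices] = 1 / len(legal_actions_indices)

        if self.custom_wait_prob is not None:
            stay = Action.STAY
            # With probability custom_wait_prob, return STAY directly.
            if np.random.random() < self.custom_wait_prob:
                return stay, {"action_probs": Agent.a_probs_from_action(stay)}
            else:
                # Otherwise exclude STAY and renormalize the remaining probabilities.
                action_probs = Action.remove_indices_and_renormalize(action_probs, [Action.ACTION_TO_INDEX[stay]])

        return Action.sample(action_probs), {"action_probs": action_probs}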
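
The example above excludes the STAY index with `Action.remove_indices_and_renormalize` before sampling. The sketch below shows one plausible reading of that step (zero the chosen entries, then rescale the rest to sum to 1) using plain lists. The helper name `remove_and_renormalize` is hypothetical, and the library's actual behavior may differ in details such as array handling.

```python
def remove_and_renormalize(probs, indices):
    """Return a copy of probs with the given indices zeroed and the rest rescaled."""
    out = list(probs)  # copy so the caller's distribution is left untouched
    for i in indices:
        out[i] = 0.0
    total = sum(out)
    if total == 0:
        raise ValueError("no probability mass left to renormalize")
    return [p / total for p in out]


# Illustrative input: uniform over six actions, with index 4 removed.
uniform = [1 / 6] * 6
without_index_4 = remove_and_renormalize(uniform, [4])
```

After the call, the removed index has probability 0 and the remaining five entries share the mass equally.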

Example #4

File: agent.py Project: zhanyon/overcooked_ai

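    # Excerpt from an agent class; assumes `import numpy as np` and
    # `from overcooked_ai_py.mdp.actions import Action`.
    def action(self, state):
        action_probs = np.zeros(Action.NUM_ACTIONS)
        # Sum the action distributions reported by each member agent.
        for agent in self.agents:
            action_probs += agent.action(state)[1]["action_probs"]
        # Average them, then sample from the mixed distribution.
        action_probs = action_probs / len(self.agents)
        return Action.sample(action_probs), {"action_probs": action_probs}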
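
The example above mixes several agents by averaging their action distributions element-wise before sampling. A minimal sketch of that averaging with plain lists follows; `average_distributions` is a hypothetical helper, not part of overcooked_ai_py.

```python
def average_distributions(distributions):
    """Element-wise mean of equally sized probability vectors."""
    if not distributions:
        raise ValueError("need at least one distribution")
    size = len(distributions[0])
    if any(len(d) != size for d in distributions):
        raise ValueError("all distributions must have the same length")
    # zip(*...) groups the i-th entry of every distribution together.
    return [sum(column) / len(distributions) for column in zip(*distributions)]


mixed = average_distributions([[0.5, 0.5], [1.0, 0.0]])
```

Because each input sums to 1, the average also sums to 1 and can be passed straight to a sampler.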