Python MCT_Node 예제들

프로그래밍 언어: Python

네임스페이스/패키지 이름: utils

클래스/타입: MCT_Node

hotexamples.com에서의 예제들: 2

Python MCT_Node - 2개의 예제가 발견되었습니다. 이것들은 오픈소스 프로젝트에서 추출된 Python의 utils.MCT_Node에 대한 실세계 최고 등급의 예제들입니다. 예제들을 평가하여 예제의 품질 향상에 도움을 줄 수 있습니다.

자주 사용되는 메소드들

보기 숨기기

MCT_Node(2)

자주 사용되는 메소드들

MCT_Node (2)

예제 #1

파일 보기

 def expand(n):
     """expand the leaf node by adding all its children states"""
     if not n.children and not game.terminal_test(n.state):
         n.children = {
             MCT_Node(state=game.result(n.state, action), parent=n): action
             for action in game.actions(n.state)
         }
     return select(n)

예제 #2

파일 보기

def monte_carlo_tree_search(state, game, N=1000):
    def select(n):
        """select a leaf node in the tree"""
        if n.children:
            return select(max(n.children.keys(), key=ucb))
        else:
            return n

    def expand(n):
        """expand the leaf node by adding all its children states"""
        if not n.children and not game.terminal_test(n.state):
            n.children = {
                MCT_Node(state=game.result(n.state, action), parent=n): action
                for action in game.actions(n.state)
            }
        return select(n)

    def simulate(game, state):
        """simulate the utility of current state by random picking a step"""
        player = game.to_move(state)
        while not game.terminal_test(state):
            action = random.choice(list(game.actions(state)))
            state = game.result(state, action)
        v = game.utility(state, player)
        return -v

    def backprop(n, utility):
        """passing the utility back to all parent nodes"""
        if utility > 0:
            n.U += utility
        # if utility == 0:
        #     n.U += 0.5
        n.N += 1
        if n.parent:
            backprop(n.parent, -utility)

    root = MCT_Node(state=state)

    for _ in range(N):
        leaf = select(root)
        child = expand(leaf)
        result = simulate(game, child.state)
        backprop(child, result)

    max_state = max(root.children, key=lambda p: p.N)

    return root.children.get(max_state)