Python move 예제들

프로그래밍 언어: Python

네임스페이스/패키지 이름: pong_environment_play_muscle

메소드/함수: move

hotexamples.com에서의 예제들: 2

Python move - 2개의 예제가 발견되었습니다. 이것들은 오픈소스 프로젝트에서 추출된 Python의 pong_environment_play_muscle.move에 대한 실세계 최고 등급의 예제들입니다. 예제들을 평가하여 예제의 품질 향상에 도움을 줄 수 있습니다.

예제 #1

파일 보기

파일: td_pong_play.py 프로젝트: CiNCFZJ/CiNC

def cum_softmax_direction_prop(state):
    # calculates the cumulated softmax propability for every possible action
    current_policy = policy[state['y'], state['x'], :]  # prop in this agent_pos
    softmax_prop = numpy.exp(current_policy)
    softmax_prop = softmax_prop / numpy.sum(softmax_prop)  # softmax: (e^prop) / (sum(e^prop))
    cum_softmax_prop = numpy.cumsum(softmax_prop)  # cumulating
    return (cum_softmax_prop)


def pick_action(state):
    cum_softmax_prop = cum_softmax_direction_prop(state)
    r = numpy.random.rand()
    for i in range(len(cum_softmax_prop)):
        if cum_softmax_prop[i] > r:
            return i


while True:
    possible_actions = env.get_possible_actions()
    
    direction = pick_action(state)
    	
    last_state = state.copy()
    
    	
    outcome = 0	
    state, outcome, in_end_pos = env.move(possible_actions[direction])
    	
    time.sleep(0.02)
    state = env.getState().copy()

예제 #2

파일 보기

파일: 4_pong_play_muscle.py 프로젝트: CiNCFZJ/CiNC

        plot(fig, ax, nest.GetStatus(sd_wta, keys='events')[0])

        max_rate = -1
        chosen_action = -1
        for i in range(len(sd_actions)):
            rate = len([e for e in nest.GetStatus([sd_actions[i]], keys='events')[0]['times'] if e > last_action_time]) # calc the "firerate" of each actor population
            if rate > max_rate:
                max_rate = rate # the population with the hightes rate wins
                chosen_action = i

        nest.SetStatus(stimulus, {'rate': 5000.})

        possible_actions = env.get_possible_actions() 

        new_position, outcome, in_end_position = env.move(possible_actions[chosen_action])

        nest.SetStatus(wta_noise, {'rate': 0.})
        for t in range(4):
            nest.Simulate(5)
            time.sleep(0.01)
        
              
        last_action_time += 60
        actions_executed += 1
    else:
        position = env.get_agent_pos().copy()        
        _, in_end_position = env.init_new_trial()
        nest.SetStatus(nest.GetConnections(stimulus, states[position['x']][position['y']]), {'weight': 0.})

rplt.from_device(sd_wta, title="WTA circuit")