Python MDP.get_handicapped Examples

Programming language: Python

Namespace/package name: MDP

Class/type: MDP

Method/function: get_handicapped

Examples at hotexamples.com: 2

Python MDP.get_handicapped - 2 examples found. These are top-rated real-world Python examples of MDP.MDP.get_handicapped, extracted from open source projects.

Frequently used methods

MDP(30)

getRewards(12)

probNextStates(12)

add_state(5)

get_action(5)

num_states(4)

get_action_list(4)

get_state_list(4)

__init__(3)

value_iteration(3)

reset(3)

get_parked(3)

get_Q_policy(3)

allStates(2)

num_actions(2)

numStates(2)

numActions(2)

initMDP(2)

get_available(2)

get_handicapped(2)

startState(2)

endStates(2)

gamma(2)

randomWalkSamples(1)

add_action(1)

valueIteration(1)

update_reward_only(1)

randomAction(1)

update_info(1)

representValues(1)

solve(1)

transform(1)

set_policy(1)

show(1)

train(1)

printAns(1)

take_action(1)

printResult(1)

buildMDP(1)

policyIteration(1)

apply_action_on_grid(1)

calc_rewards(1)

computePolicy(1)

environment(1)

getOptimalPolicy(1)

getOptimalValues(1)

build(1)

get_actions(1)

policyEvaluation(1)

get_reward(1)

Example #1

File: Simulator.py Project: hillst/CS533_proj5

def run_simulation(MDP, policy):
    # Step the MDP with the policy's actions until the agent has parked.
    print("Starting simulation for given MDP")

    while not MDP.get_parked():
        # The policy picks an action from the current time step alone.
        action = policy.choose_action(MDP.get_time())
        print("[TIME", MDP.get_time(), "]:", policy.get_name(), "chose action", action)
        MDP.take_action(action)
        print("[TIME", MDP.get_time(), "]: Moved to state", MDP.get_state(), "Current reward %.3f." % MDP.get_reward())
    # Report the final spot and whether it was handicapped and/or available.
    print("Exited in (spot, handicapped, available):", MDP.get_spot(), MDP.get_handicapped(), MDP.get_available())
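
The loop above depends on an MDP object and a policy whose classes are not shown on this page. A minimal sketch of what such objects might look like follows, assuming a parking-lot problem. `ToyParkingMDP`, `DriveThenParkPolicy`, the action names, and the reward values are all hypothetical and are not taken from hillst/CS533_proj5; only the method names (`get_parked`, `get_time`, `take_action`, and so on) come from the example.

```python
import random


class ToyParkingMDP:
    """Hypothetical stand-in exposing the methods the example calls."""

    def __init__(self, num_spots=5, seed=0):
        self.num_spots = num_spots
        self.rng = random.Random(seed)
        self.reset()

    def reset(self):
        # Start a fresh episode at spot 0 with no accumulated reward.
        self.time = 0
        self.spot = 0
        self.parked = False
        self.reward = 0.0
        self._roll_spot()

    def _roll_spot(self):
        # Assumption: spot 0 is the handicapped spot; occupancy is random.
        self.handicapped = self.spot == 0
        self.available = self.rng.random() < 0.5

    def take_action(self, action):
        self.time += 1
        if action == "PARK":
            self.parked = True
            if not self.available:
                self.reward -= 10.0  # made-up penalty for an occupied spot
            elif self.handicapped:
                self.reward -= 5.0   # made-up fine for a handicapped spot
            else:
                self.reward += 1.0
        else:
            # "DRIVE": move on to the next spot at a small cost.
            self.spot = (self.spot + 1) % self.num_spots
            self._roll_spot()
            self.reward -= 0.1

    def get_parked(self):
        return self.parked

    def get_time(self):
        return self.time

    def get_state(self):
        return (self.spot, self.handicapped, self.available, self.parked)

    def get_reward(self):
        return self.reward

    def get_spot(self):
        return self.spot

    def get_handicapped(self):
        return self.handicapped

    def get_available(self):
        return self.available


class DriveThenParkPolicy:
    """Hypothetical time-based policy: drive a fixed number of steps, then park."""

    def __init__(self, drive_steps=2):
        self.drive_steps = drive_steps

    def choose_action(self, time):
        return "DRIVE" if time < self.drive_steps else "PARK"

    def get_name(self):
        return "DriveThenPark"


mdp = ToyParkingMDP(seed=1)
policy = DriveThenParkPolicy(drive_steps=2)
while not mdp.get_parked():
    mdp.take_action(policy.choose_action(mdp.get_time()))
print("Exited in (spot, handicapped, available):",
      mdp.get_spot(), mdp.get_handicapped(), mdp.get_available())
```

In this sketch `get_handicapped()` simply reports whether the current spot is the handicapped one, which is one plausible reading of how the example uses it after the episode ends.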

Example #2

File: Simulator.py Project: hillst/CS533_proj4

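def evaluate_policies(policy, MDP):
    # Run many episodes and tally reward, handicapped parkings, and crashes.
    # run_simulation is assumed to be defined elsewhere in Simulator.py.
    total_reward, handicapped, crashed = 0, 0, 0
    num_sims = 10000
    for i in range(num_sims):
        run_simulation(MDP, policy)
        # maybe do something fancier
        total_reward += MDP.get_reward()
        if MDP.get_handicapped():
            handicapped += 1
        if not MDP.get_available():
            # Ending in an unavailable spot is counted as a crash.
            crashed += 1
        MDP.reset()
    # Print the policy name, mean reward per simulation, and the two counts.
    print(policy.get_name(), total_reward / num_sims, handicapped, crashed)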
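
The counters kept in `evaluate_policies` can also be returned as rates instead of being printed. The sketch below assumes each finished episode has already been reduced to a `(reward, handicapped, available)` tuple, read from `get_reward()`, `get_handicapped()`, and `get_available()` before `reset()` is called. `summarize_episodes` and the sample tuples are hypothetical and purely illustrative; they are not results from hillst/CS533_proj4.

```python
def summarize_episodes(episodes):
    """Aggregate (reward, handicapped, available) tuples from finished episodes.

    Hypothetical helper that mirrors the counters in evaluate_policies.
    """
    episodes = list(episodes)
    if not episodes:
        raise ValueError("need at least one episode")
    n = len(episodes)
    total_reward = sum(reward for reward, _, _ in episodes)
    handicapped = sum(1 for _, was_handicapped, _ in episodes if was_handicapped)
    crashed = sum(1 for _, _, was_available in episodes if not was_available)
    return {
        "mean_reward": total_reward / n,
        "handicapped_rate": handicapped / n,
        "crash_rate": crashed / n,
    }


# Made-up episode outcomes, for illustration only.
sample = [
    (1.0, False, True),
    (-5.0, True, True),
    (-10.0, False, False),
    (1.0, False, True),
]
stats = summarize_episodes(sample)
print(stats)
```

Returning a dictionary rather than printing makes it easier to compare several policies side by side, at the cost of collecting the per-episode tuples first.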