Example #1
import numpy as np
import matplotlib.pyplot as plt

# PuddleWorld, PuddleReward, PuddleRewardLFA, PWTransition, PuddleWorldMDP and
# PolicyIteration are assumed to be provided by the MDP library this example
# targets; import them from its domain and planner modules.


def main():

    with PuddleWorld(start=(0.5, 0.1), resolution=0.05) as world:
        # R = PuddleReward(rmax=1.0, step_reward=0.1)
        R = PuddleRewardLFA(weights=[1, -1], rmax=1.0)  # linear-feature reward
        T = PWTransition()  # puddle-world transition model
        mdp = PuddleWorldMDP(reward=R, transition=T, discount=0.98)

        # Solve the MDP with policy iteration.
        mdp_planner = PolicyIteration()
        res = mdp_planner.solve(mdp)
        V = res['V']
        print(V)
        print(res['pi'])

    fig = plt.figure(figsize=(8, 8))
    ax = fig.gca()
    ax = world.visualize(ax, policy=res['pi'])
    # plt.savefig('world.svg')

    plt.figure(figsize=(8, 8))
    plt.imshow(V.reshape(world.shape).T,  # interpolation='nearest',
               cmap='viridis', origin='lower',
               vmin=np.min(V), vmax=np.max(V))
    plt.grid(False)
    plt.title('Value function')
    plt.colorbar(orientation='horizontal')
    # plt.savefig('world_value.svg')

    plt.show()


if __name__ == '__main__':
    main()
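The example above reads 'V' and 'pi' out of the result dict returned by PolicyIteration().solve(). As a rough reference for what such a planner computes, here is a minimal tabular policy-iteration sketch; the policy_iteration name, the (A, S, S) transition tensor and the state-based reward vector are illustrative assumptions, not the library's actual interface.

import numpy as np


def policy_iteration(P, R, discount):
    """Minimal tabular policy iteration (illustrative sketch).

    P: transition tensor of shape (A, S, S), P[a, s, s'] = Pr(s' | s, a).
    R: state reward vector of shape (S,).
    Returns a dict with 'V' and 'pi', mirroring the result dict used above.
    """
    n_actions, n_states, _ = P.shape
    pi = np.zeros(n_states, dtype=int)      # start from an arbitrary policy
    while True:
        # Policy evaluation: solve (I - discount * P_pi) V = R exactly.
        P_pi = P[pi, np.arange(n_states), :]
        V = np.linalg.solve(np.eye(n_states) - discount * P_pi, R)
        # Policy improvement: act greedily with respect to V.
        Q = R[None, :] + discount * (P @ V)  # Q has shape (A, S)
        new_pi = Q.argmax(axis=0)
        if np.array_equal(new_pi, pi):       # policy is stable -> done
            return {'V': V, 'pi': pi}
        pi = new_pi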
Example #2
import matplotlib.pyplot as plt

# ChainWorld, ChainReward, ChainTransition, ChainMDP and PolicyIteration are
# assumed to come from the same MDP library as Example #1.


def main():
    NUM_STATES = 10

    with ChainWorld(num_states=NUM_STATES) as world:
        R = ChainReward()
        T = ChainTransition()
        mdp = ChainMDP(R, T, discount=0.98)

        # Solve the chain MDP with policy iteration.
        planner = PolicyIteration()
        plan = planner.solve(mdp)

        print(plan['pi'])

    fig = plt.figure(figsize=(12, 3))
    ax = fig.gca()
    ax = world.visualize(ax)
    ax = world.show_policy(ax, policy=plan['pi'])

    plt.figure(figsize=(8, 8))
    plt.plot(plan['V'])
    plt.title('Value function')

    plt.show()


if __name__ == '__main__':
    main()
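For a rough sense of the structure ChainWorld encodes, here is a hypothetical hand-rolled chain MDP run through the policy_iteration sketch shown after Example #1: NUM_STATES states in a row, left/right actions, and reward only in the rightmost state. The library's ChainWorld, ChainReward and ChainTransition may differ in their exact dynamics.

import numpy as np

NUM_STATES = 10

R = np.zeros(NUM_STATES)
R[-1] = 1.0                                   # reward only in the rightmost state

# P[a, s, s']: action 0 steps left, action 1 steps right, clipped at the ends.
P = np.zeros((2, NUM_STATES, NUM_STATES))
for s in range(NUM_STATES):
    P[0, s, max(s - 1, 0)] = 1.0
    P[1, s, min(s + 1, NUM_STATES - 1)] = 1.0

plan = policy_iteration(P, R, discount=0.98)
print(plan['pi'])                             # greedy policy: "right" in every state
print(plan['V'])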