Python Agent.show_statevalue_function Examples

Programming language: Python

Namespace/Package: Agent.agent

Class/Type: Agent

Method/Function: show_statevalue_function

Examples at hotexamples.com: 4

Python Agent.show_statevalue_function - 4 examples found. These are the top-rated real-world examples of Agent.agent.Agent.show_statevalue_function extracted from open source projects. You can rate examples to help us improve their quality.

Frequently used methods

Agent(14)

plot_state(3)

MC_control(2)

TD_control(2)

show_statevalue_function(2)

TD_control_linear(1)

TD_control_linear_app(1)

store_Qvalue_function(1)

Example #1

File: testing.py Project: ruixu93/easy21

import pickle
import matplotlib.pyplot as plt  # assumed: the excerpt only shows the alias plt

# Environment and Agent come from the easy21 project; their imports are not part of the excerpt.

def test_linear_sarsa(iterations=1000, mlambda=None, n0=100, avg_it=100):
    print("\n-------------------")
    print("TD control Sarsa, with Linear function approximation")
    print("run for n. iterations: " + str(iterations))
    print("plot graph mse vs episodes for lambda equal 0 and lambda equal 1")
    print("list (std output) win percentage for values of lambda 0, 0.1, 0.2, ..., 0.9, 1")
    # reference Q table from a long Monte Carlo run (loaded but not used further in this excerpt)
    with open("Data/Qval_func_1000000_MC_control.pkl", "rb") as f:
        monte_carlo_Q = pickle.load(f)
    n_elements = monte_carlo_Q.shape[0] * monte_carlo_Q.shape[1] * 2
    mse = []
    if not isinstance(mlambda, list):
        # if no value is passed for lambda, default 0.5
        l = 0.5 if mlambda is None else mlambda
        # learn
        game = Environment()
        agent = Agent(game, n0)
        agent.TD_control_linear(iterations, l, avg_it)
        agent.show_statevalue_function()
    else:
        # test each value of lambda
        for l in mlambda:
            game = Environment()
            agent = Agent(game, n0)
            l_mse = agent.TD_control_linear(iterations, l, avg_it)
            mse.append(l_mse)
        plt.plot(mlambda, mse)
        plt.ylabel('mse')
        plt.show()
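The example loads a Monte Carlo Q table and derives n_elements from its first two dimensions times 2, which suggests the MSE is taken over every state-action entry. The comparison itself is not shown on this page, so the following is only a minimal sketch of one way to do it. The helper name q_table_mse and the nested-list layout (dealer card, player sum, action) are assumptions for illustration, not taken from the project.

```python
def q_table_mse(q_estimate, q_reference):
    """Mean squared error over all state-action entries (hypothetical helper)."""
    total = 0.0
    count = 0
    for est_row, ref_row in zip(q_estimate, q_reference):
        for est_actions, ref_actions in zip(est_row, ref_row):
            for est, ref in zip(est_actions, ref_actions):
                total += (est - ref) ** 2
                count += 1
    if count == 0:
        raise ValueError("Q tables must not be empty")
    return total / count


# Made-up tables: 2 dealer cards x 2 player sums x 2 actions
q_reference = [[[0.0, 0.0], [0.0, 0.0]], [[0.0, 0.0], [0.0, 0.0]]]
q_estimate = [[[0.5, 0.5], [0.5, 0.5]], [[0.5, 0.5], [0.5, 0.5]]]
print(q_table_mse(q_estimate, q_reference))  # 0.25
```

If the tables are NumPy arrays, as the use of .shape in the example suggests, the same quantity can be computed with array arithmetic instead of explicit loops.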

Example #2

import pickle
import matplotlib.pyplot as plt  # assumed: the excerpt only shows the alias plt

# Environment and Agent come from the surrounding project; their imports are not part of the excerpt.

def test_linear_sarsa(iterations=1000, mlambda=None, n0=100, avg_it=100):
    print("\n-------------------")
    print("TD control Sarsa, with Linear function approximation")
    print("run for n. iterations: " + str(iterations))
    print("plot graph mse vs episodes for lambda equal 0 and lambda equal 1")
    print("list (std output) win percentage for values of lambda 0, 0.1, 0.2, ..., 0.9, 1")
    # reference Q table from a long Monte Carlo run (loaded but not used further in this excerpt)
    with open("Data/Qval_func_1000000_MC_control.pkl", "rb") as f:
        monte_carlo_Q = pickle.load(f)
    n_elements = monte_carlo_Q.shape[0] * monte_carlo_Q.shape[1] * 2
    mse = []
    if not isinstance(mlambda, list):
        # if no value is passed for lambda, default 0.5
        l = 0.5 if mlambda is None else mlambda
        # learn
        game = Environment()
        agent = Agent(game, n0)
        agent.TD_control_linear(iterations, l, avg_it)
        agent.show_statevalue_function()
    else:
        # test each value of lambda
        for l in mlambda:
            game = Environment()
            agent = Agent(game, n0)
            l_mse = agent.TD_control_linear(iterations, l, avg_it)
            mse.append(l_mse)
        plt.plot(mlambda, mse)
        plt.ylabel('mse')
        plt.show()

Example #3

File: testing.py Project: ruixu93/easy21

# Environment and Agent come from the easy21 project; their imports are not part of the excerpt.

def test_monte_carlo(iterations=1000000, n0=100):
    print("\n-------------------")
    print("Monte Carlo control")
    print("run for n. iterations: " + str(iterations))
    print("win percentage: ")
    # learn
    game = Environment()
    agent = Agent(game, n0)
    agent.MC_control(iterations)
    # plot and store
    agent.show_statevalue_function()
    agent.store_Qvalue_function()
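The body of show_statevalue_function is not shown on this page. A common way to get a state-value function from a learned action-value table is to take the best action value in each state, V(s) = max_a Q(s, a). The sketch below illustrates only that step. The function name state_values and the nested-list layout are hypothetical, and the project's own method may compute or plot the values differently.

```python
def state_values(q_table):
    """Return V[d][p] = max over actions a of q_table[d][p][a] (hypothetical helper)."""
    return [[max(action_values) for action_values in row] for row in q_table]


# Made-up table: 2 dealer cards x 2 player sums x 2 actions
q = [
    [[0.1, -0.3], [0.0, 0.4]],
    [[-0.5, -0.2], [0.7, 0.6]],
]
print(state_values(q))  # [[0.1, 0.4], [-0.2, 0.7]]
```

The resulting grid, indexed by dealer card and player sum, is the kind of data that would typically be drawn as a surface or heat map.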

Example #4

# Environment and Agent come from the surrounding project; their imports are not part of the excerpt.

def test_monte_carlo(iterations=1000000, n0=100):
    print("\n-------------------")
    print("Monte Carlo control")
    print("run for n. iterations: " + str(iterations))
    print("win percentage: ")
    # learn
    game = Environment()
    agent = Agent(game, n0)
    agent.MC_control(iterations)
    # plot and store
    agent.show_statevalue_function()
    agent.store_Qvalue_function()