def policy_found(q, steps):
    from rl.environment import Environment
    from rl.agent import Agent
    from rl.stateaction import StateAction

    environment = Environment()
    agent = Agent(environment, Util.get_state_actions, q, 1, 1)

    # Follow the greedy policy for at most (rows + cols) steps and record the
    # visited state/action pairs in `steps`.
    maxStepsAllowed = Util.num_cols + Util.num_rows
    stepsToGoal = 0
    while stepsToGoal < maxStepsAllowed:
        stepsToGoal += 1
        prevState = agent.get_state()
        agent.test()
        action = agent.get_action()
        if prevState != Util.MIN_VALUE:
            steps.append(StateAction(prevState, action))
        if agent.get_state() == Util.get_goal_state():
            return True
        if agent.terminal:
            return False
    return agent.get_state() == Util.get_goal_state()
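For context, a minimal usage sketch of policy_found after a round of Q-learning, checking whether the greedy policy reaches the goal within the step limit. The StateAction attribute names (state, action) and the print statements are illustrative assumptions, not taken from the original code.

# Usage sketch: assumes `q` is the Q-table produced by the training loop and
# that StateAction exposes `state` and `action` attributes (an assumption).
steps = []
if policy_found(q, steps):
    print("Greedy policy reached the goal in {} steps".format(len(steps)))
    for sa in steps:
        print(sa.state, sa.action)
else:
    print("Policy not yet converged: goal not reached within the step limit")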
               processor=processor, nb_steps_warmup=50000, gamma=.99,
               target_model_update=10000, train_interval=4, delta_clip=1.)
dqn.compile(Adam(lr=.00025), metrics=['mae'])  # learning rate

#=== TRAIN ===#
if args.mode == 'train':
    checkpoint_weights_filename = 'weights_{step}.h5f'
    log_filename = 'dqn_log.json'
    callbacks = [ModelIntervalCheckpoint(checkpoint_weights_filename, interval=250000)]
    callbacks += [FileLogger(log_filename, interval=100)]
    dqn.fit(env, callbacks=callbacks, nb_steps=1750000, log_interval=10000)
    # After training is done, save the final weights.
    dqn.save_weights('final_weights.h5f', overwrite=True)

#=== TEST ===#
elif args.mode == 'test':
    dqn.load_weights('trained_data/final_weights.h5f')
    dqn.test(env, nb_episodes=10, visualize=True)
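The `processor` passed to the agent above is defined outside this excerpt. As a point of reference, keras-rl's Atari DQN example uses a Processor subclass along the following lines; treat this as an assumed stand-in for the preprocessing, not the exact class used here.

from PIL import Image
import numpy as np
from rl.core import Processor

INPUT_SHAPE = (84, 84)  # assumed frame size, as in keras-rl's dqn_atari example

class AtariProcessor(Processor):
    def process_observation(self, observation):
        # Resize the raw RGB frame and convert it to grayscale.
        img = Image.fromarray(observation)
        img = img.resize(INPUT_SHAPE).convert('L')
        return np.array(img).astype('uint8')

    def process_state_batch(self, batch):
        # Frames are stored as uint8 to save memory; rescale to [0, 1] at train time.
        return batch.astype('float32') / 255.

    def process_reward(self, reward):
        # Clip rewards to [-1, 1] for training stability.
        return np.clip(reward, -1., 1.)

processor = AtariProcessor()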