Example no. 1
    time_of_start = time.time()
    # set the terminal title so that it is clear what this terminal is running
    print('\33]0;{}\a'.format(' '.join(sys.argv)), end='', flush=True)
    print(args)

    # compile the simulation module in C
    check_C_module_and_compile()

    # set the replay memory
    capacity = round(args.size_of_replay_memory * controls_per_half_period *
                     t_max) if args.train else 1
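    # when not training, only a minimal one-slot memory is allocated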
    memory = RL.Memory(capacity=capacity,
                       data_size=(data_size * 2 + 2 if args.input != 'measurements'
                                  else (read_control_step_length + read_length)
                                  + read_length // read_control_step_length + 1 + 2),
                       policy='random',
                       passes_before_random=0.2)
    # define the neural network
    net = (RL.direct_DQN(data_size).cuda()
           if args.input != 'measurements' else RL.DQN_measurement(read_length))
    # set the task
    if args.train or args.LQG:
        train = RL.TrainDQN(net,
                            memory,
                            batch_size=args.batch_size,
                            gamma=0.99,
                            backup_period=args.target_network_update_interval,
                            args=args)
        del net
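        # drop the local reference to the network, which was handed to the trainer above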
        # the main function of training
        if args.train:
            main = Main_System(train, num_of_processes=args.num_of_actors)
            main(num_of_episodes)
        # when we do not train but instead test the LQG control strategy
        elif args.LQG:

Example no. 2

    # set the terminal title so that it is clear what this terminal is running
    print('\33]0;{}\a'.format(' '.join(sys.argv)), end='', flush=True)
    print(args)

    # compile the simulation module in C
    check_C_module_and_compile()

    # set the replay memory
    capacity = round(args.size_of_replay_memory * controls_per_unit_time *
                     t_max) if args.train else 1
    memory = RL.Memory(capacity=capacity,
                       data_size=data_size * 2 + 2,
                       policy='random',
                       passes_before_random=0.2)
    # define the neural network
    net = RL.direct_DQN(data_size).cuda()
    # set the task
    if args.train or args.control_strategy != 'DQN':
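        # the TrainDQN task object is constructed both for training and for testing the analytic strategies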
        train = RL.TrainDQN(net,
                            memory,
                            batch_size=args.batch_size,
                            gamma=0.99,
                            backup_period=args.target_network_update_interval,
                            args=args)
        del net
        # the main function of training
        if args.train:
            main = Main_System(train, num_of_processes=args.num_of_actors)
            main(num_of_episodes)
        # when we do not train but instead test the analytic control strategies
        elif args.control_strategy != 'DQN':