        lstm_layer = agent.model.layers[0]
        # Store lstm states
        state_record = lstm_layer.states
        # Reset states
        agent.model.layers[0].reset_states()
        agent.target_model.layers[0].reset_states()
        agent.train_replay()
        # Restore states
        agent.model.layers[0].states = state_record

        score += reward
        state = next_state

        if done:
            sim.reset()
            break

    # at the end of every episode, update the target model to match the model
    agent.update_target_model()
    # every episode, record the score (plotting is disabled below)
    scores.append(score)
    episodes.append(e)
    #plt.plot(episodes, scores, 'b')

'''
Plot normalized data
'''
if False:
    try:
        t = np.arange(len(data))
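# Hedged sketch, not part of the original script: the store/reset/restore of the
# stateful LSTM above keeps *references* to the layer's state variables, and,
# depending on the Keras version, reset_states() may overwrite those same
# variables in place, so the later restore can end up a no-op. Copying the state
# values explicitly avoids that. This assumes a TensorFlow backend, that layer 0
# of both models is the stateful LSTM, and reuses agent.train_replay() from
# above; the helper name below is illustrative.
from tensorflow.keras import backend as K

def train_replay_keeping_lstm_state(agent):
    lstm = agent.model.layers[0]
    saved = [K.get_value(s) for s in lstm.states]   # snapshot the h/c state values
    lstm.reset_states()                             # clear state before fitting on replay samples
    agent.target_model.layers[0].reset_states()
    agent.train_replay()                            # train on the replay memory
    for var, val in zip(lstm.states, saved):        # write the live episode state back
        K.set_value(var, val)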
agent.load_state()

# Perturb the weights of the second and third linear layers in place with uniform noise
perturb = torch.from_numpy(np.random.rand(10, 10) / 1)
agent.model.state_dict()['linear2.weight'] += perturb.float()
perturb2 = torch.from_numpy(np.random.rand(4, 10) / 1)
agent.model.state_dict()['linear3.weight'] += perturb2.float()
print(agent.model.state_dict())

'''
losses, scores, episodes = [], [], []
sim = Simulator(orig_data, data, windowsize=windowsize)

for e in range(EPISODES):
    # Write actions to log file
    score = 0
    state = Tensor(sim.reset())

    while not sim.sim_done():
        #state = Tensor(sim.state)
        # Get the action for the current state
        action = agent.get_action(state)

        # Simulate trading
        #-----------
        max_idx = np.argmax(action[:3])   # Choose buy/sell/hold
        next_state, reward, done = sim.step(max_idx, action[3])
        next_state = Tensor(next_state)
        #-----------

        # save the sample <s, a, r, s'> to the replay memory
        agent.replay_memory(state, action, reward, next_state, done)
        state = next_state.clone()