a = randaction
# print(" - selecting generated optimal policy ", a)
# Clip each action component into the valid (-1, 1) range:
# for i in range(len(a)):
#     if a[i] < -1: a[i] = -0.99999999999
#     if a[i] > 1:  a[i] = 0.99999999999

# Log and render every 50 steps
if step % 50 == 0:
    print("a =>", a)
    env.render()
    env.refresh(render=True)

# Concatenate state and action into a single input row
qs_a = np.concatenate((qs, a), axis=0)

# Get the target state and reward
s, r, done, info = env.step(a)

# Record only the first x number of states
# if done and step < max_steps - 3:
#     r = -50

if step == 0:
    gameSA[0] = qs_a
    gameS[0] = qs
    gameR[0] = np.array([r])
    gameA[0] = a  # record the action itself, not the reward
    gameW[0] = np.array([0.000000005])
else:
    gameSA = np.vstack((gameSA, qs_a))
    gameS = np.vstack((gameS, qs))
    gameR = np.vstack((gameR, np.array([r])))
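The buffer above pairs each concatenated state-action row (gameSA) with the reward observed for it (gameR), and gameW looks like a per-sample fit weight. A minimal sketch of how such a buffer could train a reward model, assuming a Keras regressor; the network shape and every hyperparameter below are illustrative assumptions, not taken from the original:

from tensorflow import keras

# Hypothetical reward model: maps a (state, action) vector to the
# reward observed after taking that action in that state.
model = keras.Sequential([
    keras.layers.Dense(64, activation="relu", input_shape=(gameSA.shape[1],)),
    keras.layers.Dense(64, activation="relu"),
    keras.layers.Dense(1),
])
model.compile(optimizer="adam", loss="mse")

# Treat gameW as per-sample weights, so rows seeded with the tiny
# initial weight contribute almost nothing to the loss.
model.fit(gameSA, gameR, sample_weight=gameW.ravel(), epochs=10, verbose=0)

# Score a candidate row built the same way as qs_a above:
r_hat = model.predict(qs_a.reshape(1, -1))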
    initial_epsilon=0.8
)

observation = env.reset()
left_or_right_barge_movement = np.random.randint(0, 2)
epsilon = 0.05

for episode in range(EPISODES):
    while True:
        # 1. Choose an action based on the current observation
        action = PG.choose_action(observation)

        # 2. Take the action in the environment
        observation_, reward, done, info = env.step(action)

        # 3. Store the transition for training
        # if reward > -0.20:
        PG.store_transition(observation, action, reward)

        if RENDER_ENV:
            # -------------------------------------
            # Optional render
            env.render()
            # Draw the target
            env.draw_marker(env.landing_coordinates[0], env.landing_coordinates[1])
            # Refresh render
            env.refresh(render=False)

        # When should the barge move? Water movement, dynamics etc. can be simulated here.
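PG's interface (choose_action, store_transition) is defined elsewhere in the project, and at episode end the loop would typically check done, run a learning step, and break before the next episode. A minimal sketch of that assumed interface with the standard discounted-return computation; this is illustrative only, not the project's actual PolicyGradient class:

import numpy as np

class PolicyGradientSketch:
    """Hypothetical stand-in for the PolicyGradient agent used above."""

    def __init__(self, n_actions, gamma=0.99):
        self.n_actions = n_actions
        self.gamma = gamma  # reward discount factor
        self.ep_obs, self.ep_as, self.ep_rs = [], [], []

    def choose_action(self, observation):
        # Placeholder: uniform random. The real agent samples from the
        # action distribution produced by its policy network.
        return np.random.randint(self.n_actions)

    def store_transition(self, s, a, r):
        # Buffer one step; learn() consumes and clears the buffers.
        self.ep_obs.append(s)
        self.ep_as.append(a)
        self.ep_rs.append(r)

    def learn(self):
        # Discounted, normalized returns: the usual policy-gradient
        # target. A real implementation would also take a gradient step.
        returns = np.zeros(len(self.ep_rs))
        running = 0.0
        for t in reversed(range(len(self.ep_rs))):
            running = self.ep_rs[t] + self.gamma * running
            returns[t] = running
        returns = (returns - returns.mean()) / (returns.std() + 1e-8)
        self.ep_obs, self.ep_as, self.ep_rs = [], [], []
        return returns

The barge-movement comment at the end of the loop marks where the same step could also nudge the barge at random, e.g. with probability epsilon in the left_or_right_barge_movement direction drawn above.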