Python TimeStepBatch.from_episode_batch 예제들

프로그래밍 언어: Python

네임스페이스/패키지 이름: garage

클래스/타입: TimeStepBatch

메소드/함수: from_episode_batch

hotexamples.com에서의 예제들: 2

Python TimeStepBatch.from_episode_batch - 2개의 예제가 발견되었습니다. 이것들은 오픈소스 프로젝트에서 추출된 Python의 garage.TimeStepBatch.from_episode_batch에 대한 실세계 최고 등급의 예제들입니다. 예제들을 평가하여 예제의 품질 향상에 도움을 줄 수 있습니다.

자주 사용되는 메소드들

보기 숨기기

TimeStepBatch(29)

concatenate(4)

from_episode_batch(2)

from_time_step_list(2)

from_trajectory_batch(2)

split(1)

to_time_step_list(1)

예제 #1

파일 보기

def expert_source(env, goal, max_episode_length, n_eps):
    expert = OptimalPolicy(env.spec, goal=goal)
    workers = WorkerFactory(seed=100, max_episode_length=max_episode_length)
    expert_sampler = LocalSampler.from_worker_factory(workers, expert, env)
    for _ in range(n_eps):
        eps_batch = expert_sampler.obtain_samples(0, max_episode_length, None)
        yield TimeStepBatch.from_episode_batch(eps_batch)

예제 #2

파일 보기

def test_time_step_batch_from_episode_batch(eps_data):
    eps = EpisodeBatch(**eps_data)
    timestep_batch = TimeStepBatch.from_episode_batch(eps)
    assert (timestep_batch.observations == eps.observations).all()
    assert (timestep_batch.next_observations[:eps.lengths[0] - 1] ==
            eps.observations[1:eps.lengths[0]]).all()
    assert (timestep_batch.next_observations[eps.lengths[0]] ==
            eps.last_observations[0]).all()