Python construct_episodesの例

プログラミング言語: Python

名前空間/パッケージ名: alpacka.testing

メソッド/関数: construct_episodes

hotexamples.comのコード掲載数: 3

Python construct_episodes - 3件のコード例が見つかりました。すべてオープンソースプロジェクトから抽出されたPythonのalpacka.testing.construct_episodesの実例で、最も評価が高いものを厳選しています。コード例の評価を行っていただくことで、より質の高いコード例が表示されるようになります。

コード例 #1

ファイルを表示

def mock_bstep():
    """Mock batch stepper class with fixed run_episode_batch return."""
    bstep_cls = mock.create_autospec(batch_steppers.LocalBatchStepper)
    bstep_cls.return_value.run_episode_batch.return_value = (
        testing.construct_episodes(
            actions=[
                [0, 2], [0, 2], [0, 2],  # Three first episodes action 0
                [1, 2], [1, 2], [1, 2],  # Three last episodes action 1
            ],
            rewards=[
                [0, 1], [0, 1], [0, 1],  # Higher mean return, action 0
                [0, 0], [0, 0], [0, 2],  # Higher max return, action 1
            ], truncated=True))
    return bstep_cls

コード例 #2

ファイルを表示

def test_bootstrap_return_with_quality_estimator(truncated, x_return):
    # Set up
    episode = testing.construct_episodes(actions=[
        [0, 1, 2, 3],
    ],
                                         rewards=[[0, 2, 0, -1]],
                                         truncated=truncated)[0]
    logits = (np.array([[7, 3, 4]]), None)

    # Run
    bootstrap_return = testing.run_with_constant_network_prediction(
        shooting.bootstrap_return_with_quality(episode), logits=logits)

    # Test
    assert bootstrap_return == x_return

コード例 #3

ファイルを表示

def test_bootstrap_return_with_value_estimator(truncated, x_return):
    # Set up
    episode = testing.construct_episodes(
        actions=[
            [0, 1, 2, 3],
        ],
        rewards=[
            [0, 2, 0, -1]
        ],
        truncated=truncated
    )[0]
    logits = (np.array([[7]]), None)

    # Run
    bootstrap_return = testing.run_with_constant_network_prediction(
        mc_simulation.bootstrap_return_with_value([episode]),
        logits=logits
    )[0]

    # Test
    assert bootstrap_return == x_return