Python MDP.solve_reach示例

编程语言: Python

命名空间/包名称: best.mdp

类/类型: MDP

方法/功能: solve_reach

hotexamples.com的示例: 2

Python MDP.solve_reach - 已找到2个示例。这些是从开源项目中提取的最受好评的best.mdp.MDP.solve_reach现实Python示例。您可以评价示例，以帮助我们提高示例质量。

常用方法

显示隐藏

MDP(17)

product(5)

solve_reach(2)

solve_reach_constrained(2)

T(1)

prune(1)

示例#1

显示文件

def test_reach():
    T0 = np.array([[0.5, 0.25, 0.25], [0, 1, 0], [0, 0, 1]])
    mdp = MDP([T0])

    V, _ = mdp.solve_reach(accept=lambda y: y == 2)

    np.testing.assert_almost_equal(V[0], [0.5, 0, 1], decimal=4)

示例#2

显示文件

def test_reach_finitetime():

    T0 = np.array([[0.9, 0, 0.1], [0, 1, 0], [0, 0, 1]])
    T1 = np.array([[0, 0.5, 0.5], [0, 1, 0], [0, 0, 1]])

    mdp = MDP([T0, T1])

    accept = lambda n: n == 2

    vlist, plist = mdp.solve_reach(accept, horizon=3)

    np.testing.assert_almost_equal(vlist[0][0], 0.1 + 0.9 * 0.1 + 0.9**2 * 0.5)
    np.testing.assert_almost_equal(vlist[1][0], 0.1 + 0.9 * 0.5)
    np.testing.assert_almost_equal(vlist[2][0], 0.5)

    np.testing.assert_almost_equal(plist[0][0], 0)
    np.testing.assert_almost_equal(plist[1][0], 0)
    np.testing.assert_almost_equal(plist[2][0], 1)