Ejemplos de FiftyChain.p_action_failure en Python

Lenguaje de programación: Python

Namespace/Package Name: rlpy.Domains

Clase / Tipo: FiftyChain

Método / Función: p_action_failure

Ejemplos en hotexamples.com: 2

Python FiftyChain.p_action_failure - 2 ejemplos encontrados. Estos son los ejemplos en Python del mundo real mejor valorados de rlpy.Domains.FiftyChain.p_action_failure extraídos de proyectos de código abierto. Puedes valorar ejemplos para ayudarnos a mejorar la calidad de los ejemplos.

Métodos usados con frecuencia

Mostrar Ocultar

FiftyChain(4)

p_action_failure(1)

s0(1)

state(1)

Ejemplo n.º 1

Mostrar archivo

def test_transitions():
    """
    Ensure that actions result in expected state transition behavior.
    Note that if the agent attempts to leave the edge
    (select LEFT from s0 or RIGHT from s49) then the state should not change.
    NOTE: assume p_action_failure is only noise term.

    """
    # [[initialize domain]]
    domain = FiftyChain()
    domain.p_action_failure = 0.0  # eliminate stochasticity
    dummyS = domain.s0()
    domain.state = 2  # state s2
    left = domain.LEFT
    right = domain.RIGHT
    goals = domain.GOAL_STATES

    # Check basic step
    r, ns, terminal, possibleA = domain.step(left)
    assert ns == 1 and terminal == False
    assert np.all(possibleA == np.array([left,
                                         right]))  # all actions available
    if ns in goals: assert r > 0
    else: assert r <= 0

    # Check another basic step
    r, ns, terminal, possibleA = domain.step(left)
    assert ns == 0 and terminal == False
    assert np.all(possibleA == np.array([left,
                                         right]))  # all actions available
    if ns in goals: assert r > 0
    else: assert r <= 0

    # Ensure state does not change or wrap around and that all actions
    # remain availableon corner case, per domain spec
    r, ns, terminal, possibleA = domain.step(left)
    assert ns == 0 and terminal == False
    assert np.all(possibleA == np.array([left,
                                         right]))  # all actions available
    if ns in goals: assert r > 0
    else: assert r <= 0

    # A final basic step
    r, ns, terminal, possibleA = domain.step(right)
    assert ns == 1 and terminal == False
    assert np.all(possibleA == np.array([left,
                                         right]))  # all actions available
    if ns in goals: assert r > 0
    else: assert r <= 0

Ejemplo n.º 2

Mostrar archivo

Archivo: test_FiftyChain.py Proyecto: MLDL/rlpy

def test_transitions():
    """
    Ensure that actions result in expected state transition behavior.
    Note that if the agent attempts to leave the edge
    (select LEFT from s0 or RIGHT from s49) then the state should not change.
    NOTE: assume p_action_failure is only noise term.

    """
    # [[initialize domain]]
    domain = FiftyChain()
    domain.p_action_failure = 0.0 # eliminate stochasticity
    dummyS = domain.s0()
    domain.state = 2 # state s2
    left = domain.LEFT
    right = domain.RIGHT
    goals = domain.GOAL_STATES

    # Check basic step
    r,ns,terminal,possibleA = domain.step(left)
    assert ns == 1 and terminal == False
    assert np.all(possibleA == np.array([left, right])) # all actions available
    if ns in goals: assert r > 0
    else: assert r <= 0

    # Check another basic step
    r,ns,terminal,possibleA = domain.step(left)
    assert ns == 0 and terminal == False
    assert np.all(possibleA == np.array([left, right])) # all actions available
    if ns in goals: assert r > 0
    else: assert r <= 0

    # Ensure state does not change or wrap around and that all actions
    # remain availableon corner case, per domain spec
    r,ns,terminal,possibleA = domain.step(left)
    assert ns == 0 and terminal == False
    assert np.all(possibleA == np.array([left, right])) # all actions available
    if ns in goals: assert r > 0
    else: assert r <= 0

    # A final basic step
    r,ns,terminal,possibleA = domain.step(right)
    assert ns == 1 and terminal == False
    assert np.all(possibleA == np.array([left, right])) # all actions available
    if ns in goals: assert r > 0
    else: assert r <= 0