Python FeedForward.get_trainable_flat 예제들

프로그래밍 언어: Python

네임스페이스/패키지 이름: Policies.PyTorch

클래스/타입: FeedForward

메소드/함수: get_trainable_flat

hotexamples.com에서의 예제들: 2

Python FeedForward.get_trainable_flat - 2개의 예제가 발견되었습니다. 이것들은 오픈소스 프로젝트에서 추출된 Python의 Policies.PyTorch.FeedForward.get_trainable_flat에 대한 실세계 최고 등급의 예제들입니다. 예제들을 평가하여 예제의 품질 향상에 도움을 줄 수 있습니다.

자주 사용되는 메소드들

보기 숨기기

FeedForward(7)

build_model(7)

activate(4)

load(3)

save(3)

action_parser(2)

compute_virtual_normalization(2)

get_trainable_flat(2)

set_trainable_flat(2)

activate_batch(1)

get_action(1)

get_actions_on_batch(1)

예제 #1

파일 보기

def run_test():
    cfg = {"rng": np.random}
    input_shape = 2
    output_shape = 2
    instructions = \
    {
        "init_std": 0.05,
        "layers" : [1],
        "layer_functions" : ['relu'],
        "layer_extras" : ['bn'],
        "output_function" : 'linear',
        "output_extras" : 'bn',
    }
    policy = FeedForward(input_shape, output_shape, None, cfg)
    policy.build_model(instructions)

    print("BUILT POLICY LAYERS:")
    for layer in policy.model:
        print(layer)

    flat = np.random.randn(policy.num_params)

    print("POLICY FLAT BEFORE SETTING:", policy.get_trainable_flat())
    policy.set_trainable_flat(flat)
    print("POLICY FLAT AFTER SETTING:", policy.get_trainable_flat())

    print("FLAT SHOULD NOW BE:", flat)

예제 #2

파일 보기

def test_save_load_vbn():
    cfg = {"rng": np.random}
    input_shape = 8
    output_shape = 8
    instructions = \
        {
            "init_std": 0.05,
            "layers": [64, 64],
            "layer_functions": ['relu', 'relu'],
            "layer_extras": ['bn', 'bn'],
            "output_function": 'linear',
            "output_extras": 'bn',
        }
    policy = FeedForward(input_shape, output_shape, None, cfg)
    policy.build_model(instructions)

    print("BUILT POLICY LAYERS:")
    for layer in policy.model:
        print(layer)

    vbn = [np.random.randn(input_shape) for _ in range(1000)]
    policy.compute_virtual_normalization(vbn)

    inp = np.ones(input_shape)
    out = policy.activate(inp)

    print("\nOUTPUT ON ONES WITH VBN BEFORE SAVING:", out)
    policy.save("data/experiments/exp_name/epochs/epoch_0/policy")

    out = policy.activate(inp)
    print("\nOUTPUT ON ONES WITH VBN AFTER SAVING:", out)

    policy.set_trainable_flat(policy.get_trainable_flat() +
                              np.random.randn(policy.num_params))
    out = policy.activate(inp)
    print("\nJIGGLED OUTPUT ON ONES WITH VBN BEFORE LOADING", out)

    policy.load("data/experiments/exp_name/epochs/epoch_0/policy")
    out = policy.activate(inp)
    print("OUTPUT ON ONES WITH VBN AFTER LOADING:", out)