def loss_lookahead_diff(model: NeuralTeleportationModel, data: Tensor, target: Tensor,
                        metrics: TrainingMetrics, config: OptimalTeleportationTrainingConfig,
                        **kwargs) -> Number:
    # Save the state of the model, prior to performing the lookahead
    state_dict = model.state_dict()

    # Initialize a new optimizer to perform lookahead
    optimizer = get_optimizer_from_model_and_config(model, config)
    optimizer.zero_grad()

    # Compute loss at the teleported point
    loss = torch.stack([metrics.criterion(model(data_batch), target_batch)
                        for data_batch, target_batch in zip(data, target)]).mean(dim=0)

    # Take a step using the gradient at the teleported point
    loss.backward()
    optimizer.step()

    # Compute loss after the optimizer step
    lookahead_loss = torch.stack([metrics.criterion(model(data_batch), target_batch)
                                  for data_batch, target_batch in zip(data, target)]).mean(dim=0)

    # Restore the state of the model prior to the lookahead
    model.load_state_dict(state_dict)

    # Compute the difference between the lookahead loss and the original loss
    return (loss - lookahead_loss).item()
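Below is a minimal sketch of how such a lookahead metric could be used to choose among candidate teleportations of the same network. The selection loop, the num_candidates parameter, and the in-place model.random_teleport() call are assumptions for illustration, not part of the original script; adjust them to the actual teleportation API.

from copy import deepcopy

original_weights = deepcopy(model.state_dict())
best_score, best_weights = float("-inf"), original_weights
for _ in range(num_candidates):                 # num_candidates: hypothetical number of candidates to try
    model.load_state_dict(original_weights)     # restart from the original weights for each candidate
    model.random_teleport()                     # assumed to teleport the weights in place
    score = loss_lookahead_diff(model, data, target, metrics=metrics, config=config)
    if score > best_score:
        best_score, best_weights = score, deepcopy(model.state_dict())
model.load_state_dict(best_weights)             # keep the candidate with the largest one-step loss drop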
hidden_layers = (128, 10)
net1 = MLPCOB(input_shape=(1, 28, 28), num_classes=10, hidden_layers=hidden_layers).to(device)
if args.same_init:
    net2 = deepcopy(net1)
else:
    net2 = MLPCOB(input_shape=(1, 28, 28), num_classes=10, hidden_layers=hidden_layers).to(device)

model1 = NeuralTeleportationModel(network=net1, input_shape=sample_input_shape)
if args.weights1 is not None:
    model1.load_state_dict(torch.load(args.weights1))
config.batch_size = 8  # Change batch size to train to different minima
train(model1, train_dataset=mnist_train, metrics=metrics, config=config, val_dataset=mnist_test)
torch.save(model1.state_dict(), pjoin(save_path, 'model1.pt'))
print("Model 1 test results: ", test(model1, mnist_test, metrics, config))

model2 = NeuralTeleportationModel(network=net2, input_shape=sample_input_shape)
if args.weights2 is not None:
    model2.load_state_dict(torch.load(args.weights2))
config.batch_size = 512  # Change batch size to train to different minima
train(model2,