Python BlenderbotForConditionalGeneration.half Examples

Programming language: Python

Namespace/package name: transformers

Class/type: BlenderbotForConditionalGeneration

Method/function: half

Examples on hotexamples.com: 2

Python BlenderbotForConditionalGeneration.half - 2 code examples found. These are the top-rated real-world Python examples of transformers.BlenderbotForConditionalGeneration.half, all extracted from open source projects.

Frequently used methods

from_pretrained(7)

BlenderbotForConditionalGeneration(4)

half(2)

eval(1)

generate(1)

save_pretrained(1)

to(1)

Example #1

File: convert_blenderbot_original_pytorch_checkpoint_to_pytorch.py Project: yulinggu-cs/transformers

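import torch
from transformers import BlenderbotConfig, BlenderbotForConditionalGeneration

# IGNORE_KEYS, rename_state_dict_key and rename_layernorm_keys are defined
# elsewhere in the same conversion script and are not shown in this excerpt.


def convert_parlai_checkpoint(checkpoint_path, pytorch_dump_folder_path,
                              config_json_path):
    """
    Copy/paste/tweak the ParlAI model's weights to the transformers Blenderbot structure.
    """
    # Load the original checkpoint on CPU and take its state dict.
    model = torch.load(checkpoint_path, map_location="cpu")
    sd = model["model"]
    cfg = BlenderbotConfig.from_json_file(config_json_path)
    m = BlenderbotForConditionalGeneration(cfg)
    valid_keys = m.model.state_dict().keys()
    failures = []  # keys that could not be mapped; collected but not used further here
    mapping = {}
    for k, v in sd.items():
        if k in IGNORE_KEYS:
            continue

        new_k = rename_state_dict_key(k)
        if new_k not in valid_keys:
            failures.append([k, new_k])
        else:
            mapping[new_k] = v
    if cfg.normalize_before:  # Blenderbot-3B checkpoints. Rename layernorm_embedding -> layer_norm
        rename_layernorm_keys(sd)
    m.model.load_state_dict(mapping, strict=True)
    m.half()  # cast floating-point weights to fp16 before saving
    m.save_pretrained(pytorch_dump_folder_path)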
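
To make the effect of the `m.half()` call in Example #1 concrete, here is a minimal sketch. It assumes only that `half()` is the standard `torch.nn.Module.half()` method that transformers models inherit; the small `nn.Sequential` model and the `param_bytes` helper are hypothetical stand-ins, not part of the conversion script.

```python
import torch
from torch import nn

# Hypothetical stand-in for a large model; not the Blenderbot architecture.
model = nn.Sequential(nn.Embedding(100, 16), nn.Linear(16, 16))


def param_bytes(module: nn.Module) -> int:
    """Total number of bytes occupied by the module's parameters."""
    return sum(p.numel() * p.element_size() for p in module.parameters())


fp32_bytes = param_bytes(model)

# half() casts floating-point parameters and buffers to torch.float16
# in place and returns the module itself.
returned = model.half()

fp16_bytes = param_bytes(model)
dtypes = {p.dtype for p in model.parameters()}

print(dtypes, fp32_bytes, fp16_bytes)
```

Because the cast happens before `save_pretrained`, the weights written to disk would take roughly half the space of the fp32 originals.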

Example #2

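from transformers import BlenderbotForConditionalGeneration
from transformers.testing_utils import torch_device


# Method of a model test class; self.model_tester is provided by that class.
def test_generate_fp16(self):
    config, input_dict = self.model_tester.prepare_config_and_inputs()
    input_ids = input_dict["input_ids"]
    attention_mask = input_ids.ne(1).to(torch_device)
    model = BlenderbotForConditionalGeneration(config).eval().to(torch_device)
    # Cast to fp16 only on GPU; on CPU the model stays in fp32.
    if torch_device == "cuda":
        model.half()
    # Generate from explicit input IDs, then again with no input IDs.
    model.generate(input_ids, attention_mask=attention_mask)
    model.generate(num_beams=4, do_sample=True, early_stopping=False, num_return_sequences=3)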
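
The test in Example #2 depends on a test harness that is not shown. As a rough standalone sketch of the same pattern, the following builds a tiny, randomly initialized model and switches to fp16 only when a GPU is available. The configuration sizes, the random input IDs, and `max_new_tokens=5` are illustrative assumptions, and the generated tokens are meaningless because the weights are untrained.

```python
import torch
from transformers import BlenderbotConfig, BlenderbotForConditionalGeneration

torch.manual_seed(0)
device = "cuda" if torch.cuda.is_available() else "cpu"

# Tiny illustrative configuration (assumed sizes, not a released checkpoint).
config = BlenderbotConfig(
    vocab_size=99,
    d_model=16,
    encoder_layers=1,
    decoder_layers=1,
    encoder_attention_heads=2,
    decoder_attention_heads=2,
    encoder_ffn_dim=32,
    decoder_ffn_dim=32,
    max_position_embeddings=32,
)
model = BlenderbotForConditionalGeneration(config).eval().to(device)

# As in the test above, cast to fp16 only on GPU.
if device == "cuda":
    model.half()

# Random token IDs that avoid the special tokens (IDs 0-2 by default).
input_ids = torch.randint(3, config.vocab_size, (2, 6), device=device)
attention_mask = input_ids.ne(config.pad_token_id).long()

output_ids = model.generate(
    input_ids, attention_mask=attention_mask, max_new_tokens=5
)
print(output_ids.shape, next(model.parameters()).dtype)
```

The input IDs remain integer tensors, so only the model weights need casting; `generate` is called the same way in either precision.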