Python GPTNeoForCausalLM.GPTNeoForCausalLM Exemples

Langage de programmation: Python

Espace de nommage/Pack: transformers

Méthode/Fonction: GPTNeoForCausalLM

Exemples au hotexamples.com: 2

Python GPTNeoForCausalLM.GPTNeoForCausalLM - 2 exemples trouvés. Ce sont les exemples réels les mieux notés de transformers.GPTNeoForCausalLM.GPTNeoForCausalLM extraits de projets open source. Vous pouvez noter les exemples pour nous aider à en améliorer la qualité.

Méthodes fréquemment utilisées

Afficher Cacher

from_pretrained(16)

GPTNeoForCausalLM(2)

eval(1)

gradient_checkpointing_enable(1)

save_pretrained(1)

to(1)

Méthodes fréquemment utilisées

from_pretrained (16)

GPTNeoForCausalLM (2)

eval (1)

gradient_checkpointing_enable (1)

save_pretrained (1)

to (1)

Exemple #1

0

Afficher le fichier

def create_and_check_forward_and_backwards(self, config, input_ids, input_mask, head_mask, token_type_ids, *args): model = GPTNeoForCausalLM(config) model.to(torch_device) result = model(input_ids, token_type_ids=token_type_ids, labels=input_ids) self.parent.assertEqual(result.loss.shape, ()) self.parent.assertEqual(result.logits.shape, (self.batch_size, self.seq_length, self.vocab_size)) result.loss.backward()

Exemple #2

0

Afficher le fichier

def convert_tf_checkpoint_to_pytorch(tf_checkpoint_path, config_file, pytorch_dump_path): # Initialise PyTorch model config_json = json.load(open(config_file, "r")) config = GPTNeoConfig( hidden_size=config_json["n_embd"], num_layers=config_json["n_layer"], num_heads=config_json["n_head"], attention_types=config_json["attention_types"], max_position_embeddings=config_json["n_positions"], resid_dropout=config_json["res_dropout"], embed_dropout=config_json["embed_dropout"], attention_dropout=config_json["attn_dropout"], ) print(f"Building PyTorch model from configuration: {config}") model = GPTNeoForCausalLM(config) # Load weights from tf checkpoint load_tf_weights_in_gpt_neo(model, config, tf_checkpoint_path) # Save pytorch-model print(f"Save PyTorch model to {pytorch_dump_path}") model.save_pretrained(pytorch_dump_path)