Example #1
from typing import Optional

import torch
from transformers import BertConfig, BertModel, PreTrainedModel


class UniCoilEncoder(PreTrainedModel):
    config_class = BertConfig
    base_model_prefix = 'coil_encoder'
    load_tf_weights = None

    def __init__(self, config: BertConfig):
        super().__init__(config)
        self.config = config
        self.bert = BertModel(config)
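        # Projects each contextualized token embedding to a single scalar term weight.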
        self.tok_proj = torch.nn.Linear(config.hidden_size, 1)
        self.init_weights()

    # Copied from transformers.models.bert.modeling_bert.BertPreTrainedModel._init_weights
    def _init_weights(self, module):
        """ Initialize the weights """
        if isinstance(module, (torch.nn.Linear, torch.nn.Embedding)):
            # Slightly different from the TF version which uses truncated_normal for initialization
            # cf https://github.com/pytorch/pytorch/pull/5617
            module.weight.data.normal_(mean=0.0, std=self.config.initializer_range)
        elif isinstance(module, torch.nn.LayerNorm):
            module.bias.data.zero_()
            module.weight.data.fill_(1.0)
        if isinstance(module, torch.nn.Linear) and module.bias is not None:
            module.bias.data.zero_()

    def init_weights(self):
        self.bert.init_weights()
        self.tok_proj.apply(self._init_weights)

    def forward(
            self,
            input_ids: torch.Tensor,
            attention_mask: Optional[torch.Tensor] = None,
    ):
        # Mask out padding tokens when no attention mask is supplied.
        if attention_mask is None:
            attention_mask = (input_ids != self.bert.config.pad_token_id)
        outputs = self.bert(input_ids=input_ids, attention_mask=attention_mask)
        sequence_output = outputs.last_hidden_state
        tok_weights = self.tok_proj(sequence_output)
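        # ReLU keeps the per-token weights non-negative.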
        tok_weights = torch.relu(tok_weights)
        return tok_weights
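
For reference, a minimal usage sketch (not part of the original example): the checkpoint name `bert-base-uncased` and the sample passage are illustrative placeholders, and the encoder here is randomly initialized rather than loaded from trained weights. It shows that the model returns one non-negative scalar weight per input token.

from transformers import BertTokenizer

# Illustrative checkpoint; any BERT-compatible tokenizer/config pair would do.
tokenizer = BertTokenizer.from_pretrained('bert-base-uncased')
config = BertConfig.from_pretrained('bert-base-uncased')

encoder = UniCoilEncoder(config)  # randomly initialized weights, for demonstration only
encoder.eval()

inputs = tokenizer('a sample passage about sparse retrieval', return_tensors='pt')
with torch.no_grad():
    tok_weights = encoder(
        input_ids=inputs['input_ids'],
        attention_mask=inputs['attention_mask'],
    )

# One non-negative scalar per input token: shape (batch_size, seq_len, 1)
print(tok_weights.shape)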