Example #1
    # Assumed imports for this snippet: from typing import Tuple; import torch;
    # from allennlp.nn import util; from allennlp.nn.util import masked_softmax
    def _compute_attention(
        self,
        decoder_hidden_state: torch.FloatTensor = None,
        encoder_outputs: torch.FloatTensor = None,
        encoder_outputs_mask: torch.LongTensor = None
    ) -> Tuple[torch.Tensor, torch.Tensor]:
        """Apply attention over encoder outputs and decoder state.
        Parameters
        ----------
        decoder_hidden_state : ``torch.FloatTensor``
            A tensor of shape ``(batch_size, decoder_output_dim)`` containing the decoder hidden state from
            the last time step, used as the 'query' in the attention computation.
        encoder_outputs : ``torch.FloatTensor``
            A tensor of shape ``(batch_size, max_input_sequence_length, encoder_output_dim)`` containing all the
            encoder hidden states of the source tokens, i.e., the 'keys' in the attention computation.
        encoder_outputs_mask : ``torch.LongTensor``
            A tensor of shape ``(batch_size, max_input_sequence_length)`` containing the mask of the encoded input.
            We must avoid computing attention scores for the padded positions of the source, since not all
            input sentences have the same length.

        Returns
        -------
        (torch.Tensor, torch.Tensor)
            A tensor of shape ``(batch_size, encoder_output_dim)`` containing the attended encoder outputs
            (a.k.a. the context vector), i.e., the attention scores ``applied`` to the encoder hidden states,
            and a tensor of shape ``(batch_size, max_input_sequence_length)`` containing the attention
            probabilities over the source positions.

        Notes
        -----
            Don't forget to apply the final softmax over the **masked** encoder outputs!
        """

        # Ensure the mask is also a FloatTensor; otherwise the multiplication
        # inside the attention computation will complain.
        # shape: (batch_size, max_input_sequence_length)
        encoder_outputs_mask = encoder_outputs_mask.float()

        # Dot-product attention: a batched dot product between the decoder
        # hidden state (the 'query') and every encoder output (the 'keys').
        #   decoder_hidden_state: (batch_size, decoder_output_dim)          e.g. (1, 400)
        #   encoder_outputs:      (batch_size, max_seq_len, output_dim)     e.g. (1, 14, 400)
        #   encoder_outputs_mask: (batch_size, max_seq_len)                 e.g. (1, 14)
        # shape: (batch_size, max_input_sequence_length)
        attention_weights = encoder_outputs.bmm(
            decoder_hidden_state.unsqueeze(-1)).squeeze(-1)

        # Normalize with a softmax over the *masked* positions only.
        # shape: (batch_size, max_input_sequence_length), e.g. (1, 14)
        attention_probs = masked_softmax(attention_weights,
                                         encoder_outputs_mask)

        # Probability-weighted sum of the encoder outputs (the context vector).
        # shape: (batch_size, encoder_output_dim)
        context_vector = util.weighted_sum(encoder_outputs, attention_probs)

        return context_vector, attention_probs
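
The method above is cut out of its (AllenNLP-based) class, so below is a minimal standalone sketch of the same dot-product attention in plain PyTorch. The shapes (batch 1, 14 source tokens, 400-dim states) are the hypothetical ones from the comments above; masked_fill plus torch.softmax stands in for AllenNLP's masked_softmax, and a second bmm stands in for util.weighted_sum.

import torch

batch_size, seq_len, hidden = 1, 14, 400
decoder_hidden_state = torch.randn(batch_size, hidden)
encoder_outputs = torch.randn(batch_size, seq_len, hidden)
# Hypothetical mask: the first 10 positions are real tokens, the last 4 are padding.
encoder_outputs_mask = torch.cat(
    [torch.ones(batch_size, 10), torch.zeros(batch_size, 4)], dim=-1)

# Raw scores: batched dot product between the query and each key.
# shape: (batch_size, seq_len)
attention_weights = encoder_outputs.bmm(
    decoder_hidden_state.unsqueeze(-1)).squeeze(-1)

# Masked softmax: push padded positions to -inf before normalizing,
# so they receive exactly zero attention probability.
attention_weights = attention_weights.masked_fill(
    encoder_outputs_mask == 0, float("-inf"))
attention_probs = torch.softmax(attention_weights, dim=-1)

# Context vector: probability-weighted sum of the encoder outputs.
# shape: (batch_size, hidden)
context_vector = attention_probs.unsqueeze(1).bmm(encoder_outputs).squeeze(1)

print(context_vector.shape)   # torch.Size([1, 400])
print(attention_probs.shape)  # torch.Size([1, 14])

Note that plain dot-product attention only works when decoder_output_dim equals encoder_output_dim (both 400 here); otherwise a bilinear or additive scoring function is needed.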