def _generate_lines_with_start_token(self):
    line = Line("")
    line.add_token(self.start_token, "tokens")
    return line
def forward(
    self,
    lines: List[Line],
    c0: torch.FloatTensor,
    h0: torch.FloatTensor,
    encoder_outputs: torch.FloatTensor = None,
    teacher_forcing_ratio: float = 0,
) -> torch.Tensor:
    """
    Parameters
    ----------
    lines : list of Line objects
        Batched tokenized source sentences of shape [batch size].
    c0, h0 : 3d torch.FloatTensor
        Cell and hidden state of the LSTM layer. Each state's shape is
        [n layers * n directions, batch size, hidden dim].
    encoder_outputs : torch.FloatTensor, optional
        Encoder outputs passed through to every decoding step.
    teacher_forcing_ratio : float
        Probability of feeding the ground-truth tokens to the decoder
        instead of its own previous predictions.

    Returns
    -------
    outputs : 3d torch.FloatTensor
        For every line in the batch and every time step, the scores over the
        target vocabulary. Shape [batch size, max length, vocab size].
    """
    use_teacher_forcing = random.random() < teacher_forcing_ratio
    if use_teacher_forcing:
        max_length = max(len(line.tokens["tokens"]) for line in lines)
    else:
        max_length = self.max_length
    batch_size = len(lines)

    # tensor to store the decoder's outputs
    outputs = torch.zeros(max_length, batch_size, self.vocab_size).to(self.device)

    # the last hidden & cell state of the encoder is used as the decoder's initial state
    if use_teacher_forcing:
        # feed the full ground-truth lines through the decoder in one pass
        prediction, _, _ = self.forward_step(
            lines=lines, h0=h0, c0=c0, encoder_outputs=encoder_outputs
        )
        outputs[1:] = prediction.permute(1, 0, 2)[:-1]
    else:
        # start every line in the batch with the start-of-sequence token
        lines = [self._generate_lines_with_start_token()] * batch_size
        for i in range(1, max_length):
            prediction, hn, cn = self.forward_step(
                lines=lines, h0=h0, c0=c0, encoder_outputs=encoder_outputs
            )
            prediction = prediction.squeeze(1)
            outputs[i] = prediction

            # greedily pick the highest-scoring token for each line
            line_token_indexes = prediction.argmax(1)
            line_tokens = [
                self.vocab.idx2token[line_token_index]
                for line_token_index in line_token_indexes.cpu().numpy()
            ]

            # feed the predicted tokens back in as the next decoder input
            lines = []
            for token in line_tokens:
                line = Line("")
                line.add_token(token, "tokens")
                lines.append(line)
            h0, c0 = hn, cn

    outputs = outputs.permute(1, 0, 2)
    return outputs
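# A minimal usage sketch, kept here as a comment. Only the forward() signature above is
# taken from this file; the decoder/encoder construction, the `encoder` object, and the
# assumption that this class is an nn.Module (so calling it invokes forward) are
# illustrative assumptions, not part of this module.
#
#     decoder = ...                              # hypothetical: construct the decoder
#     encoding, (h0, c0) = encoder(lines)        # hypothetical encoder returning states
#     outputs = decoder(
#         lines=lines,
#         c0=c0,
#         h0=h0,
#         encoder_outputs=encoding,
#         teacher_forcing_ratio=0.5,             # use ground-truth tokens ~half the time
#     )
#     # outputs: [batch size, max length, vocab size]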