def tokens_to_indices(self,
                      tokens: List[Token],
                      vocabulary: Dictionary,
                      index_name: str) -> Dict[str, List[int]]:
    # pylint: disable=unused-argument
    return {
        "token_ids": ([10, 15] +
                      [vocabulary.get_token_index(token.text, 'words')
                       for token in tokens] +
                      [25]),
        "additional_key": [22, 29]
    }
def tokens_to_indices(self,
                      tokens: List[Token],
                      vocabulary: Dictionary,
                      index_name: str) -> Dict[str, List[int]]:
    indices: List[int] = []
    for token in itertools.chain(self.start_tokens, tokens, self.end_tokens):
        if getattr(token, 'text_id', None) is not None:
            # `text_id` being set on the token means that we aren't using the
            # vocab; we just use this id instead.
            indices.append(token.text_id)
        else:
            text = token.text
            if self.lowercase_tokens:
                text = text.lower()
            indices.append(vocabulary.get_token_index(text, self.namespace))
    return {index_name: indices}
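# A runnable sketch (not the library's real classes) of the per-token logic
# above: `Token` and `Dictionary` here are minimal illustrative stand-ins that
# model only `text`, `text_id`, and `get_token_index`.
from typing import Dict, Optional

class Token:
    def __init__(self, text: str, text_id: Optional[int] = None) -> None:
        self.text = text
        self.text_id = text_id

class Dictionary:
    """Assigns each (namespace, token) pair a stable integer id on first use."""
    def __init__(self) -> None:
        self._namespaces: Dict[str, Dict[str, int]] = {}

    def get_token_index(self, text: str, namespace: str) -> int:
        mapping = self._namespaces.setdefault(namespace, {})
        return mapping.setdefault(text, len(mapping))

vocab = Dictionary()
tokens = [Token('The'), Token('cat'), Token('<unk>', text_id=0)]
# Mirror the indexer's branch: honor text_id when present, otherwise lowercase
# the text (as lowercase_tokens=True would) and look it up in the namespace.
ids = [t.text_id if t.text_id is not None
       else vocab.get_token_index(t.text.lower(), 'tokens')
       for t in tokens]
print(ids)  # [0, 1, 0]: '<unk>' bypasses the vocabulary via text_id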
def tokens_to_indices(self,
                      tokens: List[Token],
                      vocabulary: Dictionary,
                      index_name: str) -> Dict[str, List[List[int]]]:
    indices: List[List[int]] = []
    # Combine the tokens with the start and end tokens.
    for token in itertools.chain(self.start_tokens, tokens, self.end_tokens):
        token_indices: List[int] = []
        if token.text is None:
            # A token with no text cannot be split into characters.
            raise ValueError('Cannot index characters for a token with no text')
        for character in self.character_tokenizer.tokenize(token.text):
            if getattr(character, 'text_id', None) is not None:
                # `text_id` being set means the character already carries its
                # own id, so we bypass the vocabulary.
                index = character.text_id
            else:
                index = vocabulary.get_token_index(character.text, self.namespace)
            token_indices.append(index)
        indices.append(token_indices)
    return {index_name: indices}
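# Shape check for the character-level indexer above: the output is one inner
# list of character ids per token. This standalone sketch assumes nothing from
# the library; `char_index` is a hypothetical stand-in for get_token_index.
from typing import Dict

char_vocab: Dict[str, int] = {}

def char_index(ch: str) -> int:
    # Assign each new character the next free id, like a growing vocabulary.
    return char_vocab.setdefault(ch, len(char_vocab))

words = ['cat', 'sat']
indices = [[char_index(ch) for ch in word] for word in words]
print(indices)  # [[0, 1, 2], [3, 1, 2]]: 'a' and 't' share ids across tokens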
def index(self, vocab: Dictionary):
    if self.indexed_labels is None:
        self.indexed_labels = [vocab.get_token_index(label, self.label_namespace)
                               for label in self.labels]
def index(self, vocab: Dictionary):
    if self.label_id is None:
        self.label_id = vocab.get_token_index(self.label, self.label_namespace)  # type: ignore
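# A hedged end-to-end sketch of the label-indexing pattern above. `Dictionary`
# and `LabelField` are illustrative stand-ins, not the library's classes; they
# model only the attributes that index() touches.
from typing import Dict, Optional

class Dictionary:
    def __init__(self, mapping: Dict[str, int]) -> None:
        self._mapping = mapping

    def get_token_index(self, text: str, namespace: str) -> int:
        return self._mapping[text]  # this stub ignores the namespace

class LabelField:
    def __init__(self, label: str, label_namespace: str = 'labels') -> None:
        self.label = label
        self.label_namespace = label_namespace
        self.label_id: Optional[int] = None

    def index(self, vocab: Dictionary) -> None:
        if self.label_id is None:  # same guard as above: convert at most once
            self.label_id = vocab.get_token_index(self.label, self.label_namespace)

field = LabelField('negative')
field.index(Dictionary({'positive': 0, 'negative': 1}))
print(field.label_id)  # 1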