Example #1
    def test_ids_to_text(self, test_data_dir):
        # `test_data_dir` is a pytest fixture; `self.model_name` names a
        # SentencePiece model file shipped with the test data.
        tokenizer = SentencePieceTokenizer(test_data_dir + self.model_name)

        # Round trip: encoding to ids and decoding back should reproduce
        # the original string, including the special markers.
        text = "<cls> a b c <sep> e f g h i </s>"
        ids = tokenizer.text_to_ids(text)
        result = tokenizer.ids_to_text(ids)

        assert text == result
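The two calls above are the whole round trip: `text_to_ids` encodes a string into SentencePiece ids, and `ids_to_text` decodes them back. For context, a minimal standalone sketch of the same check outside the test class; the import path and model file below are assumptions, not part of the example above:

    # Standalone round-trip sketch. The import path follows NeMo's layout
    # and the model path is hypothetical; adjust both to your setup.
    from nemo.collections.common.tokenizers.sentencepiece_tokenizer import SentencePieceTokenizer

    tokenizer = SentencePieceTokenizer("/path/to/tokenizer.model")  # hypothetical model file
    text = "<cls> a b c <sep> e f g h i </s>"
    ids = tokenizer.text_to_ids(text)
    # Lossless only if the model was trained with these markers as symbols.
    assert tokenizer.ids_to_text(ids) == text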
Example #2
    def test_ids_to_text(self, test_data_dir):
        tokenizer = SentencePieceTokenizer(test_data_dir + self.model_name)
        # Register the BERT-style special tokens so they are encoded and
        # decoded as single tokens rather than split into subword pieces.
        special_tokens = MODEL_SPECIAL_TOKENS
        tokenizer.add_special_tokens(special_tokens)

        # Round trip through ids must reproduce the original string verbatim.
        text = "[CLS] a b c [MASK] e f [SEP] g h i [SEP]"
        ids = tokenizer.text_to_ids(text)
        result = tokenizer.ids_to_text(ids)

        assert text == result
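Example #2 differs from Example #1 in one step: `add_special_tokens` registers the BERT-style markers ([CLS], [SEP], [MASK]) with the tokenizer before the round trip. A sketch of what the registered mapping might look like; the real MODEL_SPECIAL_TOKENS constant is defined in the surrounding test module, so the contents below are an assumption for illustration only:

    # Hypothetical shape of MODEL_SPECIAL_TOKENS; the actual constant
    # lives in the test module this method belongs to.
    MODEL_SPECIAL_TOKENS = {
        "cls_token": "[CLS]",
        "sep_token": "[SEP]",
        "mask_token": "[MASK]",
        "pad_token": "[PAD]",
        "unk_token": "[UNK]",
    }

Without this registration, a bracketed token such as "[CLS]" would typically be split into several subword pieces, and the decoded string would no longer match the input exactly.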