Python Speech2Text2Tokenizer Examples

Programming Language: Python

Namespace/Package Name: transformers.models.speech_to_text_2

Examples at hotexamples.com: 2

Python Speech2Text2Tokenizer - 2 examples found. These are the top-rated real-world Python examples of transformers.models.speech_to_text_2.Speech2Text2Tokenizer, extracted from open source projects.

Frequently Used Methods

from_pretrained(2)

Example #1

File: test_tokenization_speech_to_text_2.py Project: Kevin-Zhao-Github/oLMpics

    def test_load_no_merges_file(self):
        tokenizer = Speech2Text2Tokenizer.from_pretrained(self.tmpdirname)

        with tempfile.TemporaryDirectory() as tmp_dirname:
            tokenizer.save_pretrained(tmp_dirname)
            os.remove(os.path.join(tmp_dirname, "merges.txt"))

            # loading the tokenizer without a merges file should not raise an error
            tokenizer = Speech2Text2Tokenizer.from_pretrained(tmp_dirname)

        with tempfile.TemporaryDirectory() as tmp_dirname:
            # save tokenizer and load again
            tokenizer.save_pretrained(tmp_dirname)
            tokenizer = Speech2Text2Tokenizer.from_pretrained(tmp_dirname)

        self.assertIsNotNone(tokenizer)

Example #2

File: test_tokenization_speech_to_text_2.py Project: vuhluu/transformers

    def test_tokenizer_decode(self):
        tokenizer = Speech2Text2Tokenizer.from_pretrained(self.tmpdirname)

        # make sure @@ is correctly concatenated
        token_ids = [4, 6, 8, 7, 10]  # ["here@@", "couple", "words", "of@@", "the"]
        output_string = tokenizer.decode(token_ids)

        # assertEqual reports both strings on failure, unlike assertTrue(a == b)
        self.assertEqual(output_string, "herecouple words ofthe")
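
The decode test above depends on how "@@" continuation markers are merged. The following is a minimal, library-free sketch of that idea, not the tokenizer's actual implementation: the vocabulary, the "@@ " merge marker, and the helper names tokens_to_string and decode are assumptions chosen for illustration so that the ids line up with the comment in the test.

```python
# Toy vocabulary (an assumption for illustration); list positions serve as token ids,
# chosen so that 4, 6, 8, 7, 10 map to the tokens named in the test comment.
VOCAB = [
    "<s>", "<pad>", "</s>", "<unk>",
    "here@@", "a", "couple", "of@@", "words", "for", "the", "vocab",
]
ID_TO_TOKEN = dict(enumerate(VOCAB))

# Assumed merge marker: a token ending in "@@" is glued to the token that follows it.
BPE_MERGE_MARKER = "@@ "


def tokens_to_string(tokens):
    """Join tokens with spaces, then drop every '@@ ' so marked tokens fuse with the next one."""
    text = " ".join(tokens)
    return "".join(text.split(BPE_MERGE_MARKER))


def decode(token_ids):
    """Map ids to tokens using the toy vocabulary and merge continuation markers."""
    tokens = [ID_TO_TOKEN[i] for i in token_ids]
    return tokens_to_string(tokens)


if __name__ == "__main__":
    print(decode([4, 6, 8, 7, 10]))
```

Under these assumptions, "here@@" fuses with "couple" and "of@@" fuses with "the", which is why the expected string in the test contains "herecouple" and "ofthe" rather than space-separated words.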