Python utf8_chunk examples

Programming language: Python

Namespace/package name: kobo.hub.models

Method/function: utf8_chunk

Code examples listed on hotexamples.com: 8

Python utf8_chunk: hotexamples.com lists 8 code examples. All are real-world uses of kobo.hub.models.utf8_chunk in Python, extracted from open source projects and selected from the top-rated entries.

Code example #1

File: test_utf8_chunk.py Project: release-engineering/kobo

    def test_fixup_end(self):
        """utf8_chunk returns copy of input aligned to nearest character boundary
        if input is a byte sequence truncated in the middle of a unicode character."""
        unistr = u'hello 世界'
        bytestr = unistr.encode('utf-8')

        # this is now a broken sequence since we cut it off
        # partway through a character
        bytestr = bytestr[:-1]

        # proving it's broken
        try_decode = lambda: bytestr.decode('utf-8')
        self.assertRaises(UnicodeDecodeError, try_decode)

        # utf8_chunk unbreaks it by removing until the previous
        # complete character
        self.assertEqual(utf8_chunk(bytestr).decode('utf-8'), u'hello 世')
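
The tests on this page pin down the observable behaviour of utf8_chunk: trim an incomplete trailing character, and otherwise hand back the very same bytes object. The following is a minimal sketch of a function that would satisfy those tests. It is not kobo's actual implementation; the name utf8_chunk_sketch is hypothetical, and the code assumes Python 3, where indexing a bytes object yields an int.

```python
def utf8_chunk_sketch(data):
    """Hypothetical stand-in: drop an incomplete trailing UTF-8 character.

    Returns the input object itself when nothing needs trimming or when
    trimming would not make the bytes decodable.
    """
    # Walk back over at most 3 trailing continuation bytes (0b10xxxxxx).
    i = len(data) - 1
    steps = 0
    while i >= 0 and steps < 3 and (data[i] & 0xC0) == 0x80:
        i -= 1
        steps += 1
    if i < 0:
        return data  # empty input, or nothing but continuation bytes

    # The lead byte announces how long the character should be.
    lead = data[i]
    if (lead & 0xE0) == 0xC0:
        needed = 2
    elif (lead & 0xF0) == 0xE0:
        needed = 3
    elif (lead & 0xF8) == 0xF0:
        needed = 4
    else:
        # ASCII, a stray continuation byte, or an invalid lead byte:
        # there is no partial character to trim.
        needed = 1

    if len(data) - i >= needed:
        return data  # the final character is not cut short

    # Cut just before the incomplete character, but only keep the result
    # if the remaining bytes really are valid UTF-8.
    trimmed = data[:i]
    try:
        trimmed.decode('utf-8')
    except UnicodeDecodeError:
        return data
    return trimmed


truncated = 'hello 世界'.encode('utf-8')[:-1]
print(utf8_chunk_sketch(truncated).decode('utf-8'))  # hello 世
```

Returning the input object unchanged in every no-op case is what lets identity checks such as assertIs pass, rather than only equality checks.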

Code example #6

File: test_utf8_chunk.py Project: release-engineering/kobo

    def test_noop_ascii(self):
        """utf8_chunk returns input bytes if byte sequence is entirely ASCII"""
        bytestr = b'hello world'
        self.assertIs(utf8_chunk(bytestr), bytestr)

Code example #7

File: test_utf8_chunk.py Project: release-engineering/kobo

    def test_noop_invalid(self):
        """utf8_chunk returns input bytes if byte sequence is not valid
        UTF-8 and can't be fixed by truncation"""
        bytestr = b'hello \xff\xff\xff'
        self.assertIs(utf8_chunk(bytestr), bytestr)
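
The byte 0xFF never occurs in well-formed UTF-8, so no amount of trimming at a character boundary repairs this input. The sketch below contrasts that case with a truncated character by reporting where decoding first fails. The helper name first_decode_error is hypothetical, and the wording of the reason text depends on the interpreter, so only the offsets are noted in the comments.

```python
def first_decode_error(data):
    """Return (start, end, reason) of the first UTF-8 decode error, or None."""
    try:
        data.decode('utf-8')
    except UnicodeDecodeError as exc:
        return exc.start, exc.end, exc.reason
    return None


truncated = 'hello 世界'.encode('utf-8')[:-1]
invalid = b'hello \xff\xff\xff'

# Fails at offset 9, the lead byte of the character that was cut short.
print(first_decode_error(truncated))
# Fails at offset 6, the first 0xFF byte.
print(first_decode_error(invalid))
# Well-formed input produces no error at all.
print(first_decode_error(b'hello world'))
```

In the truncated case everything before the reported offset is valid text, which is why dropping the tail is enough. In the invalid case the bad bytes are not a cut-off character, which matches the test's expectation that the input comes back untouched.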

Code example #8

File: test_utf8_chunk.py Project: release-engineering/kobo

    def test_noop_utf8_mid(self):
        """utf8_chunk returns input bytes if byte sequence uses non-ASCII
        UTF-8 in the middle of the string and is well-formed"""
        unistr = u'hello 世界!'
        bytestr = unistr.encode('utf-8')
        self.assertIs(utf8_chunk(bytestr), bytestr)