Example #1
def test_tokenize_with_spaces():
    assert tokenize('Hello there friend') == ['Hello', 'there', 'friend']
Example #2
def test_tokenize_mix_alphanumeric():
    assert tokenize('123hello l33t__w0rds') == ['123hello', 'l33t', 'w0rds']
Example #3
def test_tokenize_with_unicode_symbols():
    assert tokenize('Emoji🤓are👍fun') == ['Emoji', 'are', 'fun']
Example #4
def test_tokenize_numbers():
    assert tokenize('123_345 90!22*66') == ['123', '345', '90', '22', '66']
Example #5
def test_tokenize_with_underscores():
    assert tokenize('So_many__underscores') == ['So', 'many', 'underscores']
Example #6
def test_tokenize_with_multiple_delimiters():
    assert tokenize('So    many    spaces') == ['So', 'many', 'spaces']
Example #7
def test_tokenize_with_mixed_characters():
    assert tokenize('This,sentence!is crazy') == [
        'This', 'sentence', 'is', 'crazy'
    ]
Example #8
def test_tokenize_with_commas():
    assert tokenize('my,spacebar,is,broken') == [
        'my', 'spacebar', 'is', 'broken'
    ]