Python standardize_text Examples

Programming language: Python

Namespace/package name: neuroquery.tokenization

Method/function: standardize_text

Examples at hotexamples.com: 2

Python standardize_text - 2 examples found. These are the top-rated real-world Python examples of neuroquery.tokenization.standardize_text, extracted from open source projects.

Example #1

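from neuroquery import tokenization


def test_get_standardizing_inverse():
    # VOCABULARY_FILE is defined elsewhere in the original test module (not shown here).
    # The callable standardizes each term with Porter stemming.
    std_inv = tokenization.get_standardizing_inverse(
        VOCABULARY_FILE,
        lambda t: tokenization.standardize_text(t, stemming="porter_stemmer"),
    )
    # Each standardized (stemmed) form maps back to a readable vocabulary term.
    assert std_inv["memori"] == "memory"
    assert std_inv["work memori"] == "working memory"
    assert std_inv["nerv"] == "nerves"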
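
The test above suggests that a "standardizing inverse" is a mapping from each standardized form back to an original vocabulary term. The following is a minimal sketch of that idea only, not neuroquery's implementation: `build_standardizing_inverse` and `toy_standardize` are hypothetical names, the toy standardizer is a crude stand-in for Porter stemming, and keeping the first term seen when two terms collide is an assumption.

```python
def build_standardizing_inverse(terms, standardize):
    """Map each standardized form back to one original term.

    Hypothetical helper: when several terms share a standardized form,
    the first one encountered is kept (an assumption for this sketch).
    """
    inverse = {}
    for term in terms:
        inverse.setdefault(standardize(term), term)
    return inverse


def toy_standardize(term):
    # Crude stand-in for a stemmer: lowercase each word and drop a
    # trailing "s" from words longer than three characters.
    words = term.lower().split()
    return " ".join(w[:-1] if w.endswith("s") and len(w) > 3 else w for w in words)


vocabulary = ["nerves", "working memory", "nerve"]
inverse = build_standardizing_inverse(vocabulary, toy_standardize)
print(inverse)
```

Here "nerves" and "nerve" share the standardized form "nerve", so only the first of the two is kept as its readable counterpart.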

Example #2

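from neuroquery import tokenization


def test_standardize_text():
    # Mixed case, stop words ("a", "the"), punctuation and irregular whitespace.
    text = "One a the Word abcd-eft: --\nhello\t 1240"
    expected = "one word abcd eft hello 1240"
    assert tokenization.standardize_text(text) == expected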
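
The assertion above implies that standardization lowercases the text, splits it on punctuation and whitespace, and drops stop words. The following is a rough standard-library sketch of that behavior for this one input, not the library's code: `simple_standardize` and `STOP_WORDS` are hypothetical, and the stop-word list is a tiny illustrative assumption rather than the list neuroquery uses.

```python
import re

# Tiny illustrative stop-word list; an assumption, not neuroquery's list.
STOP_WORDS = {"a", "an", "the", "of", "and"}


def simple_standardize(text):
    # Lowercase, then keep runs of ASCII letters and digits as tokens,
    # so hyphens, colons, tabs and newlines all act as separators.
    tokens = re.findall(r"[a-z0-9]+", text.lower())
    # Drop stop words and join the rest with single spaces.
    return " ".join(t for t in tokens if t not in STOP_WORDS)


result = simple_standardize("One a the Word abcd-eft: --\nhello\t 1240")
print(result)  # one word abcd eft hello 1240
```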