Python standardize_text Examples

Programming Language: Python

Namespace/Package Name: neuroquery.tokenization

Method/Function: standardize_text

Examples at hotexamples.com: 2

Python standardize_text - 2 examples found. These are the top rated real world Python examples of neuroquery.tokenization.standardize_text extracted from open source projects. You can rate examples to help us improve the quality of examples.

Example #1

Show file

def test_get_standardizing_inverse():
    std_inv = tokenization.get_standardizing_inverse(
        VOCABULARY_FILE,
        lambda t: tokenization.standardize_text(t, stemming="porter_stemmer"),
    )
    assert std_inv["memori"] == "memory"
    assert std_inv["work memori"] == "working memory"
    assert std_inv["nerv"] == "nerves"

Example #2

Show file

def test_standardize_text():
    text = "One a the Word abcd-eft: --\nhello\t 1240"
    assert (
        tokenization.standardize_text(text) == "one word abcd eft hello 1240")