Python normalize Examples

Programming Language: Python

Namespace/Package Name: metanl.mecab

Method/Function: normalize

Examples at hotexamples.com: 4

Python normalize - 4 examples found. These are the top rated real world Python examples of metanl.mecab.normalize extracted from open source projects. You can rate examples to help us improve the quality of examples.

Example #1

Show file

File: allPythonContent.py Project: Mondego/pyreco

def normalize_topic(topic):
    """
    Get a canonical representation of a Wikipedia topic, which may include
    a disambiguation string in parentheses.

    Returns (name, disambig), where "name" is the normalized topic name,
    and "disambig" is a string corresponding to the disambiguation text or
    None.
    """
    # find titles of the form Foo (bar)
    topic = topic.replace('_', ' ')
    match = re.match(r'([^(]+) \(([^)]+)\)', topic)
    if not match:
        return normalize(topic), None
    else:
        return normalize(match.group(1)), 'n/' + match.group(2).strip(' _')

Example #2

Show file

def normalize_topic(topic):
    """
    Get a canonical representation of a Wikipedia topic, which may include
    a disambiguation string in parentheses.

    Returns (name, disambig), where "name" is the normalized topic name,
    and "disambig" is a string corresponding to the disambiguation text or
    None.
    """
    # find titles of the form Foo (bar)
    topic = topic.replace('_', ' ')
    match = re.match(r'([^(]+) \(([^)]+)\)', topic)
    if not match:
        return normalize(topic), None
    else:
        return normalize(match.group(1)), 'n/' + match.group(2).strip(' _')

Example #3

Show file

File: allPythonContent.py Project: Mondego/pyreco

def test_japanese():
    eq_(normalize('これはテストです'), 'テスト')
    this_is_a_test = [('これ', '~名詞', 'これ'),
                      ('は', '~助詞', 'は'),
                      ('テスト', '名詞', 'テスト'),
                      ('です', '~助動詞', 'です'),
                      ('。', '.', '。')]
    eq_(tag_and_stem('これはテストです。'), this_is_a_test)

Example #4

Show file

def test_japanese():
    eq_(normalize('これはテストです'), 'テスト')
    this_is_a_test = [('これ', '~名詞', 'これ'), ('は', '~助詞', 'は'),
                      ('テスト', '名詞', 'テスト'), ('です', '~助動詞', 'です'),
                      ('。', '.', '。')]
    eq_(tag_and_stem('これはテストです。'), this_is_a_test)