Python unicode_is_punctuation Examples

Programming language: Python

Namespace/package name: metanl.extprocess

Method/function: unicode_is_punctuation

Code examples listed on hotexamples.com: 4

Python unicode_is_punctuation: 4 code examples found. These are real-world uses of metanl.extprocess.unicode_is_punctuation in Python, extracted from open source projects and selected as the top rated.

Example #1

File: allPythonContent.py Project: Mondego/pyreco

    def tag_and_stem(self, text, cache=None):
        """
        Given some text, return a sequence of (stem, pos, text) triples as
        appropriate for the reader. `pos` can be as general or specific as
        necessary (for example, it might label all parts of speech, or it might
        only distinguish function words from others).

        Twitter-style hashtags and at-mentions have the stem and pos they would
        have without the leading # or @. For instance, if the reader's triple
        for "thing" is ('thing', 'NN', 'things'), then "#things" would come out
        as ('thing', 'NN', '#things').
        """
        analysis = self.analyze(text)
        triples = []

        for record in analysis:
            root = self.get_record_root(record)
            token = self.get_record_token(record)

            if token:
                if unicode_is_punctuation(token):
                    triples.append((token, '.', token))
                else:
                    pos = self.get_record_pos(record)
                    triples.append((root, pos, token))
        return triples
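
The method above depends on a reader class that is not shown. As a rough, self-contained sketch of the same control flow, the following uses a hypothetical `ToyReader` whose records are plain `(root, pos, token)` tuples and a stand-in `_is_punct` helper. Both names, the whitespace tokenizer, and the fixed `"X"` tag are assumptions made for illustration and are not part of metanl.

```python
import unicodedata


def _is_punct(token):
    # Stand-in for metanl.extprocess.unicode_is_punctuation (an assumption):
    # true when every character falls in a Unicode punctuation category (P*).
    return bool(token) and all(
        unicodedata.category(ch).startswith("P") for ch in token
    )


class ToyReader:
    """Hypothetical reader; each record is a (root, pos, token) tuple."""

    def analyze(self, text):
        # Toy analysis: split on whitespace, lowercase as the "stem",
        # and use a single placeholder part-of-speech tag.
        return [(tok.lower(), "X", tok) for tok in text.split()]

    def get_record_root(self, record):
        return record[0]

    def get_record_pos(self, record):
        return record[1]

    def get_record_token(self, record):
        return record[2]

    def tag_and_stem(self, text):
        triples = []
        for record in self.analyze(text):
            token = self.get_record_token(record)
            if not token:
                continue
            if _is_punct(token):
                # Punctuation keeps its surface form and gets the '.' tag.
                triples.append((token, ".", token))
            else:
                root = self.get_record_root(record)
                pos = self.get_record_pos(record)
                triples.append((root, pos, token))
        return triples


triples = ToyReader().tag_and_stem("Hello , World !")
print(triples)
```

Punctuation tokens are emitted as `(token, '.', token)`, so the stem and the surface form are identical and callers can filter on the `'.'` tag.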

Example #3

File: allPythonContent.py Project: Mondego/pyreco

from metanl.extprocess import unicode_is_punctuation


def test_unicode_is_punctuation():
    assert unicode_is_punctuation('word') is False
    assert unicode_is_punctuation('。') is True
    assert unicode_is_punctuation('-') is True
    assert unicode_is_punctuation('-3') is False
    assert unicode_is_punctuation('あ') is False
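
The assertions above suggest the function returns True only when every character of the string is punctuation. One way to get that behavior, offered as a minimal sketch and not as metanl's actual implementation, is to check each character's Unicode general category with the standard `unicodedata` module. The name `is_punctuation_sketch` and the choice to return False for an empty string are assumptions.

```python
import unicodedata


def is_punctuation_sketch(text):
    """Return True if `text` is non-empty and consists only of punctuation.

    Hypothetical re-implementation for illustration: a character counts as
    punctuation when its Unicode general category starts with 'P'
    (Pd, Po, Ps, Pe, and so on).
    """
    if not text:
        # Assumption: an empty string is not treated as punctuation.
        return False
    return all(unicodedata.category(ch).startswith("P") for ch in text)


# The same cases as the test above, run against the sketch.
results = {
    s: is_punctuation_sketch(s) for s in ["word", "。", "-", "-3", "あ"]
}
print(results)
```

With this definition, `'-3'` is rejected because `'3'` has category `Nd` (decimal digit), and `'あ'` is rejected because it has category `Lo` (other letter).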
