def test_ngrams(self):
    """With ngrams=2, tokenization yields each adjacent word pair."""
    expected = [
        ("foo", "bar"),
        ("bar", "bomb"),
        ("bomb", "blar"),
    ]
    tokens = pytextparser.word_tokenize(text="foo bar bomb blar", ngrams=2)
    assert list(tokens) == expected
def test_ignores_numeric(self):
    """Purely numeric tokens are dropped from the output."""
    tokens = pytextparser.word_tokenize(text="one two 3 four")
    assert list(tokens) == [("one",), ("two",), ("four",)]
def test_min_length(self):
    """Only words meeting min_length survive tokenization."""
    tokens = pytextparser.word_tokenize(
        text="one for the money two for the go",
        min_length=4,
    )
    assert list(tokens) == [("money",)]
def test_ignores_stopwords(self):
    """Words in the stopwords set are excluded, case-insensitively."""
    stop = {"the", "of", "is"}
    tokens = pytextparser.word_tokenize(
        text="The first rule of python is",
        stopwords=stop,
        min_length=1,
    )
    assert list(tokens) == [("first",), ("rule",), ("python",)]
def test_splits_punctuation(self):
    """Punctuation is stripped and does not join adjacent words."""
    tokens = pytextparser.word_tokenize(text="first. second")
    assert list(tokens) == [("first",), ("second",)]
def test_sentence(self):
    """A plain sentence tokenizes into one unigram tuple per word."""
    expected = [("hello",), ("cruel",), ("world",)]
    tokens = pytextparser.word_tokenize(text="hello cruel world")
    assert list(tokens) == expected