Python word_tokenizeの例

プログラミング言語: Python

名前空間/パッケージ名: tokenize

メソッド/関数: word_tokenize

hotexamples.comのコード掲載数: 4

Python word_tokenize - 4件のコード例が見つかりました。すべてオープンソースプロジェクトから抽出されたPythonのtokenize.word_tokenizeの実例で、最も評価が高いものを厳選しています。コード例の評価を行っていただくことで、より質の高いコード例が表示されるようになります。

コード例 #1

ファイルを表示

ファイル: running_median.py プロジェクト: datascientistone/insight-data-engineering-challenge

def main():
    # Read the list of files from the command line arguments
    # Make sure they are ordered in alphabetical order
    files = sorted(sys.argv[1:])
    file_input = fileinput.FileInput(files)

    running_median = median.RunningMedian()
    for line in file_input:
        words = list(tokenize.word_tokenize(line))
        running_median.add(len(words))
        print ('%.1f' % running_median.get_median())

コード例 #2

ファイルを表示

def main():
    # Read the list of files from the command line arguments
    # Make sure they are ordered in alphabetical order
    files = sorted(sys.argv[1:])
    file_input = fileinput.FileInput(files)

    running_median = median.RunningMedian()
    for line in file_input:
        words = list(tokenize.word_tokenize(line))
        running_median.add(len(words))
        print('%.1f' % running_median.get_median())

コード例 #3

ファイルを表示

def main():
    # Read the list of files from the command line arguments
    # Make sure they are ordered in alphabetical order
    files = sorted(sys.argv[1:])
    file_input = fileinput.FileInput(files)

    # Count frequencies using a Counter object based on Python's dict
    # For more memory efficient implementation we may use Trie data structure
    word_counter = collections.Counter()
    for line in file_input:
        for word in tokenize.word_tokenize(line):
            word_counter[word] += 1

    for word in sorted(word_counter.keys()):
        print('%s\t%s' % (word, word_counter[word]))

コード例 #4

ファイルを表示

ファイル: word_count.py プロジェクト: datascientistone/insight-data-engineering-challenge

def main():
    # Read the list of files from the command line arguments
    # Make sure they are ordered in alphabetical order
    files = sorted(sys.argv[1:])
    file_input = fileinput.FileInput(files)

    # Count frequencies using a Counter object based on Python's dict
    # For more memory efficient implementation we may use Trie data structure
    word_counter = collections.Counter()
    for line in file_input:
        for word in tokenize.word_tokenize(line):
            word_counter[word] += 1

    for word in sorted(word_counter.keys()):
        print ('%s\t%s' % (word, word_counter[word]))