Python Token.original_word Examples

Programming language: Python

Namespace/package name: token_class

Class/type: Token

Method/function: original_word

Examples on hotexamples.com: 2

Python Token.original_word - 2 examples found. These are top-rated, real-world Python examples of token_class.Token.original_word extracted from open source projects.

Frequently used methods

Token(30)

from_address(2)

original_word(2)

delete_tmp_token(1)

from_symbol(1)

get_tmp_token(1)

is_stopword(1)

lower(1)

replaced_word(1)

token2dig(1)

word_without_punctuations(1)
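
The examples on this page treat original_word as a plain attribute that is assigned after the Token is created. The real token_class.Token is not shown here, so the following is only a minimal sketch of what such a class could look like. The dataclass form, the default values, and the choice of fields (limited to the attributes used in the examples below) are all assumptions.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class Token:
    """Hypothetical stand-in for token_class.Token (the real class is not shown)."""
    original_word: Optional[str] = None              # the word exactly as it appeared in the input
    word_without_punctuations: Optional[str] = None  # normalized form used for lookups
    is_stopword: bool = False                        # set by the tokenizer
    replaced_word: Optional[str] = None              # filled in only when a substitute is chosen


# Attributes are assigned after construction, as in the examples on this page.
token = Token()
token.original_word = "Amazing!"
token.word_without_punctuations = "amazing"
print(token)
```

Keeping original_word untouched while storing the normalized form separately makes it possible to rebuild the input text later.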

Code example #1

import nltk

from token_class import Token

# split_input_text_into_sentences, remove_surrounding_punctuations and
# set_parts_of_speech_in_tokens are defined elsewhere in the source project
# and are not shown on this page.


def tokenize(input_text):
    # Fetch the NLTK stopword corpus (a no-op if it is already installed).
    nltk.download('stopwords')
    stopwords = set(nltk.corpus.stopwords.words('english'))

    sentences = split_input_text_into_sentences(input_text)

    all_tokens_of_all_sentences = []

    for sent in sentences:
        tokens_in_this_sentence = []

        for word in sent.split():
            token = Token()
            # Keep the word as written so the text can be rebuilt later.
            token.original_word = word
            # Normalized form used for the stopword lookup.
            token.word_without_punctuations = remove_surrounding_punctuations(
                word).lower()
            token.is_stopword = token.word_without_punctuations in stopwords

            tokens_in_this_sentence.append(token)

        all_tokens_of_all_sentences.append(tokens_in_this_sentence)

    # Annotate every token with its part of speech.
    set_parts_of_speech_in_tokens(all_tokens_of_all_sentences)

    return all_tokens_of_all_sentences
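
The tokenize function above depends on NLTK and on several project helpers that are not shown. To try the same flow without them, here is a rough self-contained sketch. The Token class, the tiny STOPWORDS set, and the bodies of remove_surrounding_punctuations and split_input_text_into_sentences are hypothetical stand-ins, not the project's implementations, and the part-of-speech step is left out.

```python
import re
import string


class Token:
    """Hypothetical stand-in for token_class.Token."""

    def __init__(self):
        self.original_word = None
        self.word_without_punctuations = None
        self.is_stopword = False


# Tiny illustrative stopword set (assumption); the original uses the NLTK corpus.
STOPWORDS = {"this", "is", "a", "the", "it"}


def remove_surrounding_punctuations(word):
    # Assumed behaviour: strip ASCII punctuation from both ends only.
    return word.strip(string.punctuation)


def split_input_text_into_sentences(text):
    # Assumed behaviour: split on whitespace that follows '.', '!' or '?'.
    return [s for s in re.split(r"(?<=[.!?])\s+", text.strip()) if s]


def tokenize(input_text):
    all_tokens_of_all_sentences = []
    for sent in split_input_text_into_sentences(input_text):
        tokens_in_this_sentence = []
        for word in sent.split():
            token = Token()
            token.original_word = word
            token.word_without_punctuations = (
                remove_surrounding_punctuations(word).lower())
            token.is_stopword = token.word_without_punctuations in STOPWORDS
            tokens_in_this_sentence.append(token)
        all_tokens_of_all_sentences.append(tokens_in_this_sentence)
    return all_tokens_of_all_sentences


sentences = tokenize("This is amazing! It works.")
for sentence in sentences:
    print([(t.original_word, t.is_stopword) for t in sentence])
```

Each sentence becomes its own list of tokens, so later stages can process the text sentence by sentence while still having the untouched original_word of every token.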

Code example #2

File: output.py Project: avaneesh93/defeating-plagiarism-checkers

                                                     token.original_word)

                output_text += replaced_word + " "

            elif token.original_word:
                output_text += token.original_word + " "

    return output_text.strip()


if __name__ == '__main__':
    # Quick manual check: carry the case and punctuation of the original
    # word over to its replacement.
    rep = "excellent"
    orig = ",Amazing."

    rep = restore_case(rep, remove_surrounding_punctuations(orig))
    rep = restore_punctuations(rep, orig)

    print(rep)

    # Build one sentence of three tokens; only the last has a replacement.
    t1 = Token()
    t1.original_word = 'This'

    t2 = Token()
    t2.original_word = 'is'

    t3 = Token()
    t3.original_word = 'amazing!'
    t3.replaced_word = 'awesome'

    print(generate_output_text_from_tokens([[t1, t2, t3]]))
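
The excerpt above starts partway through generate_output_text_from_tokens, and restore_case and restore_punctuations are not shown at all. The sketch below is one guess at how these pieces could fit together, based only on the visible fragment and the function names. Every function body here, and the Token class, is a hypothetical reconstruction rather than the project's actual code.

```python
import string


class Token:
    """Hypothetical stand-in for token_class.Token."""

    def __init__(self):
        self.original_word = None
        self.replaced_word = None


def remove_surrounding_punctuations(word):
    # Assumed behaviour: strip ASCII punctuation from both ends only.
    return word.strip(string.punctuation)


def restore_case(replacement, original):
    # Assumed behaviour: mirror an all-caps word or a leading capital.
    if len(original) > 1 and original.isupper():
        return replacement.upper()
    if original[:1].isupper():
        return replacement.capitalize()
    return replacement


def restore_punctuations(replacement, original):
    # Assumed behaviour: re-attach the original leading and trailing punctuation.
    core = remove_surrounding_punctuations(original)
    if not core:
        return replacement
    start = original.find(core)
    return original[:start] + replacement + original[start + len(core):]


def generate_output_text_from_tokens(all_tokens_of_all_sentences):
    output_text = ""
    for tokens_in_this_sentence in all_tokens_of_all_sentences:
        for token in tokens_in_this_sentence:
            if token.replaced_word:
                # Make the replacement look like the word it stands in for.
                core = remove_surrounding_punctuations(token.original_word)
                replaced_word = restore_case(token.replaced_word, core)
                replaced_word = restore_punctuations(replaced_word,
                                                     token.original_word)
                output_text += replaced_word + " "
            elif token.original_word:
                output_text += token.original_word + " "
    return output_text.strip()


t1, t2, t3 = Token(), Token(), Token()
t1.original_word = 'This'
t2.original_word = 'is'
t3.original_word = 'amazing!'
t3.replaced_word = 'awesome'

result = generate_output_text_from_tokens([[t1, t2, t3]])
print(result)
```

Under these assumptions, tokens without a replacement are emitted unchanged from original_word, which is why the tokenizer stores the word exactly as written.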