Code example #1
0
def _run_pre_proc_pipeline(corpus):
    """Apply the shared Pre_Processing pipeline to every text in *corpus*.

    Each text is lower-cased, stripped of punctuation, cleaned, tokenized,
    stop-word-filtered, and lemmatized, in that order.

    Args:
        corpus: iterable of raw text documents.

    Returns:
        list: one pre-processed (tokenized) document per input text.
    """
    processed = []
    for text in corpus:
        doc = Pre_Processing.lower_case(text)
        doc = Pre_Processing.remove_punctuation(doc)
        doc = Pre_Processing.clean_text(doc)
        doc = Pre_Processing.tokenization(doc)
        doc = Pre_Processing.remove_stopwords(doc)
        doc = Pre_Processing.lemmatize_words(doc)
        processed.append(doc)
    return processed


def TFIDF_pre_proc(original_corpus, suspicious_corpus):
    """Pre-process both corpora for TF-IDF comparison.

    The two loops in the original were byte-identical pipelines; both
    corpora now go through the shared ``_run_pre_proc_pipeline`` helper.

    Args:
        original_corpus: iterable of raw original documents.
        suspicious_corpus: iterable of raw suspicious documents.

    Returns:
        list: ``[processed_originals, processed_suspicious]`` — the
        original corpus results first, matching the original ordering.
    """
    pre_processed_files = [
        _run_pre_proc_pipeline(original_corpus),
        _run_pre_proc_pipeline(suspicious_corpus),
    ]
    print("TFIDF Pre-Processing Complete")
    return pre_processed_files
Code example #2
0
def NGRAM_pre_proc(suspicious_corpus):
    """Pre-process the suspicious corpus for n-gram overlap detection.

    Each document is passed through the Pre_Processing steps in order:
    lower-case, punctuation removal, cleaning, tokenization, stop-word
    removal, and lemmatization.

    Args:
        suspicious_corpus: iterable of raw suspicious documents.

    Returns:
        list: one pre-processed (tokenized) document per input text.
    """
    # The pipeline stages, applied left to right to every document.
    pipeline = (
        Pre_Processing.lower_case,
        Pre_Processing.remove_punctuation,
        Pre_Processing.clean_text,
        Pre_Processing.tokenization,
        Pre_Processing.remove_stopwords,
        Pre_Processing.lemmatize_words,
    )
    processed_docs = []
    for document in suspicious_corpus:
        for stage in pipeline:
            document = stage(document)
        processed_docs.append(document)
    print("NGram Overlap Pre-Processing Complete")
    return processed_docs