Python feature_vector_from_email Examples

Programming language: Python

Namespace/package name: spampy.email_processor

Method/function: feature_vector_from_email

Examples from hotexamples.com: 3

Python feature_vector_from_email - 3 examples found. These are real-world Python examples of spampy.email_processor.feature_vector_from_email, extracted from open source projects.
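The excerpts on this page call feature_vector_from_email but do not show what it does internally. As a rough illustration only, assuming it maps an email to a fixed-length binary bag-of-words vector indexed by a vocabulary (an assumption, not confirmed by the source), a minimal sketch might look like the following. The helper name `toy_feature_vector` and the four-word vocabulary are hypothetical and are not part of spampy.

```python
import re


def toy_feature_vector(email, vocabulary):
    """Return a binary vector with one slot per vocabulary word.

    Hypothetical stand-in for illustration; not spampy's implementation.
    """
    # Lowercase the text and split it into alphanumeric tokens.
    tokens = set(re.findall(r"[a-z0-9]+", email.lower()))
    # Slot i is 1 if the i-th vocabulary word occurs in the email, else 0.
    return [1 if word in tokens else 0 for word in vocabulary]


vocabulary = ["make", "money", "week", "meeting"]
vector = toy_feature_vector("Do You Want To Make $1000 Or More Per Week?", vocabulary)
print(vector)  # [1, 0, 1, 0]
```

The vector length always equals the vocabulary size, which is why the examples below reshape to a fixed number of columns.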

Example #1

File: spam_classifier.py Project: wruochao19/spampy

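# Module-level setup assumed by this excerpt (not shown in the original snippet).
import os

import numpy as np
from sklearn.model_selection import train_test_split
from sklearn.svm import LinearSVC
from spampy import email_processor

linear_svc = LinearSVC()


def classify_email_with_enron(email):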
    """
    Classify whether the given email is spam, using the Enron dataset.
    Args:
      email (str):
        Raw e-mail.
    Returns:
      Spam or not.
    """

    vocablary_dict = email_processor.create_enron_dictionary()
    feature_vector = email_processor.feature_vector_from_email(
        email, vocablary_dict)
    double_dimesion_email = np.reshape(feature_vector, (-1, 3000))
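    # Rebuild the cache if either file is missing. The original condition
    # mixed `==` with `&`, which binds tighter than `==`, so only the first
    # file was effectively checked.
    if not (os.path.exists('enron_features_matrix.npy')
            and os.path.exists('enron_labels.npy')):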
        features_matrix, labels = email_processor.extract_enron_features()
        np.save('enron_features_matrix.npy', features_matrix)
        np.save('enron_labels.npy', labels)
    else:
        features_matrix = np.load('enron_features_matrix.npy')
        labels = np.load('enron_labels.npy')
    X_train, _, y_train, _ = train_test_split(features_matrix,
                                              labels,
                                              test_size=0.40)
    linear_svc.fit(X_train, y_train)
    return linear_svc.predict(double_dimesion_email)
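Example #1 caches the extracted features on disk and reloads them on later calls. A minimal sketch of that compute-or-load pattern follows, using `pickle` instead of `np.save` so that it runs with the standard library alone. The names `load_or_build` and `build_toy_data` are hypothetical, and the choice to rebuild whenever any cache file is missing is an assumption about the intended behavior.

```python
import os
import pickle
import tempfile


def load_or_build(paths, build):
    """Load cached objects from `paths`; rebuild all of them if any file is missing."""
    if not all(os.path.exists(p) for p in paths):
        # At least one cache file is missing: recompute and save everything.
        objects = build()
        for path, obj in zip(paths, objects):
            with open(path, "wb") as fh:
                pickle.dump(obj, fh)
        return objects
    # Every cache file exists: read them back in order.
    loaded = []
    for path in paths:
        with open(path, "rb") as fh:
            loaded.append(pickle.load(fh))
    return tuple(loaded)


calls = []


def build_toy_data():
    calls.append(1)  # record how often the expensive step runs
    return ([[1, 0], [0, 1]], [1, 0])  # (features, labels)


with tempfile.TemporaryDirectory() as tmp:
    paths = [os.path.join(tmp, "features.pkl"), os.path.join(tmp, "labels.pkl")]
    first = load_or_build(paths, build_toy_data)
    second = load_or_build(paths, build_toy_data)

print(first == second, len(calls))  # True 1
```

The second call reads from disk, so the build step runs only once.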

Example #2

File: test_email_processor.py Project: abdullahselek/spampy

def test_feature_vector_from_email(self):
    email = "<*****@*****.**> Do You Want To Make $1000 Or More Per Week? https://github.com"
    vocablary_dict = email_processor.get_vocablary_dict()
    feature_vector = email_processor.feature_vector_from_email(
        email, vocablary_dict
    )
    self.assertEqual(len(feature_vector), 1899)

Example #3

File: spam_classifier.py Project: wruochao19/spampy

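# Imports assumed by this excerpt. `train_svm` and `linear_svm` are defined
# elsewhere in spam_classifier.py and are not shown here.
import numpy as np
from spampy import email_processor


def classify_email(email):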
    """
    Classify whether the given email is spam.
    Args:
      email (str):
        Raw e-mail.
    Returns:
      Spam or not.
    """

    train_svm()
    vocablary_dict = email_processor.get_vocablary_dict()
    feature_vector = email_processor.feature_vector_from_email(
        email, vocablary_dict)
    double_dimesion_email = np.reshape(feature_vector, (-1, 1899))
    spam_prediction = linear_svm.predict(double_dimesion_email)
    return spam_prediction
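Both classifier examples reshape the feature vector before calling `predict`, because scikit-learn estimators expect a 2D array of shape (n_samples, n_features). A minimal sketch of that step, using a placeholder vector of zeros instead of a real spampy feature vector (the indices set to 1 are arbitrary):

```python
import numpy as np

n_features = 1899  # vector length asserted by the test in Example #2
feature_vector = np.zeros(n_features, dtype=int)  # placeholder, not real features
feature_vector[[3, 42]] = 1  # pretend two vocabulary words were present

# -1 lets NumPy infer the row count: 1899 values / 1899 columns = 1 row.
single_sample = np.reshape(feature_vector, (-1, n_features))
print(feature_vector.shape, single_sample.shape)  # (1899,) (1, 1899)
```

If the vector length does not match the column count, `np.reshape` raises a `ValueError`, which is why Example #1 uses 3000 columns for its Enron dictionary and Example #3 uses 1899.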