Python Segmentation.check_data Examples

Programming language: Python

Namespace/package name: segmentation

Class/type: Segmentation

Method/function: check_data

Examples at hotexamples.com: 1

Python Segmentation.check_data - 1 example found. This is a real-world Python example of segmentation.Segmentation.check_data, extracted from an open source project.

Example #1

File: main.py Project: rpaudel42/GrowthPrediction

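from segmentation import Segmentation
# Assumed module name: the source does not show where Classification is defined.
from classification import Classification


def main():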
    segmentation = Segmentation()
    segmentation.check_data()

    print("\n\n Start Segmenting Users Based on RFM Methods ... ")

    segmentation.get_rfm_metric()

    segmentation.get_rfm_index()

    segmentation.transaction.apply(segmentation.define_rfm_segment, axis=1)

    segmentation.save_rfm_segment_pie_chart()
    segmentation.save_rfm_segment_scatter_plot()

    # Label the segments generated by the RFM method
    segmentation.transaction['high_growth'] = segmentation.transaction.apply(segmentation.Label_Segments, axis=1)
    user_label = segmentation.transaction[['user', 'high_growth']]

    segmentation.save_user_group_bar_chart()
    print("\n\n Finished Segmenting Users Based on RFM Methods ... ")

    # ---- Now start creating features for prediction ----
    print("\n\n Start Feature Generation for Classification ... ")
    final_dataset = segmentation.create_features()
    final_dataset = final_dataset.merge(user_label)
    print("\n\n Finished Feature Generation  ... ")

    classification = Classification()
    X = final_dataset.drop(columns=['user', 'high_growth'])
    y = final_dataset['high_growth']

    # Initial prediction of high-growth merchants on the full feature set
    print("\n\nLogistic Regression before feature selection")
    classification.run_logistic_regression(X, y)

    # Feature selection using a correlation heatmap
    classification.correlation_heatmap(final_dataset)

    selected_feature = final_dataset.drop(
        columns=['monetary', 'fall_count', 'spring_count', 'summer_count', 'winter_count', 'spring_amt', 'summer_amt',
                 'winter_amt', 'fall_amt'])
    classification.correlation_heatmap(selected_feature)

    # New prediction on selected features
    X = selected_feature.drop(columns=['user', 'high_growth'])
    y = selected_feature['high_growth']

    print("\n\nLogistic Regression after feature selection")
    classification.run_logistic_regression(X, y)

    # Now run logistic regression using cross-validation with k=10
    print("\n\nLogistic Regression using Cross Validation")
    classification.run_logistic_cross_val(X, y)

    print("\n\nLogistic Regression with SMOTE Resampling")
    classification.run_logistic_regression_with_resampling(X, y)

    print("\n\nRun Decision Tree")
    classification.run_decision_tree(X, y)

    print("\n\nRun Random Forest")
    y_pred_rf = classification.run_random_forest(X, y)

    print("\n\nRun Support Vector Machine")
    classification.run_svm(X, y)

    # Finally, save and print the high-growth merchants predicted by the random forest
    final_dataset['predicted'] = y_pred_rf
    high_growth_merchant = final_dataset.loc[
        final_dataset['predicted'] == True, ['user', 'monetary']
    ].sort_values('monetary', ascending=False)

    high_growth_merchant.to_csv('high_growth_merchant.csv', index=False)

    print("HIGH GROWTH MERCHANT as Given by RANDOM FOREST: \n", high_growth_merchant)