GitHub - Sapphirine/Santander-Customer-Satisfaction: projectID:201712-27 Team members:jc4805, jq2261, ll2873

Folders and files:
features
input
output
rgf1.2
submission
.gitattributes
Readme.txt
kmeans_features.py
models_combine.r
pca_features.py
rgf.py
santander_models.pyc
santander_preprocess.py
santander_preprocess.pyc
train_models.py
tsne_features.py

Here are the steps to run the program:
While running the program, you may be asked to install packages, including but not limited to scikit-learn, spark-sklearn, pandas, and metrics.
1. Go to the folder project_folder/code
2. Move the data to project_folder/code/input
3. Extract features by running extract_features.sh
4. Confirm the features are saved in the folder project_folder/code/features
5. Run train_models.py to train different models using different combinations of classifiers and features
6. Run models_combine.r to combine the results from the different models in a way that optimizes the area under the ROC curve (AUC)
7. Find the results in the folder project_folder/code/submission
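
Step 4 asks you to confirm that the feature files were written. A minimal sketch of that check in Python follows. The helper name list_feature_files is hypothetical and not part of this repository, and the default path simply mirrors the folder named in the steps above; the actual feature file names are not documented here, so the helper only lists whatever non-empty files it finds.

```python
from pathlib import Path


def list_feature_files(features_dir="project_folder/code/features"):
    """Return the sorted names of non-empty files in the features folder.

    Hypothetical helper: the default path mirrors the README steps and
    may need adjusting to your local checkout.
    """
    folder = Path(features_dir)
    if not folder.is_dir():
        raise FileNotFoundError(f"features folder not found: {folder}")
    # Skip sub-folders and zero-byte files, which usually mean a failed run.
    return sorted(
        p.name for p in folder.iterdir()
        if p.is_file() and p.stat().st_size > 0
    )
```

Calling list_feature_files() after step 3 and getting an empty list (or a FileNotFoundError) suggests the feature extraction did not complete.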

The following explains what each source file does:
main interface:
models_combine.r: combines the results from the different models in a way that optimizes the area under the ROC curve (AUC)
train_models.py: trains different models using different combinations of classifiers and features
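
To illustrate what "combining results in a way that optimizes AUC" can look like, here is a minimal Python sketch that blends two models' predictions with a single weight chosen by grid search. This is an assumption for illustration only: it is not the method implemented in models_combine.r (which is written in R and is not described in detail here), and the function names auc and best_blend_weight are hypothetical.

```python
def auc(labels, scores):
    """AUC via the rank-sum (Mann-Whitney) formula; ties get average ranks.

    labels are 0/1 integers, scores are the predicted values.
    """
    order = sorted(range(len(scores)), key=lambda i: scores[i])
    ranks = [0.0] * len(scores)
    i = 0
    while i < len(order):
        j = i
        # Extend j to cover the whole group of tied scores.
        while j + 1 < len(order) and scores[order[j + 1]] == scores[order[i]]:
            j += 1
        avg_rank = (i + j) / 2 + 1  # 1-based average rank of the tie group
        for k in range(i, j + 1):
            ranks[order[k]] = avg_rank
        i = j + 1
    n_pos = sum(labels)
    n_neg = len(labels) - n_pos
    if n_pos == 0 or n_neg == 0:
        raise ValueError("AUC needs at least one positive and one negative")
    pos_rank_sum = sum(r for r, y in zip(ranks, labels) if y == 1)
    return (pos_rank_sum - n_pos * (n_pos + 1) / 2) / (n_pos * n_neg)


def best_blend_weight(labels, preds_a, preds_b, steps=20):
    """Grid-search w in [0, 1] so that w*a + (1-w)*b has the highest AUC."""
    best_w, best_auc = 0.0, -1.0
    for s in range(steps + 1):
        w = s / steps
        blended = [w * a + (1 - w) * b for a, b in zip(preds_a, preds_b)]
        score = auc(labels, blended)
        if score > best_auc:
            best_w, best_auc = w, score
    return best_w, best_auc
```

The weight should be chosen on held-out predictions rather than on the data the models were trained on, otherwise the blend tends to favor whichever model overfits most.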
basic source code:
santander_preprocess.py: preprocesses the raw data
tsne_features.py: produces t-SNE features
pca_features.py: produces two PCA features (the number of PCA components can be edited)
kmeans_features.py: produces k-means features with 2 to 10 clusters
rgf.py: the regularized greedy forest (RGF) classification algorithm
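
As a rough illustration of the k-means and PCA features described above, the following Python sketch builds one cluster-label column for each k from 2 to 10 and two principal-component columns, using scikit-learn. It is a sketch under assumptions, not the code in kmeans_features.py or pca_features.py: the function name cluster_and_pca_features, the column names, the standardization step, and the fixed seed are all hypothetical choices made for this example.

```python
import numpy as np
from sklearn.cluster import KMeans
from sklearn.decomposition import PCA
from sklearn.preprocessing import StandardScaler


def cluster_and_pca_features(X, k_values=range(2, 11), n_pca=2, seed=0):
    """Return a dict of new feature columns derived from the matrix X.

    Assumption: features are standardized first so that no single
    large-scale column dominates the distances used by k-means and PCA.
    """
    X_scaled = StandardScaler().fit_transform(X)
    columns = {}
    # One cluster-assignment column per k (2 to 10 clusters by default).
    for k in k_values:
        km = KMeans(n_clusters=k, n_init=10, random_state=seed)
        columns[f"kmeans_{k}"] = km.fit_predict(X_scaled)
    # The first n_pca principal components as continuous features.
    pcs = PCA(n_components=n_pca, random_state=seed).fit_transform(X_scaled)
    for i in range(n_pca):
        columns[f"pca_{i + 1}"] = pcs[:, i]
    return columns
```

The returned dict can be passed to pandas.DataFrame(...) and joined to the preprocessed data; changing n_pca corresponds to editing the number of PCA components mentioned above.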
