Skip to content

anqitu/kaggle-microsoft-malware-prediction

Repository files navigation

Microsoft Malware Prediction

Group

  • Group ID: 3
  • Group Members:
    • Tu Anqi (Team Leader)
    • Clarence Castillo
    • Tang Jiayun
    • Eko Edita Limanta
    • Andre Kristanto
    • Hans Albert Lianto

Instructions

  1. Download train.csv and test.csv from Kaggle Competition - Microsoft Malware Prediction
  2. Unzip the csv files, put them under the /data folder.
  3. Run pip install -r requirements.txt to install all required python libraries. ( Use Python version 3.6.8)
  4. (Optional) The script 1_check_data.py is to
    • generate data summary (missing frequency, value counts for categorical data, etc.), and
    • generate plots to visualize data (boxplot for numeric data, histogram for categorical data, etc.)
  5. (Optional) Run the script 2_analyze_data.py to
    • perform data analytics, and
    • check rationales behind data preprocessing steps in the experiment for next step
  6. Run the script 3_experiment.py to conduct experiment on the prediction task
  7. (Optional) Run the script 4_compare_performance.py to compare performance of all models
  8. Run the script 5_submit.py to predict for the Kaggle test set for submission

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Contributors 4

  •  
  •  
  •  
  •  

Languages