Skip to content

EdwardBetts/awesome-kagg-ml

 
 

Repository files navigation

Driver Telematics Analysis - Team PIKKI

The code reached an AUC of 0.8850 on the public leaderboard on kaggle for the Driver Telematic Analysis .

The code is structured as follows:

  1. Step_1_Create_DataFrame.py:

In this step, the entire kaggle data set is read into data frames and saved in HDF5 files. This step only has to be done once, and reduces the time required for Step 2 and 3.

  1. Step_2_Extract_Features.py:

Here, features for each trip are extracted, put in a data frame and saved in HDF5 format.

  1. Step_3_Classify.py:

In this step, a supervised learning approach (Random Forest Classifier) is used. For this, a classifier is trained for each driver as positive set and 200 random trips from other drivers are used as negative training set.

About

ML in Practice - Kaggle Project

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • HTML 63.5%
  • Python 30.1%
  • Jupyter Notebook 2.9%
  • MATLAB 2.7%
  • Other 0.8%