Skip to content

Transaction fraud detection using machine learning and nature inspired algorithms.

Notifications You must be signed in to change notification settings

pmacinec/transactions-fraud-detection

Repository files navigation

Transactions Fraud Detection

Authors: Peter Macinec, Timotej Zatko

Prerequisites

Installation and running

To run this project, please make sure you have Docker installed. After, follow the steps:

  1. Get into project root repository.
  2. Setup repository using command:
    sh ./scripts/setup.sh
    
  3. Download data (you need to have Kaggle API installed). Don't for get to accept the rules of competition.
    sh ./scripts/download_data.sh
    
  4. Build docker image:
    sh ./scripts/build.sh
    
  5. Run docker container using command:
    sh ./scripts/run.sh
    
  6. Run initial dataset processing:
    1. Get into docker container:
      docker exec -it transactions_fraud_detection_con bash    
      
    2. Run script with data preprocessing:
      python src/preprocessing/initial_preprocessing.py
      
      Note: You can run this script also outside the container, but make sure you have python with pandas library installed.

Dataset

To solve our problem of transactions fraud detection, we are using the data from IEEE-CIS Fraud Detection competition at Kaggle. The large-scale dataset contains real-world e-commerce transactions with a variety of features included. Features are of different types. describing transactions, products or other useful information (such as categorical DeviceType). Many of the features are not described properly, so results explanation will not be so clear.

Paper

Paper LaTex source files are available in the paper dircetory. The readable document in pdf format is in the following location -- paper/main.pdf.

Presentation

Presentation describing whole project and results is available at presentation.pdf.

Releases

No releases published

Packages

No packages published

Languages