Skip to content

yizhiai1994/predictive-monitoring-benchmark

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

45 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Benchmark for outcome-oriented predictive process monitoring

Supplementary material for the article "Outcome-Oriented Predictive Process Monitoring: Review and Benchmark" by Irene Teinemaa, Marlon Dumas, Marcello La Rosa, and Fabrizio Maria Maggi.

This repository provides implementations of different techniques for outcome-oriented predictive business process monitoring. The aim of these techniques is to predict a (pre-defined) binary case outcome of a running (partial) trace.

The benchmark includes implementations of four sequence encodings (for further description, refer to the paper):

  • Static encoding
  • Last state encoding
  • Aggregation encoding
  • Index-based encoding

Moreover, the repository contains implementations of four bucketing methods (see the paper for more details):

  • No bucketing
  • KNN
  • State-based
  • Clustering
  • Prefix length based

The benchmark experiments have been performed using four classifiers:

  • Random forest
  • Gradient boosted trees (XGBoost)
  • Logistic regression
  • SVM

Together with the code, we make available 22 datasets that were used in the evaluation section in the paper (2 datasets used in the paper are private). These datasets correspond to different prediction tasks, formulated on 8 publicly available event logs (namely, the BPIC 2011, BPIC 2012, BPIC 2015, BPIC 2017, Sepsis Cases, Hospital Billing, Road Traffic Fine Management, Production log event logs). These (labeled and preprocessed) benchmark datasets can be found at https://drive.google.com/open?id=154hcH-HGThlcZJW5zBvCJMZvjOQDsnPR.

If you use code from this repository, please cite the following paper:

@article{Teinemaa2017,
   author = {{Teinemaa}, I. and {Dumas}, M. and {La Rosa}, M. and {Maggi}, F.~M.},
    title = "{Outcome-Oriented Predictive Process Monitoring: Review and Benchmark}",
  journal = {ArXiv e-prints},
archivePrefix = "arXiv",
   eprint = {1707.06766},
 primaryClass = "cs.AI",
 keywords = {Computer Science - Artificial Intelligence},
     year = 2017,
    month = jul,
   adsurl = {http://adsabs.harvard.edu/abs/2017arXiv170706766T},
  adsnote = {Provided by the SAO/NASA Astrophysics Data System}
}

About

Benchmark evaluation for outcome-based predictive monitoring

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Python 96.2%
  • Jupyter Notebook 3.8%