Skip to content

mesgarpour/ERMER

Repository files navigation




The Ensemble model

Linux Debian Linux Fedora Mac OS Windows
CircleCI CircleCI CircleCI CircleCI

The ERMER Notebook:

Windows
CircleCI

Version: 1.1 (2017.12.16)
Supported Python Version: 3.5

Ensemble Risk Model of Emergency Admissions (ERMER) is an optimised ensemble of sub-models that are trained using Bayes Point Machine (BPM). The features of the model are generated using the Healthcare Pre-Processing Framework, but it is not integrated into the ERMER development toolkit, in order to preserve the tool's generic structure. The ERMER development toolkit is a generic, user-friendly and open-source software package that can be used for development of temporal comorbidity index independent of source of healthcare data..



Introduction

About half of hospital readmissions can be avoided with preventive interventions. Developing decision support tools for identification of patients' emergency readmission risk is an important area of research. Because, it remains unclear how to design features and develop predictive models that can adjust continuously to a fast-changing healthcare system and population characteristics. The objective of this study was to develop a generic Ensemble of Bayesian emergency readmission risk models.

Most existing decision support tools, that are based on hospital administrative data, use Logistic Regression or Coxian Phase-type Distribution models, and have very limited capability. This phase of research develops an Ensemble generative risk model of emergency readmission within a year to the England's hospitals. The Machine Learning Ensemble method is a powerful technique, which uses a finite set of weaker models and an algorithm to combine and optimise the performance of the Ensemble model.

An Ensemble of generated BPM models of emergency readmission has been developed, which is based on a collection of sub-models that are conditioned on different population characteristics. The proposed model, Ensemble Risk Model of Emergency Admissions (ERMER), utilises a weighted average ranking method to optimise the weights of sub-classifier using a bidirectional hill-climbing heuristic. The novelty lies in the intuitive adaptation of an Ensemble modelling with a generative approach for prediction of patients' risks. Moreover, the Ensemble of specialised sub-models for prediction of patients risks has not been addressed with existing studies.

In this research, Microsoft's Infer.Net library was used to construct the BPM model. The applied algorithm uses the original version of the BPM, with two main modifications. Firstly, it uses a mixture of Gamma-Gamma, a heavy-tailed prior probability distribution for the precision of weights and features. Secondly, it applies Expectation Propagation message passing to infer posterior probabilities, which has been demonstrated in Gaussian Mixture problems to be better than approximation techniques.

Therefore, the applied BPM is invariant to parameter rescaling or shifting, unlikeLogistic Regression or Support Vector Machine (SVM) methods. Moreover, active Bayesian training can allow continuous updates of the model and account for changes in the prior probabilities. Furthermore, the BPM can efficiently handle a relatively larger number of features.

Performance

Based on the defined Healthcare Pre-Processing Framework introduced in the Phase-I of our research, features were generated, filtered, and ranked. Thereafter, a number of sub-models based on population characteristics were trained using a BPM approach. Afterwards, an optimised Ensemble model of these sub-models was generated. The developed ERMER was trained and tested using three time-frames: 1999-2004, 2000-05, and 2004-09, each of which includes 20% of patients admitted within the trigger-year. In addition, a development toolkit is supplemented to ease the validation and adaptation of the ERMER.

Comparisons are made for different time-frames, sub-populations, risk cut-offs, risk bands, and top risk segments. The precision was 71.6% to 73.9%, the specificity was 88.3% to 91.7%, and the sensitivity was 42.1% to 49.2% across different time-frames. Moreover, the area under the curve was 75.9% to 77.1%.

The proposed decision support tool performed considerably better than the previous modelling approaches, and it was robust and stable with high precision. Moreover, the Healthcare Pre-Processing Framework and the Ensemble Bayesian approach allow the ERMER to continuously be adjusted to new significant features, different population characteristics and changes in the system.

Getting Started

The development toolkit consists of two parts:

The Notebooks include detailed guides about the ERMER configurations, and it is highly recommended to adjust them based on your inputted features and hardware settings.

Moreover, a sample is provided for demonstration purpose.

Related Publications

Licenses

Credits

Original Author: Mohsen Mesgarpour, Health and Social Care Modelling Group (HSCMG), University of Westminster.

Most Recent Author: Mohsen Mesgarpour, Health and Social Care Modelling Group (HSCMG), University of Westminster.

About

Ensemble Risk Model of Emergency Readmission

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages