InterpretML - Alpha Release

In the beginning machines learned in darkness, and data scientists struggled in the void to explain them.

Let there be light.

InterpretML is an open-source python package for training interpretable models and explaining blackbox systems. Interpretability is essential for:

Model debugging - Why did my model make this mistake?
Detecting bias - Does my model discriminate?
Human-AI cooperation - How can I understand and trust the model's decisions?
Regulatory compliance - Does my model satisfy legal requirements?
High-risk applications - Healthcare, finance, judicial, ...

Historically, the most intelligible models were not very accurate, and the most accurate models were not intelligible. Microsoft Research has developed an algorithm called the Explainable Boosting Machine (EBM)^* which has both high accuracy and intelligibility. EBM uses modern machine learning techniques like bagging and boosting to breathe new life into traditional GAMs (Generalized Additive Models). This makes them as accurate as random forests and gradient boosted trees, and also enhances their intelligibility and editability.

Notebook for reproducing table

Dataset/AUROC	Domain	Logistic Regression	Random Forest	XGBoost	Explainable Boosting Machine
Adult Income	Finance	.907±.003	.903±.002	.922±.002	*.928±.002*
Heart Disease	Medical	.895±.030	.890±.008	.870±.014	*.916±.010*
Breast Cancer	Medical	*.995±.005*	.992±.009	*.995±.006*	*.995±.006*
Telecom Churn	Business	.804±.015	.824±.002	.850±.006	*.851±.005*
Credit Fraud	Security	.979±.002	.950±.007	*.981±.003*	.975±.005

In addition to EBM, InterpretML also supports methods like LIME, SHAP, linear models, partial dependence, decision trees and rule lists. The package makes it easy to compare and contrast models to find the best one for your needs.

* EBM is a fast implementation of GA²M. Details on the algorithm can be found here.

Installation

Python 3.5+ | Linux, Mac OS X, Windows

pip install -U interpret

Getting Started

Let's fit an Explainable Boosting Machine

from interpret.glassbox import ExplainableBoostingClassifier

ebm = ExplainableBoostingClassifier()
ebm.fit(X_train, y_train)

# EBM supports pandas dataframes, numpy arrays, and handles "string" data natively.

Understand the model

from interpret import show

ebm_global = ebm.explain_global()
show(ebm_global)

Understand individual predictions

ebm_local = ebm.explain_local(X_test, y_test)
show(ebm_local)

And if you have multiple models, compare them

show([logistic_regression, decision_tree])

Example Notebooks

Roadmap

Currently we're working on:

Multiclass Classification Support
Missing Values Support
Improved Categorical Encoding

...and lots more! Get in touch to find out more.

Contributing

If you are interested contributing directly to the code base, please see CONTRIBUTING.md.

Acknowledgements

InterpretML was originally created by (equal contributions): Samuel Jenkins & Harsha Nori & Paul Koch & Rich Caruana

Many people have supported us along the way. Check out ACKNOWLEDGEMENTS.md!

We also build on top of many great packages. Please check them out!

Contact us

There are multiple ways to get in touch:

Email us at interpret@microsoft.com
Or, feel free to raise a GitHub issue

Reporting Security Issues (we had to include this...)

Security issues and bugs should be reported privately, via email, to the Microsoft Security Response Center (MSRC) at secure@microsoft.com. You should receive a response within 24 hours. If for some reason you do not, please follow up via email to ensure we received your original message. Further information, including the MSRC PGP key, can be found in the Security TechCenter.

If a tree fell in your random forest, would anyone notice?

Name		Name	Last commit message	Last commit date
Latest commit History 549 Commits
benchmarks		benchmarks
core		core
examples/python		examples/python
python		python
tests/core		tests/core
.gitattributes		.gitattributes
.gitignore		.gitignore
ACKNOWLEDGEMENTS.md		ACKNOWLEDGEMENTS.md
CHANGELOG.md		CHANGELOG.md
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE		LICENSE
README.md		README.md
azure-pipelines.yml		azure-pipelines.yml
build.bat		build.bat
build.sh		build.sh
interpret.sln		interpret.sln

License

zhiwuya/interpret

Folders and files

Latest commit

History

Repository files navigation

InterpretML - Alpha Release

In the beginning machines learned in darkness, and data scientists struggled in the void to explain them.

Let there be light.

Installation

Getting Started

Example Notebooks

Roadmap

Contributing

Acknowledgements

Contact us

Reporting Security Issues (we had to include this...)

If a tree fell in your random forest, would anyone notice?

About

Resources

License

Stars

Watchers

Forks

Languages