Skip to content

Code for "Debugging Machine Learning Pipelines". the Algorithms of MLDebugger are now part of BugDoc, a general pipeline debugger framework. This repository will keep the experiments for the DEEM paper, please check BugDoc's repository for the latest developments.

License

smduan/MLDebugger

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

26 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

MLDebugger Build Status

MLDebugger is a framework for finding root causes of errors in machine learning pipelines. For more detailed information about the framework, please refer to our DEEM paper:

Debugging Machine Learning Pipelines. R. Lourenco, J. Freire, and D. Shasha. In Proceedings of the 3rd International Workshop on Data Management for End-to-End Machine Learning (DEEM),2019

The team includes:

We strongly suggest users to read our paper before using our code.

1. How To Build

To install latest development version:

$ pip install -e .

2. How to Run

To run our example with a machine learning pipeline written in VisTrail, first you need to start a worker:

$ worker &

Then run the executable passing the pipeline path and property-value search space:

$ mldebugger examples/classification_pipeline.vt examples/params.json

More detailed documentation is coming soon.

About

Code for "Debugging Machine Learning Pipelines". the Algorithms of MLDebugger are now part of BugDoc, a general pipeline debugger framework. This repository will keep the experiments for the DEEM paper, please check BugDoc's repository for the latest developments.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Python 100.0%