Scripts for trying out or deploying your models with different deep learning inference engines.
In the lab, people often evaluate a model in exactly the same environment it was trained in, which is fast and requires little effort, but it is not appropriate for efficient production deployment.
A typical production-scale model deployment pipeline usually involves the steps below:
- Train your model with your favorite deep learning framework (TensorFlow/PyTorch/Caffe).
- Export your model to a frozen graph, e.g. a `.pb` file for TensorFlow or a `.onnx` file for PyTorch.
- Convert the frozen graph from step 2 to the IR of your target inference engine:
  - Native TensorFlow
  - Native TensorFlow-Keras
  - TensorFlow Lite
  - ONNX Runtime
  - OpenVINO
  - TensorRT
  - MNN (TODO)
  - NCNN (TODO)
- Freeze your model and convert it to the engine-specific IR. `tf_graph_tookit.py`/`converter.py` provide helper functions for exporting your model and converting it to an IR.
- Try out your desired inference engine:

```python
model_dir = ''  # path to your converted IR
ie = SomeIE(model_dir, *args, **kwargs)
input_data = None  # your preprocessed input data
result = ie.predict(input_data)
```
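
The `SomeIE` interface above can be sketched as follows. `DummyIE` is a hypothetical stand-in, not a wrapper from this repo: it mimics the contract (constructor takes the IR directory, `predict` runs one forward pass) with a fixed NumPy linear layer instead of a real engine.

```python
import numpy as np

class DummyIE:
    """Hypothetical stand-in for an inference-engine wrapper.

    A real wrapper would load and compile the converted IR from
    model_dir; here a fixed linear layer plays the model's role.
    """

    def __init__(self, model_dir, *args, **kwargs):
        self.model_dir = model_dir  # path to the converted IR
        self.weight = np.ones((4, 2), dtype=np.float32)  # stand-in "model"

    def predict(self, input_data):
        # A real engine would execute the compiled graph here.
        return input_data @ self.weight

ie = DummyIE('converted_ir/')
result = ie.predict(np.ones((1, 4), dtype=np.float32))
print(result.shape)  # (1, 2)
```

Keeping every engine behind the same two-method surface is what lets you swap TensorRT for ONNX Runtime (or any other backend) without touching the calling code.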