Speeding Up TCT Program Inference
- Hardware:
  - CPU: Intel(R) Xeon(R) Gold 6138 CPU @ 2.00GHz
  - MEM: 256GB
  - GPU: Tesla P100
  - NETWORK: 56Gbps InfiniBand
- OS: CentOS 7.2
- Kernel: 3.10.0-327.el7.x86_64
- TensorFlow: tensorflow-1.8.0-cp27-cp27mu-linux_x86_64.whl
- Python: 2.7.5
- TensorRT: TensorRT-3.0.4.Ubuntu-14.04.5.x86_64.cuda-9.0.cudnn7.0.tar.gz
- CUDA: v9.0.176
- cuDNN: v7.1.4.18
- NVIDIA Driver: v384.81
- Ceph: v12.2.1
- yum install openslide
- pip install openslide-python==1.1.1
- pip install h5py==2.6.0
- pip install keras==2.1.2
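To sanity-check this environment, a minimal sketch (these are standard TF 1.8 calls; the expected values in the comments reflect the setup above):

```python
# Quick environment sanity check for the setup listed above.
import tensorflow as tf
import tensorflow.contrib.tensorrt as trt  # fails if TF was built without TensorRT support

print(tf.__version__)              # expect 1.8.0
print(tf.test.is_gpu_available())  # expect True on the Tesla P100
```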
Our cervical cancer prediction program uses a model trained with the Keras framework. TensorRT subgraph conversion fails for this model; the error messages are shown below:
2018-05-23 16:30:20.805135: W tensorflow/contrib/tensorrt/convert/convert_graph.cc:412] subgraph conversion error for subgraph_index:1 due to: "Invalid argument: Node alexnet/conv1/BiasAdd should have an input named 'alexnet/conv1/biases' but it is not available" SKIPPING......( 56 nodes)
2018-05-23 16:30:20.807891: W tensorflow/contrib/tensorrt/convert/convert_graph.cc:412] subgraph conversion error for subgraph_index:2 due to: "Unimplemented: Require 4 dimensional input. Got 2 alexnet/fc1/MatMul" SKIPPING......( 2 nodes)
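For reference, the conversion uses the TF-TRT API shipped in `tensorflow.contrib` with TensorFlow 1.8. A minimal sketch (the output node name, file name, and workspace size are placeholders, not our exact settings):

```python
# Minimal TF-TRT conversion sketch (TensorFlow 1.8 contrib API).
# "frozen_model.pb", "logits", and the sizes are illustrative placeholders.
import tensorflow as tf
import tensorflow.contrib.tensorrt as trt

with tf.gfile.GFile("frozen_model.pb", "rb") as f:
    frozen_graph = tf.GraphDef()
    frozen_graph.ParseFromString(f.read())

trt_graph = trt.create_inference_graph(
    input_graph_def=frozen_graph,
    outputs=["logits"],                # placeholder output node name
    max_batch_size=1,
    max_workspace_size_bytes=1 << 30,  # 1GB workspace for TensorRT
    precision_mode="FP32")
```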
We then attempt to load the saved Keras weights and convert the ResNet50 model to a TensorFlow graph (TensorFlow backend):
$ python classification/resnet50.py
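The script roughly does the following (a sketch; `resnet50_weights.h5` and the class count are illustrative placeholders):

```python
# Sketch: load Keras ResNet50 weights and grab the underlying TF graph.
import tensorflow as tf
from keras import backend as K
from keras.applications.resnet50 import ResNet50

K.set_learning_phase(0)                    # inference mode (freezes dropout/BN behavior)
model = ResNet50(weights=None, classes=2)  # class count: placeholder for our task
model.load_weights("resnet50_weights.h5")  # placeholder weights file

sess = K.get_session()                     # Keras runs on a TF session/graph
print([out.op.name for out in model.outputs])  # output node names, needed for freezing
```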
If you then want to use the resulting model with your own or pretrained checkpoints, you can run freeze_inference_graph to get a GraphDef with the variables inlined as constants:
$ bash freeze_inference_graph.sh
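Under the hood, freezing replaces Variable nodes with Const nodes holding the checkpoint values. A minimal sketch using the TF 1.x API (`sess` is the Keras session from the previous step; the output node name is a placeholder, use the name printed above):

```python
# Sketch: freeze the graph by inlining variables as constants (TF 1.x API).
import tensorflow as tf
from tensorflow.python.framework import graph_util

frozen = graph_util.convert_variables_to_constants(
    sess,
    sess.graph_def,
    output_node_names=["fc1000/Softmax"])  # placeholder output node name

with tf.gfile.GFile("frozen_model.pb", "wb") as f:
    f.write(frozen.SerializeToString())
```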
TensorRT is NVIDIA's inference optimizer for deep learning. Briefly, TensorRT rewrites parts of the execution graph to allow for faster prediction times.
You have TensorFlow, TensorRT, a GraphDef, and a picture. Now it's time to time the inference.
For the full set of possible parameters, you can run:
$ python tensorrt_optimize_graph.py --help
Assuming you used the files provided above, you would run:
$ bash tensorrt_optimize_graph.sh
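The timing itself boils down to importing the (frozen or TRT-optimized) GraphDef and running an image through it. A minimal sketch (the file name and the tensor names `input:0` / `logits:0` are placeholders for the real ones):

```python
# Sketch: time inference on a single image with an imported GraphDef.
import time
import numpy as np
import tensorflow as tf

graph_def = tf.GraphDef()
with tf.gfile.GFile("trt_model.pb", "rb") as f:  # frozen or TRT-optimized graph
    graph_def.ParseFromString(f.read())

with tf.Graph().as_default() as graph:
    tf.import_graph_def(graph_def, name="import")
    inp = graph.get_tensor_by_name("import/input:0")
    out = graph.get_tensor_by_name("import/logits:0")
    image = np.random.rand(1, 224, 224, 3).astype(np.float32)  # stand-in picture

    with tf.Session(graph=graph) as sess:
        sess.run(out, feed_dict={inp: image})  # warm-up run
        start = time.time()
        for _ in range(100):
            sess.run(out, feed_dict={inp: image})
        print("avg inference time: %.2f ms" % ((time.time() - start) * 10))
```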
The native model (i.e., the frozen model) runs inference successfully; however, the TensorRT-converted model crashes during the inference phase. The detailed error is shown below:
Traceback (most recent call last):
  File "/root/anaconda3/lib/python3.4/site-packages/tensorflow/python/framework/importer.py", line 489, in import_graph_def
    graph._c_graph, serialized, options)  # pylint: disable=protected-access
tensorflow.python.framework.errors_impl.InvalidArgumentError: Shape must be rank 2 but is rank 4 for 'import/dense_p7/MatMul' (op: 'MatMul') with input shapes: [1,256,1,1], [256,1]
The original trained model completes inference successfully, but the model converted with TensorRT 3.0 fails at the inference phase. Why is the input shape [1,256,1,1] when it should originally be [1,256]? Does TensorRT not support the tf.matmul() operation? If it does not, how can we solve the problem?
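One direction we are considering (our assumption, not a confirmed fix): TensorRT 3.0 appears to keep tensors 4-dimensional (NCHW) inside converted segments, so the dense layer receives [1,256,1,1] instead of the flattened [1,256]. Making the flatten explicit before the MatMul might avoid the rank mismatch. A minimal sketch (all names here are illustrative, modeled on the failing `dense_p7/MatMul` node):

```python
# Sketch of the flatten workaround (an assumption, not a verified fix):
# make the rank-2 shape explicit before the dense layer so the converter
# does not hand MatMul a 4-D tensor.
import tensorflow as tf

pooled = tf.placeholder(tf.float32, [1, 256, 1, 1])  # stand-in for the pooling output
flat = tf.reshape(pooled, [1, 256], name="flatten")  # explicit rank-2 tensor
weights = tf.get_variable("dense_w", shape=[256, 1])
logits = tf.matmul(flat, weights)                    # now rank 2 x rank 2, as MatMul expects
```

Alternatively, keeping the fully connected layers out of the TensorRT segment (so they run in native TensorFlow) would sidestep the unsupported rank, at the cost of optimizing only the convolutional part.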