This repository contains supplementary code for the paper MUSCO: Multi-Stage COmpression of neural networks. It demonstrates how a neural network with convolutional and fully connected layers can be compressed using iterative tensor decomposition of weight tensors.
The `tensor_decomposition` folder contains code to decompose convolutional and fully connected weights using Tucker-2/CP3/CP4 tensor decompositions and SVD, respectively.
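For intuition, below is a minimal, self-contained sketch of a one-shot (HOSVD-style) Tucker-2 approximation of a convolutional weight tensor in plain NumPy. The helper `tucker2_hosvd` and its fixed ranks `r_out`, `r_in` are hypothetical illustration only and are not the repository's implementation; the actual code in this folder also selects ranks automatically (e.g. via VBMF, see below).

```python
import numpy as np

def tucker2_hosvd(W, r_out, r_in):
    # One-shot Tucker-2 approximation of a 4D convolutional weight
    # W of shape (C_out, C_in, kH, kW):  W ≈ core x_0 U_out x_1 U_in.
    C_out, C_in, kH, kW = W.shape

    # Truncated left singular vectors of the mode-0 (output-channel)
    # and mode-1 (input-channel) unfoldings of W.
    U_out, _, _ = np.linalg.svd(W.reshape(C_out, -1), full_matrices=False)
    U_in, _, _ = np.linalg.svd(np.transpose(W, (1, 0, 2, 3)).reshape(C_in, -1),
                               full_matrices=False)
    U_out, U_in = U_out[:, :r_out], U_in[:, :r_in]

    # Core tensor of shape (r_out, r_in, kH, kW): W contracted with
    # U_out^T along mode 0 and U_in^T along mode 1.
    core = np.einsum('oikl,or,is->rskl', W, U_out, U_in)

    # The approximation can be reconstructed as
    # np.einsum('rskl,or,is->oikl', core, U_out, U_in).
    return core, U_out, U_in
```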
The `demo` folder contains several notebooks and scripts that demonstrate how to
- compress convolutional/fully connected layers of any neural network using different tensor decompositions,
- iteratively compress a neural network model by alternating compression and fine-tuning steps.
Requirements
- PyTorch-1.0 and torchvision
- numpy
- scipy
- copy
- scikit-tensor
- tensorly
- shutil
- os
- absl-py
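As an assumption (package versions are not pinned in this README), the third-party packages above can typically be installed with pip; `copy`, `shutil`, and `os` ship with the Python standard library and need no installation. Note that the PyPI name for scikit-tensor may differ between Python versions (e.g. `scikit-tensor-py3` for Python 3).

pip install torch torchvision numpy scipy tensorly absl-py scikit-tensor-py3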
To avoid the package installation routine, create a docker container (titled `my_container`, for example) to work in, using the docker image https://hub.docker.com/r/jgusak/tensor_compression_od. Use port 4567 (or any other port you like) to run the Jupyter notebook.
nvidia-docker run --name my_container -it -v musco:/workspace/musco -v <datasets_dir>:/workspace/raid -p 4567:8888 jgusak/tensor_compression_od
In this example:
- `/workspace/musco` is the folder inside the container where the content of this repository is stored (the `PWD` variable from `model_utils/load_utils.py` defines the path to this folder),
- `/workspace/raid` is the directory inside the container where the datasets and models folders are stored,
- `<datasets_dir>` is the folder on the host machine where the datasets and models are stored,
- `-p 4567:8888` maps port 8888 of the docker container to port 4567 on the host machine. To map all ports from the docker container to the corresponding host ports, use `--net="host"` instead.
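If Jupyter is not started automatically inside the container (an assumption; the image's entry point is not described in this README), a typical way to launch it manually is

jupyter notebook --ip 0.0.0.0 --port 8888 --no-browser --allow-root

and then open http://localhost:4567 in a browser on the host machine.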
Data preparation
Prepare the dataset by storing it in the `DATA_ROOT` folder (`DATA_ROOT` can be specified in `model_utils/load_utils.py`).
Model compression
Please set the path to your working directory by changing the `PWD` variable inside `model_utils/load_utils.py`.
Pretrained models are needed. Please download them and specify the path to the pretrained models by setting the `PATH_TO_PRETRAINED` variable in `model_utils/load_utils.py`.
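For reference, here is a minimal sketch of how these path variables might look inside `model_utils/load_utils.py`; the variable names come from this README, while the concrete paths below are placeholders only.

```python
# model_utils/load_utils.py -- illustrative placeholder values only
PWD = '/workspace/musco'                                 # working directory (this repository)
DATA_ROOT = '/workspace/raid/data/datasets'              # where the datasets are stored
PATH_TO_PRETRAINED = '/workspace/raid/data/pretrained'   # downloaded pretrained models
SAVE_ROOT = PWD + '/results'                             # default folder for compression checkpoints
```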
Follow the instructions in `demo/compression_conv_fc.ipynb` to apply tensor decomposition to the selected convolutional layers. Compression can be applied to any custom-loaded neural network.
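As a toy illustration of the fully connected part (the SVD-based compression mentioned above), the sketch below replaces an `nn.Linear` layer with two smaller ones via a truncated SVD of its weight matrix. The helper `svd_compress_linear` and its fixed `rank` argument are hypothetical and not part of the repository's API.

```python
import torch
import torch.nn as nn

def svd_compress_linear(layer: nn.Linear, rank: int) -> nn.Sequential:
    # Approximate W (out_features x in_features) by a rank-`rank` truncated SVD
    # and replace the layer by two smaller linear layers.
    W = layer.weight.data
    U, S, V = torch.svd(W)                      # W = U @ diag(S) @ V.t()
    U_r, S_r, V_r = U[:, :rank], S[:rank], V[:, :rank]

    first = nn.Linear(layer.in_features, rank, bias=False)
    second = nn.Linear(rank, layer.out_features, bias=layer.bias is not None)

    first.weight.data = (V_r * S_r).t()         # shape: (rank, in_features)
    second.weight.data = U_r                    # shape: (out_features, rank)
    if layer.bias is not None:
        second.bias.data = layer.bias.data
    return nn.Sequential(first, second)
```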
Iterative compression
The notebook `demo/iterative_finetuning_conv_fc.ipynb` demonstrates how to perform iterative compression.
Checkpoints are saved in the `results` folder in the working directory (to modify the default save path, go to `model_utils/load_utils.py` and change the `SAVE_ROOT` variable).
To perform iterative compression, run the `demo/evaluate_demo.py` script (a complete list of startup parameters can be found inside the script).
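Since absl-py is among the requirements, the script's flags can presumably also be listed from the command line (an assumption about its flag interface), e.g.

python evaluate_demo.py --helpfull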
For example, to perform two-stage compression of VGG-16 (compress all layers - fine-tune - further compress all layers - fine-tune) using Tucker-2 tensor approximation of weight tensors with Bayesian rank selection (using VBMF and rank weakening with a weakening factor of 0.6), run
python evaluate_demo.py --model vgg16 \
--model_weights /workspace/raid/data/eponomarev/pretrained/imagenet/vgg16-397923af.pth \
--dataset imagenet \
--data_dir /workspace/raid/data/datasets \
--batches_per_train 10000000 \
--batches_per_val 10000000 \
--batch_size 64 \
--save_dir /workspace/raid/data/lmarkeeva/new_exp \
--conv_split 1 \
--validate_before_ft \
--ft_epochs 15 \
--patience 3 \
--compress_iters 2 \
--gpu_number 0 \
--weaken_factor 0.6
Don't forget to change the absolute paths to the model weights, data, and save folder in the above example.
If you use our research, we kindly ask you to cite the corresponding paper.
@article{gusak2019one,
title={One time is not enough: iterative tensor decomposition for neural network compression},
author={Gusak, Julia and Kholyavchenko, Maksym and Ponomarev, Evgeny and Markeeva, Larisa and Oseledets, Ivan and Cichocki, Andrzej},
journal={arXiv preprint arXiv:1903.09973},
year={2019}
}