- Clustering and p-value plots
- Lesion test: MNIST, MNIST+Dropout, Fashion, Fashion+Dropout
- Learning curves notebook
We use `make` with a Makefile to automate the project.
- `make datasets` - Build all datasets (deterministic).
- `make models` - Train all NN models, both MLP and CNN.
- `make test` - Run tests (with `pytest`).
- `make mlp-clustering` - Run the notebook `notebooks/mlp-clustering.ipynb`, which clusters all MLP models (including the alternative-explanation ones) and saves the results as a table into `results/mlp-clustering.csv`.
- `make mlp-lesion` - Run the notebook `notebooks/mlp-lesion-test.ipynb`, which performs the lesion test on all standard MLP models and saves the results as a table into `results/mlp-lesion.xlsx`.
- `make mlp-double-lesion` - Run the notebook `notebooks/mlp-double-lesion-test.ipynb`, which performs the double lesion test on all standard MLP models.
- `make mlp-learning-curve` - Run the notebook `notebooks/mlp-learning-curve.ipynb`, which plots the learning curves for a selected set of MLP models.
- `make mlp-clustering-stability` - Run the notebook `notebooks/mlp-clustering-stability.ipynb`, which trains and clusters multiple trained instances of all of the MLP models (including the alternative-explanation ones) and saves the results as a table into `results/mlp-clustering-stability-statistic.csv`. (NOTE: read the comment in the notebook about `src/train_nn.py` before running it.)
- `make mlp-plots` - Run the notebook `notebooks/mlp-plots.ipynb`, which generates many of the plots from the ICML 2020 paper.
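For context, targets like these are typically wired together in a Makefile along the following lines. This is a hypothetical sketch, not the repository's actual Makefile: the recipe commands and the `src/build_datasets.py` name are illustrative assumptions (only `src/train_nn.py` and the notebook paths appear in the list above).

```makefile
# Illustrative sketch only -- see the Makefile in the repository root
# for the real targets and recipes.
NOTEBOOK_RUN = jupyter nbconvert --to notebook --execute --inplace

datasets:
	python src/build_datasets.py   # hypothetical dataset-building script

models: datasets
	python src/train_nn.py         # training script mentioned in the note above

test:
	pytest

mlp-clustering: models
	$(NOTEBOOK_RUN) notebooks/mlp-clustering.ipynb  # writes results/mlp-clustering.csv
```

`jupyter nbconvert --execute` runs a notebook top to bottom in place, which is one common way to drive notebooks from `make`.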
Requirements: Python 3.7 (it may work with earlier versions, but this hasn't been tested)
There are two options to set up the environment:
- Using a Python virtual environment
- Using a Docker container
- Clone this repository.
- Install `graphviz`:
  - Ubuntu/Debian: `apt install graphviz`
  - macOS: `brew install graphviz`
- Install the Python dependencies with `pipenv install --dev`.
- On macOS only, you will need to install `pygraphviz` separately: `pipenv run pip install pygraphviz --install-option="--include-path=/usr/local/Cellar/`
- To enter the virtual environment, type `pipenv shell`.
Useful: Lifecycle of Docker Container
Clone the repository and change to the `devops` directory.
To build the image, run:
```shell
docker build -t nnsurprisinglymodular/nn-clustering .
```
Alternatively, to pull the prebuilt container image, run:
```shell
docker pull nnsurprisinglymodular/nn-clustering
```
First, you need a port number for your Jupyter Notebook - pick a random number in the range 8000-8500.
Run the following command, after:
- removing the comments, and
- replacing `<PORT NUMBER>` with your random port number (also in the instructions that follow):
```shell
docker run \
  -it \
  -p <PORT NUMBER>:8888 \
  --rm \
  --name nn_clustering-$(whoami) \
  --runtime=nvidia \
  nnsurprisinglymodular/nn-clustering:latest \
  bash
# REMOVE the `--runtime=nvidia \` line above if you don't have a GPU.
```
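If you'd rather let the shell pick the port for you, a minimal sketch (assumes GNU coreutils' `shuf` is available):

```shell
# Pick a random port in the suggested 8000-8500 range.
PORT=$(shuf -i 8000-8500 -n 1)
echo "chosen port: $PORT"
```

You can then pass it to `docker run` as `-p "$PORT":8888` instead of editing the command by hand.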
Then type:
```shell
bash build.sh
```
By default this will not download model checkpoints, because they amount to many gigabytes of files. If you want to download the checkpoints as well, run:
```shell
bash build.sh --download_all
```
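For reference, gating a large download behind a flag like `--download_all` usually amounts to a small argument check. A hypothetical sketch (not the repository's actual `build.sh`):

```shell
#!/usr/bin/env bash
# Hypothetical sketch: skip the multi-gigabyte checkpoint download
# unless --download_all is passed. Not the repo's actual build.sh.
DOWNLOAD_ALL=false
for arg in "$@"; do
  if [ "$arg" = "--download_all" ]; then
    DOWNLOAD_ALL=true
  fi
done
echo "download_all=$DOWNLOAD_ALL"
# A real script would fetch datasets here, and fetch checkpoints
# only when DOWNLOAD_ALL is true.
```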
NB: to leave the container without stopping it, use `Ctrl-P Ctrl-Q`. Typing `exit` will destroy the container (it was started with `--rm`).
```shell
docker exec \
  -it nn_clustering-$(whoami) \
  bash
```
Run this command:
```shell
jupyter notebook --allow-root --no-browser --ip=0.0.0.0 --port=8888
```
If the container is on your own computer, just open your browser at http://localhost:<PORT NUMBER> (the host port you mapped with `-p`).
If the container is on another server, forward local port 8888 to the container's port over SSH from your personal machine:
```shell
ssh -N -L localhost:8888:localhost:<PORT NUMBER> -i <PATH TO SSH PRIVATE KEY> <USERNAME>@<SERVER ADDRESS>
```
After doing this, you can then open the jupyter notebook in your browser.
It is advisable to learn how to use tmux and run the Jupyter Notebook in a separate window: https://github.com/tmux/tmux/wiki
To upload files or directories:
```shell
aws s3 cp --recursive <local> s3://nn-clustering/<remote>
```