Run:
$ conda create -n env_dhi python=3.6 numpy pip
$ source activate env_dhi
$ pip install scipy
$ pip install matplotlib
$ pip install h5py
$ pip install tensorflow-gpu
$ conda install -c menpo opencv
$ pip install pandas
$ conda install gdal
$ conda install -c ioos rtree
$ pip install centerline
$ pip install osmnx
$ pip install http://download.pytorch.org/whl/cu90/torch-0.3.1-cp36-cp36m-linux_x86_64.whl
$ pip install torchvision
Pay attention to the installed CUDA version: the tensorflow-gpu release must match the CUDA/cuDNN versions. Add the corresponding paths to your ~/.bashrc:
export LD_LIBRARY_PATH=/usr/local/cuda-9.0/lib64:$LD_LIBRARY_PATH
export PATH=/usr/local/cuda-9.0/bin:$PATH
export CUDA_HOME=/usr/local/cuda-9.0
export LD_LIBRARY_PATH=/usr/local/cuDNNv7.0-8/lib64:$LD_LIBRARY_PATH
export LD_LIBRARY_PATH=/usr/lib/x86_64-linux-gnu/:$LD_LIBRARY_PATH
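To see which CUDA toolkit is on the path before choosing a tensorflow-gpu release, you can parse the output of `nvcc --version`. A minimal sketch; the sample string below is an assumption standing in for the real command output:

```python
import re

def cuda_version(nvcc_output):
    """Extract the CUDA release number (e.g. '9.0') from `nvcc --version` output."""
    match = re.search(r"release (\d+\.\d+)", nvcc_output)
    return match.group(1) if match else None

# On a real machine, capture the output of `nvcc --version` directly;
# here a sample string is used as a stand-in:
sample = ("nvcc: NVIDIA (R) Cuda compiler driver\n"
          "Cuda compilation tools, release 9.0, V9.0.176")
print(cuda_version(sample))  # -> 9.0
```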
Register the environment as a Jupyter kernel:
$ pip install ipykernel
$ python -m ipykernel install --user --name=env_dhi
Launch Jupyter on the GPU machine, then forward the port to your local machine:
$ CUDA_VISIBLE_DEVICES=0 jupyter notebook --no-browser --port=8888
$ ssh -N -f -L localhost:8881:localhost:8888 s161362@mnemosyne.compute.dtu.dk
AWS: create an account, download the credential keys from the AWS console, create a bucket, and make it "Requester Pays" (see: https://docs.aws.amazon.com/fr_fr/AmazonS3/latest/dev/configure-requester-pays-console.html ). Then install the AWS CLI:
$ pip install awscli
Enter your credentials (only the access key and secret key are required; press Enter for the remaining prompts):
$ aws configure
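`aws configure` writes the keys to ~/.aws/credentials; the resulting file looks like this (placeholder values):

```ini
[default]
aws_access_key_id = YOUR_ACCESS_KEY_ID
aws_secret_access_key = YOUR_SECRET_ACCESS_KEY
```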
Check what is in the spacenet-dataset bucket:
$ aws s3 ls spacenet-dataset --request-payer requester
Get the full object listing of the bucket:
$ aws s3api list-objects --bucket spacenet-dataset --request-payer requester
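`list-objects` returns JSON; to pull out just the keys and sizes (for instance, to decide what to download), a small sketch, using a hard-coded sample response as an assumption in place of the real CLI output:

```python
import json

def summarize_objects(response_json):
    """Return (key, size-in-bytes) pairs from an S3 list-objects JSON response."""
    response = json.loads(response_json)
    return [(obj["Key"], obj["Size"]) for obj in response.get("Contents", [])]

# Sample response shaped like the CLI output (an assumption; feed the real
# `aws s3api list-objects` output in instead):
sample = json.dumps({"Contents": [
    {"Key": "AOI_2_Vegas/AOI_2_Vegas_Train.tar.gz", "Size": 1024},
    {"Key": "AOI_3_Paris/AOI_3_Paris_Train.tar.gz", "Size": 2048},
]})
for key, size in summarize_objects(sample):
    print(key, size)
```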
Download the SpaceNet building datasets:
$ aws s3api get-object --bucket spacenet-dataset \
--key AOI_1_Rio/processedData/processedBuildingLabels.tar.gz \
--request-payer requester /scratch/SPACENET_DATA/BUILDING_DATASET/AOI_1_RIO/processedBuildingLabels.tar.gz
$ aws s3api get-object --bucket spacenet-dataset \
--key AOI_2_Vegas/AOI_2_Vegas_Train.tar.gz \
--request-payer requester /scratch/SPACENET_DATA/BUILDING_DATASET/AOI_2_Vegas/AOI_2_Vegas_Train.tar.gz
$ aws s3api get-object --bucket spacenet-dataset \
--key AOI_2_Vegas/AOI_2_Vegas_Test_public.tar.gz \
--request-payer requester /scratch/SPACENET_DATA/BUILDING_DATASET/AOI_2_Vegas/AOI_2_Vegas_Test_public.tar.gz
$ aws s3api get-object --bucket spacenet-dataset \
--key AOI_3_Paris/AOI_3_Paris_Train.tar.gz \
--request-payer requester /scratch/SPACENET_DATA/BUILDING_DATASET/AOI_3_Paris/AOI_3_Paris_Train.tar.gz
$ aws s3api get-object --bucket spacenet-dataset \
--key AOI_3_Paris/AOI_3_Paris_Test_public.tar.gz \
--request-payer requester /scratch/SPACENET_DATA/BUILDING_DATASET/AOI_3_Paris/AOI_3_Paris_Test_public.tar.gz
$ aws s3api get-object --bucket spacenet-dataset \
--key AOI_4_Shanghai/AOI_4_Shanghai_Train.tar.gz \
--request-payer requester /scratch/SPACENET_DATA/BUILDING_DATASET/AOI_4_Shanghai/AOI_4_Shanghai_Train.tar.gz
$ aws s3api get-object --bucket spacenet-dataset \
--key AOI_4_Shanghai/AOI_4_Shanghai_Test_public.tar.gz \
--request-payer requester /scratch/SPACENET_DATA/BUILDING_DATASET/AOI_4_Shanghai/AOI_4_Shanghai_Test_public.tar.gz
$ aws s3api get-object --bucket spacenet-dataset \
--key AOI_5_Khartoum/AOI_5_Khartoum_Train.tar.gz \
--request-payer requester /scratch/SPACENET_DATA/BUILDING_DATASET/AOI_5_Khartoum/AOI_5_Khartoum_Train.tar.gz
$ aws s3api get-object --bucket spacenet-dataset \
--key AOI_5_Khartoum/AOI_5_Khartoum_Test_public.tar.gz \
--request-payer requester /scratch/SPACENET_DATA/BUILDING_DATASET/AOI_5_Khartoum/AOI_5_Khartoum_Test_public.tar.gz
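The repetitive get-object calls above can be generated from the AOI names. A sketch that prints one command per train/test archive for AOIs 2-5 (Rio is left out because its key and local path differ); pipe the output to a shell or adapt as needed:

```python
BUCKET = "spacenet-dataset"
DEST = "/scratch/SPACENET_DATA/BUILDING_DATASET"
AOIS = ["AOI_2_Vegas", "AOI_3_Paris", "AOI_4_Shanghai", "AOI_5_Khartoum"]

def download_commands(aois=AOIS, bucket=BUCKET, dest=DEST):
    """Yield one `aws s3api get-object` command per train/test archive."""
    for aoi in aois:
        for suffix in ("Train", "Test_public"):
            key = f"{aoi}/{aoi}_{suffix}.tar.gz"
            yield (f"aws s3api get-object --bucket {bucket} --key {key} "
                   f"--request-payer requester {dest}/{key}")

for cmd in download_commands():
    print(cmd)
```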
After several months of the master thesis, several models were trained and compared, and this repository contains the best one. The model is a Res-Unet (https://arxiv.org/abs/1505.04597) with batch normalization and dropout layers, combined with the distance module presented in https://arxiv.org/abs/1709.05932. Data augmentation was performed on the training set. The network was first trained on the SpaceNet dataset (see 'TRAINED_MODELS/RUBV3D2_final_model_spacenet.pth'), and transfer learning was then performed on the Ghana dataset ('TRAINED_MODELS/RUBV3D2_final_model_ghana.pth'). The most important metric is not the pixel-wise error but the F1 score of the SpaceNet Challenge, described in https://github.com/SpaceNetChallenge/utilities and implemented in IOU_computations.py.
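For intuition, the SpaceNet metric matches each predicted building footprint to a ground-truth footprint at IoU >= 0.5 and reports the F1 score over those matches. A simplified sketch using axis-aligned boxes (x0, y0, x1, y1) instead of real polygons; the actual implementation lives in IOU_computations.py:

```python
def iou(a, b):
    """Intersection-over-union of two axis-aligned boxes (x0, y0, x1, y1)."""
    ix = max(0, min(a[2], b[2]) - max(a[0], b[0]))
    iy = max(0, min(a[3], b[3]) - max(a[1], b[1]))
    inter = ix * iy
    area = lambda r: (r[2] - r[0]) * (r[3] - r[1])
    union = area(a) + area(b) - inter
    return inter / union if union else 0.0

def spacenet_f1(predictions, ground_truth, threshold=0.5):
    """Greedy matching at IoU >= threshold, then F1 = 2PR / (P + R)."""
    unmatched = list(ground_truth)
    tp = 0
    for p in predictions:
        best = max(unmatched, key=lambda g: iou(p, g), default=None)
        if best is not None and iou(p, best) >= threshold:
            unmatched.remove(best)  # each ground-truth box matches at most once
            tp += 1
    fp = len(predictions) - tp
    fn = len(ground_truth) - tp
    precision = tp / (tp + fp) if predictions else 0.0
    recall = tp / (tp + fn) if ground_truth else 0.0
    return (2 * precision * recall / (precision + recall)
            if precision + recall else 0.0)
```

For example, one good detection and one spurious one against two ground-truth boxes gives precision 0.5, recall 0.5, and thus F1 = 0.5.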
I am currently finalizing domain adaptation and experimenting with another loss better suited to pixel-wise segmentation (http://blog.kaggle.com/2017/05/09/dstl-satellite-imagery-competition-3rd-place-winners-interview-vladimir-sergey/). I would also like to run the model on the official SpaceNet Challenge test set using their evaluation code and upload my results to their platform.
The model is defined in RUBV3D2.py.
To run the training on a CUDA GPU device, use:
$ python train_model.py 'path_folder_to_dataset' 'path_folder_to_store_model' 'name_model' 'path_file_to_model_to_restore' --epochs=10 --iou_step=15
Other parameters can be set; they are documented in the script train_model.py and in the notebook train_model.ipynb. The notebook is a convenient way to get familiar with the training process before launching train_model.py at larger scale.
The notebook real_time_loss_tracker.ipynb tracks the metrics on the training and validation sets during a training run, which can take a very long time.
Typically, training on the Ghana dataset takes up to a couple of hours, whereas training on the SpaceNet dataset takes about a couple of days.
The notebook predict.ipynb runs prediction on any patch from the test set.
The notebook evaluation_test_set.py.ipynb computes metrics on the whole test set so that models can be compared.