VRDL_HW2

code for Selected Topics in Visual Recognition using Deep Learning Homework 2

Hardware

Ubuntu 16.04 LTS
NVIDIA 1080ti

Installation

Clone this repo
Install the required packages:

apt-get install tk-dev python-tk

Install the python packages:

pip install pandas
pip install pycocotools
pip install opencv-python
pip install requests

Train model by running Train.py:
Output predicted result by running Test.py and upload to Google drive submission

Training

The network can be trained using the train.py script. We use HW2 as our dataset.

python train.py --dataset HW2 --HW2_path ../train --depth 50

In Train.py, we preprocess the data and train our model.

Pre-trained model

A pre-trained model is available at:

https://drive.google.com/open?id=1yLmjq3JtXi841yXWBxst0coAgR26MNBS The state dict model can be loaded using:

retinanet = model.resnet50(num_classes=dataset_train.num_classes(),)
retinanet.load_state_dict(torch.load(PATH_TO_WEIGHTS))

Data preprocess

Then we do image preprocessing in the following order.

Normalize
Augmentation
Resize And remember to resize the annotation when you resize the corresponding image.
After preprocessing, we use dataloader with batch_size=10 to feed training data into our model.

Train our model

python train.py --dataset HW2 --HW2_path ../train --depth 50

We use RetinaNet as backbone network, and modify the class numbers to 10.
10 is the total class numbers of our training_data (0~9).

We use focalLoss as loss function and Adam optimizer, and save the model every single epochs in total 100 epochs.
Set learning rate = 1e-4.
The model parameters would be saved in folder saved_models_3/.

Test accuracy by running test.py

python test.py

For testing data, we first use simliar dataloader to load testing images.
Then we import the saved model.

retinanet = torch.load('saved_models_3/HW2_retinanet_xx.pt')
retinanet = retinanet.cuda()
retinanet.eval()

For each testing image, we normalize and resize it.
In the end, write image id and predicted result to a .josn file.
Upload google drive submission.
Best accuracy : mAP : 0.46387.

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
retinanet		retinanet
README.md		README.md
coco_validation.py		coco_validation.py
test.py		test.py
train.py		train.py
visualize.py		visualize.py
visualize_single_image.py		visualize_single_image.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

retinanet

retinanet

README.md

README.md

coco_validation.py

coco_validation.py

test.py

test.py

train.py

train.py

visualize.py

visualize.py

visualize_single_image.py

visualize_single_image.py

Repository files navigation

VRDL_HW2

Hardware

Installation

Training

Pre-trained model

Data preprocess

Train our model

Test accuracy by running test.py

About

Releases

Packages

Languages

paul12716/VRDL_HW2

Folders and files

Latest commit

History

Repository files navigation

VRDL_HW2

Hardware

Installation

Training

Pre-trained model

Data preprocess

Train our model

Test accuracy by running test.py

About

Resources

Stars

Watchers

Forks

Languages