selfdriving-fun/scene-understanding-for-autonomous-vehicles
Scene understanding for autonomous vehicles - Team 08: CoRN

This project was developed in the framework of the Master in Computer Vision at Universitat Autònoma de Barcelona (UAB), as part of module M5: Visual Recognition.

Abstract: "In this project we employ state-of-the-art deep learning algorithms for recognition, detection and segmentation tasks, which are key to building a system that correctly understands traffic scenes and makes the right decisions while driving autonomously."

For more information, check our report on Overleaf or our presentations on Google Drive. You can find summaries of some image classification papers in the summaries file. Additionally, for a quick look at what we have done in the project so far, head over to the 'The progress at a glance' section.



Getting Started

Prerequisites

This project needs the following software to run: Python, pip and pipenv.

If you are using a Linux distribution, you most probably already have a running version of Python installed.

As an example, on Ubuntu you can install pip and pipenv using these commands:

```shell
apt-get install python-pip
pip install pipenv
```

Installing

To install the virtual environment, you only have to run pipenv from the project's root directory:

```shell
pipenv install
```

It is recommended to add the following environment variables to your .bashrc:

```shell
export PIPENV_VENV_IN_PROJECT=1
export PIPENV_IGNORE_VIRTUALENVS=1
export PIPENV_MAX_DEPTH=1
```

You can find their explanation, as well as more environment variables, in Pipenv's documentation on configuration with environment variables.

Built With

This project uses Keras as a high-level neural networks API running on top of the TensorFlow library.

How to run it

Run a training in the server

```shell
cd code
CUDA_VISIBLE_DEVICES=0 python train.py -c config/tt100k_classif.py -e test -l /home/master/tmp -s /data/module5/
```

Run in local

```shell
python code/train.py -c code/config/tt100k_classif.py -e test -l tmp -s data
```
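The invocations above suggest a command-line interface along the following lines. This is an illustrative argparse sketch, not the actual `train.py`: the short flags come from the commands shown, while the long option names and help texts are our assumptions about what each flag means (config file, experiment name, local output directory, shared data directory).

```python
import argparse

def build_parser():
    # Hypothetical sketch of the train.py CLI; flag meanings are
    # inferred from the example invocations in this README.
    parser = argparse.ArgumentParser(description="Launch a training experiment")
    parser.add_argument("-c", "--config", required=True,
                        help="path to the experiment config, e.g. config/tt100k_classif.py")
    parser.add_argument("-e", "--experiment", required=True,
                        help="experiment name used to tag the outputs")
    parser.add_argument("-l", "--local-dir", default="tmp",
                        help="local directory for logs and checkpoints")
    parser.add_argument("-s", "--shared-dir", default="data",
                        help="shared directory containing the datasets")
    return parser
```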

Pre-trained weights

You can find some of the weights from our experiments in this Google Drive.

The progress at a glance

Object recognition

Summary

Two weeks have been devoted to the study of state-of-the-art architectures for object recognition. In particular, we have evaluated the vanilla VGG16 architecture weights to recognise traffic signs from the TT100K dataset. We compared the performance between cropped and resized images, as well as the adaptation of the network to another domain (BelgiumTS). Moreover, we have trained the VGG16 both from scratch and with pre-trained weights from ImageNet on the KITTI dataset and analysed their results. Finally, we have implemented the Squeeze-and-Excitation Network (added to models) and compared the vanilla ResNet50 results with its SE counterpart, both with fine-tuning and from scratch. Each dataset has been analysed to help us draw meaningful conclusions from the results obtained. Check our report and presentation for more details.
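The squeeze-and-excitation idea mentioned above can be sketched in a few lines. This is a minimal NumPy illustration of the SE computation on a single (H, W, C) feature map, not our actual Keras implementation; the function name and weight shapes are ours, chosen for clarity:

```python
import numpy as np

def squeeze_excite(feature_map, w1, b1, w2, b2):
    """Squeeze-and-Excitation on a (H, W, C) feature map.

    Squeeze: global average pooling gives a (C,) channel descriptor.
    Excitation: a bottleneck MLP (ReLU, then sigmoid) turns it into
    per-channel weights in (0, 1).
    Scale: each channel of the input is reweighted by its gate.
    """
    z = feature_map.mean(axis=(0, 1))            # squeeze: (C,)
    s = np.maximum(0.0, z @ w1 + b1)             # excitation hidden layer: (C/r,)
    s = 1.0 / (1.0 + np.exp(-(s @ w2 + b2)))     # sigmoid gate: (C,)
    return feature_map * s                       # scale: broadcast over H and W
```

With all weights at zero the sigmoid gate is 0.5 for every channel, so the block halves the input; trained weights instead learn to emphasise informative channels and suppress the rest.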

Results

| Model           | Train acc | Valid acc | Test acc |
|-----------------|-----------|-----------|----------|
| Vanilla VGG16   | 0.995     | 0.962     | 0.955    |
| SE-ResNet50     | 0.998     | 0.942     | 0.939    |
| VGG16 (KITTI)   | 0.888     | 0.904     | -        |
| + w. ImageNet   | 0.978     | 0.975     | -        |
| VGG16 (TT100K)  | 0.990     | 0.842     | 0.814    |
| + resize        | 0.981     | 0.842     | 0.820    |
| + crop          | 0.945     | 0.836     | 0.866    |
| + fine-tune BTS | 0.789     | 0.767     | 0.767    |

Object detection

Summary

Two weeks have been devoted to the study and implementation of state-of-the-art architectures for object detection. We have tested the vanilla YOLO (v1) architecture with both the TT100K (for detection, instead of using crops containing only signs) and Udacity datasets, and analysed the overfitting and class imbalance problems we encountered. We have proposed possible solutions and implemented some of them, like data augmentation and re-splitting the given partition. We also worked hard on implementing an SSD, but we could not finish it: we can see the loss decreasing and reaching a plateau during training, but we have no bounding boxes to assess the detections. Check our report and presentation for more details.
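The precision, recall and F-score numbers below follow the standard definitions. As a simplified illustration of how such detection metrics can be computed, here is a sketch using greedy IoU matching; it is not our actual evaluation code, and the 0.5 IoU threshold is the usual convention, assumed here:

```python
def iou(box_a, box_b):
    """Intersection-over-union of two boxes given as (x1, y1, x2, y2)."""
    x1, y1 = max(box_a[0], box_b[0]), max(box_a[1], box_b[1])
    x2, y2 = min(box_a[2], box_b[2]), min(box_a[3], box_b[3])
    inter = max(0.0, x2 - x1) * max(0.0, y2 - y1)
    area_a = (box_a[2] - box_a[0]) * (box_a[3] - box_a[1])
    area_b = (box_b[2] - box_b[0]) * (box_b[3] - box_b[1])
    return inter / (area_a + area_b - inter) if inter else 0.0

def detection_metrics(predictions, ground_truth, iou_threshold=0.5):
    """Greedy one-to-one matching: a prediction is a true positive if it
    overlaps a not-yet-matched ground-truth box with IoU >= threshold."""
    matched, tp = set(), 0
    for pred in predictions:
        for i, gt in enumerate(ground_truth):
            if i not in matched and iou(pred, gt) >= iou_threshold:
                matched.add(i)
                tp += 1
                break
    precision = tp / len(predictions) if predictions else 0.0
    recall = tp / len(ground_truth) if ground_truth else 0.0
    f_score = (2 * precision * recall / (precision + recall)
               if precision + recall else 0.0)
    return precision, recall, f_score
```

For example, with precision .198 and recall .144 the F-score works out to 2 × .198 × .144 / (.198 + .144) ≈ .167, which is how the first row of the table below is obtained.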

Results

| Model          | Precision | Recall | F-score |
|----------------|-----------|--------|---------|
| YOLO           | .198      | .144   | .167    |
| + WB DA        | .276      | .219   | .244    |
| + re-splitting | .204      | .148   | .171    |
| SSD            | -         | -      | -       |

Week 5/6: object segmentation

Summary

Results

Authors

License

This project is licensed under the GPLv3 License - see the LICENSE.md file for details.
