OpenTrafficMonitoring+

This is the Detectron2-based implementation accompanying the publications listed in the Citation section below.

What's new

  • Code is now completely written in Python - all libraries are open source
  • Mask-RCNN: the PyTorch Detectron2 implementation
  • This implementation runs on Linux
  • It is faster and requires less GPU RAM
  • For the original implementation see OpenTrafficMonitoring.
  • Execution time in milliseconds (varies with the number of objects); see the performance comparison figure

The repository includes:

  • Pre-Processing: Images from Video, Image Registration
  • Vehicle Detection: Mask R-CNN built on ResNet50+FPN
  • Pre-trained weights: To reproduce the results from the publications or to directly apply to your own data set
  • Post-Processing: Generate rotated bounding boxes, perform object tracking, estimate vehicle state variables like position, speed, acceleration and orientation
  • Plot results: Visualize the bounding boxes and state variables on the images and create a video output
  • Example data used for the publications, see below

Here is one example from the benchmark experiments:

The figure depicts a bird's eye view of an orthorectified map of the test track at 50 m flight altitude. Shown are the trajectory of the reference in black and that of the drone-based measurement in red. The car was driven in spirals to capture different vehicle poses and positions in the image frame. The blue rectangle and cross indicate the image frame and its center. For estimation purposes, a true-to-scale T-junction is painted white on the test track.

What accuracy can be expected?

The figure below depicts the error curves we obtained from the experiments.

(Figure: error curves from the experiments.)

Here is a detailed table with position error results from publication (1), where the detection is performed frame-by-frame, i.e. no smoothing/filtering is applied:
(Table: frame-by-frame position error results.)

Installation

  1. Install Detectron2
  2. Clone this repository
  3. Install dependencies (pay attention to the release versions and to the installation order of cv2 and cv2-contrib)
    pip install -r requirements.txt
    
  4. Download our weights and store them in the maskrcnn folder. The network was trained on approximately 9000 vehicle instances (cars, trucks, buses), mainly from European and Asian data sources. Note: this labeled dataset cannot be provided.
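As a quick sanity check after installation, the following minimal sketch (not part of the repository) verifies that PyTorch, Detectron2 and OpenCV are importable and that a CUDA GPU is visible:

# Minimal installation check (illustrative; not part of this repository)
import torch, torchvision, cv2, detectron2

print("torch:", torch.__version__, "| torchvision:", torchvision.__version__)
print("detectron2:", detectron2.__version__, "| OpenCV:", cv2.__version__)
print("CUDA available:", torch.cuda.is_available())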

Getting started

To check if your installation works, try our example video. Open a terminal in the src folder and type:

python main.py --videos=./videos/:./out/
# '--videos' argument should be in the format: inputfolder:outputfolder

This will create a folder named [out] where the output is stored.
The code supports multi-video processing: all videos in the input folder are processed sequentially.
You can also provide images instead of videos:

python main.py --images=./images/:./out/ --config=default
# '--config' argument refers to the 'config_name', not video_name !

Note that a config must be specified when using images, since the config cannot be inferred from a video name.

Example data and training weights

One example video is already included in this repository (DJI_0029_example.mp4). For further data, i.e. pre-trained weights, the labeled data of the test vehicle, or one complete video from the paper data set, download the data hosted on the FAU University server or the mirrors on Google Drive.

Apply your own training

If you want to increase the detection performance, create a labeled training data set and apply transfer learning.
The easiest way to start is by annotating the data using the open-source VIA Image Annotator.
To get started with an example, check our provided labeled data set used for the publications.
The following steps are necessary to create your training/validation data set with our approach:

  1. Label the images with the VIA Image Annotator
  2. Export your annotations as COCO JSON file and name the file via_export_coco.json.
  3. Run python vgg_to_coco.py in the src folder to obtain the transformed.json annotations (check the input and output folders defined in vgg_to_coco.py)
  4. Store your training files (images and transformed.json annotation file) in a new folder (for example './custom_data').
  5. Train:
# Train from COCO data set
python train.py

# Train from our weights
python train.py --dataset_train=./custom_data/train_data/ --weights=./maskrcnn/model_final.pth

# Run python train.py --help for a detailed list of all possible arguments
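For orientation, here is a hedged sketch of how a COCO-style annotation file such as transformed.json can be registered with Detectron2; the dataset name and paths are illustrative and train.py may handle this differently:

# Illustrative sketch: registering a COCO-style data set with Detectron2.
# Dataset name and folder paths are hypothetical; see train.py for the actual setup.
from detectron2.data.datasets import register_coco_instances

register_coco_instances(
    "custom_train",                                  # hypothetical dataset name
    {},                                              # no extra metadata
    "./custom_data/train_data/transformed.json",     # annotations produced by vgg_to_coco.py
    "./custom_data/train_data/",                     # folder containing the training images
)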

Config

The config provides an execution context for every video that gets processed. It contains the tuning parameters for the various steps (e.g. drone height, spatial resolution, etc.).

If your videos are all very similar, you can simply adapt the default config.

You can also create new configs in the src/config.py file and add them to the 'configs_to_consider' list. A config is then chosen for each video by matching its 'video_name' entry against the video file name (a simple substring check). If no matching config is found, the default config is selected.

example:

# config.py
new_config_a = default_config.copy({
    "config_name": "config for 50m footage",
    "video_name": "50m",
    "meter_to_pixel": 0.07,
    "drone_height":50,
    # ....
})

new_config_b = default_config.copy({
    "config_name": "config for xyz videos",
    "video_name": "0045",
    "register_images": False,
    "drone_height": 75,
    # ....
})

configs_to_consider = [new_config_a, new_config_b]

In this example, we create two new configs in src/config.py which differ from the default config only in the values we override. If we were to process the following three videos, the config mapping would look like this:

  • drone_recordings_50m.mp4 -> (uses new_config_a because it contains '50m')
  • footage_0045_0001.avi -> (uses new_config_b because it contains '0045')
  • city_20min.mp4 -> (uses default config since there is no matching config for this video name)
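A minimal sketch of this name-based lookup (illustrative only; the actual selection logic in the repository may differ):

# Illustrative sketch of the substring-based config lookup described above.
# Assumes dict-style access to the 'video_name' entry of each config.
def select_config(video_file_name, configs_to_consider, default_config):
    for cfg in configs_to_consider:
        # e.g. "50m" in "drone_recordings_50m.mp4" -> True
        if cfg["video_name"] in video_file_name:
            return cfg
    return default_config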

Important tuning parameters

The output quality is affected by a number of parameters; see our publications for more details. Some of the most important ones are briefly discussed here:

  • Image registration: To perform the tracking and vehicle state estimation, the video sequence requires a fixed frame. This is achieved with image registration. An important tuning parameter here is the Hessian threshold. To compute the scaling, rotation and translation offsets between two image frames, a certain number of matching features has to be found by the algorithm. Lowering the Hessian threshold too much can affect the registration results. Generally, the algorithm is very robust, as indirectly shown in the papers by results with errors of only a few pixels. A minimal registration sketch follows this list.
  • Ground Sampling Distance (spatial resolution): Determines the photo resolution, here in cm/px. To obtain the best results, we used 3 Ground Control Points and measured the distances between the points with D-GPS in RTK mode (~1 cm accuracy). The positions of these Ground Control Points are then determined in the first image frame in pixels. We assume quadratic pixels, so that the conversion from meter to pixel is straightforward. Finally, we took the average of the 3 distances between the 3 control points. Maximizing the distance between points yields smaller errors.
  • Kalman-Filter tuning parameters: The tuning parameters are validated for flight heights between 50 m and 100 m, Full-HD resolution and 50 fps.
  • Clean-Up: False detections can usually be cleaned up by setting a minimum average speed and a minimum number of consecutive detections (see config.py). To avoid erratic values when an object enters the image (due to the increasing object size), you can also set the parameter to delete the first n frames.
  • Training / Inference: Parameters regarding the learning rate, region proposals, max. number of detections, Non-maximum Suppression (NMS) threshold etc. can be tuned in the Detectron2 framework. Our setup can be checked in inference.py and train.py, where some values are overridden from the Detectron2 defaults. A sketch of such overrides also follows this list.
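The registration sketch referenced in the first item above, assuming OpenCV with the contrib modules; the Hessian threshold value and the overall structure are illustrative, and the repository's own registration code may differ:

# Illustrative sketch of feature-based image registration with a Hessian threshold.
import cv2
import numpy as np

def register_frame(reference_gray, frame_gray, hessian_threshold=400):
    # SURF features; a lower Hessian threshold yields more (but weaker) keypoints
    surf = cv2.xfeatures2d.SURF_create(hessianThreshold=hessian_threshold)
    kp_ref, des_ref = surf.detectAndCompute(reference_gray, None)
    kp_frm, des_frm = surf.detectAndCompute(frame_gray, None)

    # match descriptors and keep the strongest correspondences
    matcher = cv2.BFMatcher(cv2.NORM_L2, crossCheck=True)
    matches = sorted(matcher.match(des_frm, des_ref), key=lambda m: m.distance)[:200]
    src = np.float32([kp_frm[m.queryIdx].pt for m in matches]).reshape(-1, 1, 2)
    dst = np.float32([kp_ref[m.trainIdx].pt for m in matches]).reshape(-1, 1, 2)

    # estimate scaling, rotation and translation and warp the frame onto the reference
    H, _ = cv2.findHomography(src, dst, cv2.RANSAC, 5.0)
    height, width = reference_gray.shape
    return cv2.warpPerspective(frame_gray, H, (width, height))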
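Likewise, a hedged sketch of the kind of Detectron2 overrides mentioned in the last item; the exact keys and values used in inference.py and train.py may differ:

# Illustrative sketch of overriding Detectron2 defaults; values are placeholders.
from detectron2 import model_zoo
from detectron2.config import get_cfg

cfg = get_cfg()
# start from the Mask R-CNN ResNet50+FPN baseline configuration
cfg.merge_from_file(model_zoo.get_config_file("COCO-InstanceSegmentation/mask_rcnn_R_50_FPN_3x.yaml"))
cfg.MODEL.ROI_HEADS.SCORE_THRESH_TEST = 0.5   # detection confidence threshold
cfg.MODEL.ROI_HEADS.NMS_THRESH_TEST = 0.5     # Non-maximum Suppression threshold
cfg.TEST.DETECTIONS_PER_IMAGE = 100           # max. number of detections per image
cfg.SOLVER.BASE_LR = 0.00025                  # learning rate for training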

Requirements

Detectron2, Python 3.6 or 3.7, and the packages listed in requirements.txt.
Tested on:

  • Ubuntu 18.04, Python 3.6.9, torch 1.4.0, torchvision 0.5.0, CUDA 10.2, Nvidia 440.x drivers
  • Intel Xeon E-2176G, 32 GB RAM, GeForce RTX 2080 Ti, M.2 SDD

Citation

To cite this repository:

@misc{KruberGithub.2020,
 author = {{F. Kruber} and E. {S\'{a}nchez Morales} and R. {Egolf}},
 date = {2020},
 title = {{OpenTrafficMonitoring+}},
 journal={GitHub repository},
 url = {\url{https://github.com/fkthi}}
}

To cite our publications:

@INPROCEEDINGS{KruberIV20,
author={F. {Kruber} and E. {S\'{a}nchez Morales} and S. {Chakraborty} and M. {Botsch}},
booktitle={2020 IEEE Intelligent Vehicles Symposium (IV)},
title={{Vehicle Position Estimation with Aerial Imagery from Unmanned Aerial Vehicles}},
year={2020}
}
@INPROCEEDINGS{SanchezMoralesIV20,
author={E. {S\'{a}nchez Morales} and F. {Kruber} and M. {Botsch} and B. {Huber} and  A. {Garc{\'i}a Higuera}},
booktitle={2020 IEEE Intelligent Vehicles Symposium (IV)},
title={{Accuracy Characterization of the Vehicle State Estimation from Aerial Imagery}},
year={2020}
}

Acknowledgment

The authors acknowledge the financial support by the Federal Ministry of Education and Research of Germany (BMBF) in the framework of FH-Impuls (project number 03FH7I02IA).
