deep_nowcaster

Introduction

This repository contains all code required to reproduce results from my Masters Thesis Exploration into machine learning techniques for precipitation nowcasting. As a byproduct this repo also contains APIs to access GPS RINEX files from FTP database, and ASOS data from NCDC. We also open source the script to map weather variables to GPS stations.

Dependencies

numpy
scipy
netCDF
scikit-learn
cuda
theano
lasagne

Build Training and Test dataset

The first step is to build the train/test data set of evolving precipitation fields and evolving moisture fields (termed as NIPW - Normalized Integrated Precipitable Water) each of which is a 100 x 100 matrix stored as a numpy array for a given timestep t. Before running the script please ensure you download the data(~1.7 GB uncompressed) required to make these fields using the from the link here. This file contains the raw radar data from NCDC in NetCDF format and the point measurements of IPW (Integrated Precipitable Water Vapor from the 44 GPS stations around KFWS for the years 2014,2015 and 2016). The following script plots all the images for the storm dates in our experiment and also stores the images in a numpy array inside data/dataset/YYYY. From inside the Preprocessing_code directory run:

python reflectivity_ipw_movies.py

48 plots (NIPW and reflectivity sampled at 30 minute intervals) and numpy arrays for each day are generated. The following shows example plots of the precipitation fields overlapped over the NIPW fields. The video sequence of the evolving precipitation fields and NIPW fields can be found in this youtube video.

We explore machine learning techniques which can capture the spatiotemporal relationships between the evolving precipitation fields and NIPW fields to be able to nowcast precipitation.

Train and Test models

BuildDataSet.py and other scripts in the includes directory contains helper functions to build training and validation data sets and calculating performance metrics, ensure that includes directory is added to your PYTHON_PATH variables.

Random Forest(RF) Classifier

We train a random forest classifier using a set of features engineered by taking the spatial statistics of the 33 x 33 window of points around the pixel point we are predicting. The set of routines in BuildDataSet.py does this for us and creates a dataset ready to be trained by the Random Forest. From inside the RandomForest_code directory, run the following script:

python RF_prediction_experiments.py True 60 ipw_refl 600 RF_60prediction_ipw_refl_experiment 6

which trains a RF classifier with 600 trees in the forest and a max depth of 6 and saves the results in the file 600RF_60prediction_refl_experiment_max_depth6.pkl. The file contains all the performance metrics evaluated in the training and validation set as defined by the class NOWCAST_performance() in ModelMetrics.py.

Convolutional Neural Networks(CNN)

Unlike the Random Forest classifier we feed the CNN with the actual 33 x 33 frames around the pixel point as features. The weights of convolution filters are learnt for each variable at each time step. The following script runs a single layer CNN with separate connections for the precipitation fields and separate connections for the NIPW fields. From inside the CNN_code directory run:

python Deep_NN_nowcasting_experiments.py

The program first creates the training and validation dataset inside the directory data/TrainTest/points/. We train our CNN using a Tesla K80 GPU on MGHPCC.

Name		Name	Last commit message	Last commit date
Latest commit History 57 Commits
CNN_code		CNN_code
Preprocessing_code		Preprocessing_code
RandomForest_code		RandomForest_code
UCAR_IPW		UCAR_IPW
code		code
includes		includes
storm_cases		storm_cases
trial_code		trial_code
verification_code		verification_code
.gitignore		.gitignore
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

CNN_code

CNN_code

Preprocessing_code

Preprocessing_code

RandomForest_code

RandomForest_code

UCAR_IPW

UCAR_IPW

code

code

includes

includes

storm_cases

storm_cases

trial_code

trial_code

verification_code

verification_code

.gitignore

.gitignore

README.md

README.md

Repository files navigation

deep_nowcaster

Introduction

Dependencies

Build Training and Test dataset

Train and Test models

Random Forest(RF) Classifier

Convolutional Neural Networks(CNN)

About

Releases

Packages

Languages

adityanagara/deep_nowcaster

Folders and files

Latest commit

History

Repository files navigation

deep_nowcaster

Introduction

Dependencies

Build Training and Test dataset

Train and Test models

Random Forest(RF) Classifier

Convolutional Neural Networks(CNN)

About

Resources

Stars

Watchers

Forks

Languages