Learning Based Solar Forecasting

Overview

This project aims at improving intra-day Ground Horizontal Irradiance (GHI) forecasting using machine learning based algorithms. Predicting the amount solar energy in the next few hours/days is of great importance to power system operations and control. Currently two methods are implemented and documented below. If you have any questions with respect to this project, please contact me through email (my unique name is `hmshen` I am willing to help/answer any questions related to these two parts).

NOTICE: If you need to use the code below (even part of it), I would recommend you read it critically before using it.

Github repo for this project

Link to Github repo (click me!)

Dependencies

Tensorflow, Sklearn, Pandas, Numpy, Scipy, ….

I would recommend you use Anaconda Python.

To clone this repository into your server/local machine, execute the following command

git clone https://github.com/hm-shen/learning-based-weather-forecasting

SVM Based Solar Irradiance Forecasting

This part is based on paper:

Hourly Solar Irradiance Prediction Based on Support Vector Machine and Its Error Analysis (click me!)

Getting Started

Three modes are implemented: weather_prediction, houldout_trainint, grid_search. weather_prediction will load the input data and complete solar irradiance prediction; holdout_training will split the input data into holdout training and run test on the trained model; grid_search will search for good parameters in a set of given parameters.

To run this project, you can directly python main.py after manually setting the running mode parameters in main.py file.

Implementation

Given a set of training data, K-means algorithm is used to separate data into three cluster where each of them implies a specific weather type (cloudy, partly cloudy, sunny). Then, code will train a Support Vector Regression model for each type of weather.

LSTM Based Cloud Fraction Forecasting

Since both hourly GHI and hourly cloud fraction can be considered as time series, solving it using Long short-term memory (LSTM) becomes natural. This part implements a simple LSTM model for supervised cloud cover and GHI forecasting.

Getting Started

To perform cloud fraction prediction on NREL data sets, please run the following command from folder lstm/src/:

python -p /path/to/NREL_data/ \
       -f 'average or variance' \
       -o /path/to/output/ \
       -c /path/to/configuration file/ \
       -n 'name of the dataset' \

Two example bash scripts are included in lstm/src folder. You can execute them by running (NREL data for cloud fraction forecasting or WRF data for solar irradiance forecasting)

./run_nrel_cloud.sh

or

./run_wrf_solar.sh

Implementation

Note that this implementation is based on https://github.com/tgjeon/TensorFlow-Tutorials-for-Time-Series (click me!)

Overall Framework of LSTM

Tutorial on LSTM model can be found here (click me!)

All diagram of the architecture is shown below

+-----------------------------+
|      Linear Regression      |
+-----------------------------+
               ^
               |
+--------------+--------------+
|      Fully connect ANN      |
+-----------------------------+
               ^
               |
+--------------+--------------+
|      Fully connect ANN      |
+-----------------------------+
   ^         ^             ^
   |         |             |
+--+--+   +--+--+       +--+--+
| RNN |-->| RNN | .. -->| RNN |
+-----+   +-----+       +-----+

where the LSTM cell is represented as “unrolled” RNN cells. So for each input \(x_t\), \(y_t\) will be generated by LSTM cell and feed into two layers of fully connected artificial neural networks (ANN). Then the output is used as the input to a linear regressor.

Parameters

Parameters for LSTM are listed below:

Parameters	Description
time steps	how many time steps is used to predict (i.e. features)
rnn layers	configuration of rnn layers using a list of dict
dense layers	number of units in each dense layer

Some details on data preprocessing

Those NREL data contained in the lstm/data/ folder is a little bit messy in the sense that there may be invalid cloud fraction data in each day (e.g. nan, -1). Thus, to remove days with too many messy data, there are two variables, ubd_min, lbd_max, responsible for removing all invalid days (days with too many bad data): all days where the first valid data appearing later than ubd_min is removed; similarly, all days where the last valid data appearing before lbd_max is removed. This way, we select days with number of valid data at least (lbd_max - ubd_min). Also note that these two variables are related to the dataset you are using and thus should be set by hand in the source code /src/driver.py.

Name		Name	Last commit message	Last commit date
Latest commit History 28 Commits
docs		docs
lstm		lstm
svr		svr

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

docs

docs

lstm

lstm

svr

svr

Repository files navigation

Learning Based Solar Forecasting

Overview

Github repo for this project

Dependencies

SVM Based Solar Irradiance Forecasting

Getting Started

Implementation

LSTM Based Cloud Fraction Forecasting

Getting Started

Implementation

Overall Framework of LSTM

Parameters

Some details on data preprocessing

About

Releases

Packages

Languages

hm-shen/learning-based-weather-forecasting

Folders and files

Latest commit

History

Repository files navigation

Learning Based Solar Forecasting

Overview

Github repo for this project

Dependencies

SVM Based Solar Irradiance Forecasting

Getting Started

Implementation

LSTM Based Cloud Fraction Forecasting

Getting Started

Implementation

Overall Framework of LSTM

Parameters

Some details on data preprocessing

About

Resources

Stars

Watchers

Forks

Languages