Time series data - weather forecast

Motivation

Predict the temperature over the next few hours using multiple machine learning algorithms.

Requirements

Python (3.6.0)
Pandas (0.24.1)
NumPy (1.16.0)
Keras (2.2.4)
TensorFlow (1.13.1)
Jupyter (4.4.0)
Matplotlib (3.0.2) and Seaborn (0.9.0)

Dataset Overview

The dataset used for weather forecasting was obtained from the book Deep Learning with Python. It contains recorded weather data comprising 13 different features from 2009 to 2016. Records were taken at 10-minute intervals, which yields 6 samples per hour and roughly 52557 samples per year. Over the 8 years, the total number of samples is 420550. The dataset is divided into training and testing groups in a ratio of 8:2.
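The sampling arithmetic and the 8:2 split can be sketched as follows. This is a minimal illustration, not code from this repository: it assumes a chronological split (the earliest 80% of rows for training), which the description above does not specify, and `split_index` is a hypothetical helper name.

```python
# Sketch only: assumes a chronological 8:2 split, which is an assumption.
MINUTES_PER_RECORD = 10
SAMPLES_PER_HOUR = 60 // MINUTES_PER_RECORD   # one record every 10 minutes
samples_per_day = SAMPLES_PER_HOUR * 24


def split_index(n_samples, train_parts=8, total_parts=10):
    """Return the row index that separates training rows from testing rows.

    Integer arithmetic avoids floating-point rounding on large counts.
    """
    return n_samples * train_parts // total_parts


n_total = 420550                  # total sample count quoted above
n_train = split_index(n_total)    # rows [0, n_train) go to training
n_test = n_total - n_train        # remaining rows go to testing

print(SAMPLES_PER_HOUR, samples_per_day, n_train, n_test)
```

With a pandas DataFrame `df`, the same index could be used as `df.iloc[:n_train]` and `df.iloc[n_train:]`.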

Method

Linear regression (LR) was tried with different regularization methods and loss functions.
Support vector regression (SVR) was also tested.
For hyperparameter tuning with cross-validation, both RandomizedSearchCV and GridSearchCV were tested. RandomizedSearchCV was ultimately selected because it produced better performance with limited computing resources.
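To illustrate the difference between the two search strategies, here is a minimal sketch using scikit-learn on synthetic data. The data, the alpha ranges, the number of folds, and the choice of Ridge are all assumptions made for illustration; they are not the settings used by the scripts in this repository.

```python
import numpy as np
from scipy.stats import uniform
from sklearn.linear_model import Ridge
from sklearn.model_selection import GridSearchCV, RandomizedSearchCV

# Synthetic regression data (assumption: stands in for the weather features).
rng = np.random.RandomState(0)
X = rng.normal(size=(200, 5))
y = X @ np.array([1.0, -2.0, 0.5, 0.0, 3.0]) + 0.1 * rng.normal(size=200)

# Grid search evaluates every listed alpha value.
grid_search = GridSearchCV(
    Ridge(),
    param_grid={"alpha": [0.01, 0.1, 1.0, 10.0]},
    scoring="neg_mean_squared_error",
    cv=3,
)
grid_search.fit(X, y)

# Randomized search draws a fixed number of alphas from a distribution,
# so its cost is set by n_iter rather than by the size of a grid.
rand_search = RandomizedSearchCV(
    Ridge(),
    param_distributions={"alpha": uniform(loc=0, scale=10)},  # alpha in [0, 10]
    n_iter=4,
    scoring="neg_mean_squared_error",
    cv=3,
    random_state=0,
)
rand_search.fit(X, y)

# best_score_ is a negated MSE, so flip the sign to report MSE.
print("grid:  ", grid_search.best_params_, -grid_search.best_score_)
print("random:", rand_search.best_params_, -rand_search.best_score_)
```

Because the number of sampled candidates is fixed by `n_iter`, randomized search can cover a wide continuous range at a bounded cost, which is one plausible reason it suits a limited compute budget.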

Programs

Programs were developed for linear regression (with several regularization methods) and support vector regression.

Usage:

$ python3 ./LRModelRandCV.py -h

Example

Linear regression

$ python3 ./LRModelGridCV.py -t -n 8 -i ../data/jena_climate_2015_2016.csv

  -t: include the date and time column as a feature; otherwise it is excluded
  -n: number of CPUs used to train the model
  -i: path to the input CSV file

Selection of regularization algorithm

$ python3 ./src/LRModelRandCV.py -t -i ./data/jena_climate_2009_2016.csv -n 8 -m lr_ri
  -m: model to use: lr_ri for ridge, lr_la for lasso, lr_enet for elastic net, or lr_sgd for the stochastic gradient descent regressor
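For readers who want to see how flags like these are typically wired up, here is a hypothetical argparse sketch. It mirrors the flags shown in the examples but is not the parser used by the scripts in this repository; the destination names, defaults, and help strings are assumptions.

```python
import argparse

# Model keys as listed for -m; the descriptions are for display only.
MODEL_CHOICES = {
    "lr_ri": "ridge",
    "lr_la": "lasso",
    "lr_enet": "elastic net",
    "lr_sgd": "stochastic gradient descent regressor",
}


def build_parser():
    """Build a hypothetical parser that accepts -t, -n, -i and -m."""
    parser = argparse.ArgumentParser(
        description="Hypothetical CLI sketch; not the repository's parser."
    )
    parser.add_argument("-t", dest="use_datetime", action="store_true",
                        help="include the date and time column as a feature")
    parser.add_argument("-n", dest="n_cpus", type=int, default=1,
                        help="number of CPUs used to train the model")
    parser.add_argument("-i", dest="input_csv", required=True,
                        help="path to the input CSV file")
    parser.add_argument("-m", dest="model", choices=sorted(MODEL_CHOICES),
                        default="lr_ri", help="regularization variant")
    return parser


# Parse the same arguments as the command shown above.
args = build_parser().parse_args(
    ["-t", "-i", "./data/jena_climate_2009_2016.csv", "-n", "8", "-m", "lr_ri"]
)
print(args.use_datetime, args.n_cpus, args.input_csv, MODEL_CHOICES[args.model])
```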

Support vector regressor

$ python3 ./src/SVR.py -t -i ./data/jena_climate_2009_2016.csv -n 8

Performance

Ordinary linear regression showed the best performance, with an MSE of 3.10.
Several variants of linear regression (lasso, ridge, elastic net, and stochastic gradient descent) were tested, but none improved on ordinary linear regression. Lasso performed worst among these variants, with an MSE of 3.53; the remaining variants performed similarly to or worse than ordinary linear regression. A randomized search with cross-validation was used to find the best value of the alpha parameter for Ridge and Lasso. The best performance was obtained with alpha = 0, which indicates that the regularization term does not help achieve better performance.
Multiple parameters, such as alpha and the l₁ ratio, were explored for elastic net and SGD using the same search strategy. The best results for these methods were 3.27 (elastic net with alpha=0.58 and l₁ ratio=0.03) and 3.19 (SGD with alpha=0.58, l₁ penalty, and l₁ ratio=0.51). These results did not beat ordinary linear regression.
SVR took multiple weeks for training and validation; however, it showed the worst performance overall, with an MSE of 14.31.
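The observation that alpha = 0 gave the best Ridge result means the tuned model reduces to ordinary least squares. The sketch below checks this with the closed-form ridge solution on synthetic data. It is an illustration under stated assumptions: the data are random, there is no intercept term, and `ridge_closed_form` is a hypothetical helper, not a function from this repository.

```python
import numpy as np


def ridge_closed_form(X, y, alpha):
    """Solve (X^T X + alpha * I) w = X^T y for w (no intercept term)."""
    n_features = X.shape[1]
    return np.linalg.solve(X.T @ X + alpha * np.eye(n_features), X.T @ y)


# Synthetic data (assumption: stands in for the weather features).
rng = np.random.RandomState(0)
X = rng.normal(size=(100, 3))
y = X @ np.array([1.5, -2.0, 0.5]) + 0.1 * rng.normal(size=100)

w_ols = np.linalg.lstsq(X, y, rcond=None)[0]       # ordinary least squares
w_ridge0 = ridge_closed_form(X, y, alpha=0.0)      # ridge with no penalty
w_ridge10 = ridge_closed_form(X, y, alpha=10.0)    # ridge with a penalty

# With alpha = 0 the ridge weights coincide with the OLS weights.
print(np.allclose(w_ols, w_ridge0))
# A positive alpha shrinks the weight vector toward zero.
print(np.linalg.norm(w_ridge10) < np.linalg.norm(w_ols))
```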

Disclaimer

Opinions expressed are solely my own and do not express the views or opinions of my employer. The author assumes no responsibility or liability for any errors or omissions in the content of this site. The information contained in this site is provided on an “as is” basis with no guarantees of completeness, accuracy, usefulness or timeliness.
