The University of Southampton
University of Southampton Institutional Repository

Advanced hyperparameter optimization of deep learning models for wind power prediction

Advanced hyperparameter optimization of deep learning models for wind power prediction
Advanced hyperparameter optimization of deep learning models for wind power prediction

The uncertainty of wind power as the main obstacle of its integration into the power grid can be addressed by an accurate and efficient wind power forecast. Among the various wind power forecasting methods, machine learning (ML) algorithms, are recognized as a powerful wind power forecasting tool, however, their performance is highly dependent on the proper tuning of their hyperparameters. Common hyperparameter tuning methods such as grid search or random search are time-consuming, computationally expensive, and unreliable for complex models such as deep learning neural networks. Therefore, there is an urgent need for automatic methods to discover optimal hyperparameters for higher accuracy and efficiency of prediction models. In this study, a novel investigation is contributed to the field of wind power forecasting by a comprehensive comparison of three advanced techniques – Scikit-opt, Optuna, and Hyperopt – for hyperparameter optimization of Convolutional Neural Network (CNN) and Long Short-Term Memory Network (LSTM) models, a facet that, to our knowledge, has not been systematically explored in existing literature. The impact of these optimization techniques on the accuracy and efficiency of the CNN and LSTM models are assessed by comparing the root mean square error (RMSE) of the predictions and the required time to tune the models. The results show that the Optuna algorithm, using a Tree-structured Parzen Estimator (TPE) search method and Expected Improvement (EI) acquisition function, has the best efficiency for both CNN and LSTM models. In terms of accuracy, it is demonstrated that while for the CNN model all the optimization methods achieve similar performances, the LSTM model optimized by the Hyperopt algorithm, based on the annealing search method, results in the highest accuracy. In addition, for the first time in this research, the impact of the random initialization features on the performance of the forecasting models with neural networks is investigated. The proposed structures for deep learning models were examined to determine the most robust structure with the minimal sensitivity to the randomness. What we have discovered from the comparison of advanced hyperparameter optimization methods can be used by researchers to tune the time series-based forecasting models.

Acquisition function, Bayesian optimization, Gaussian process, Hyperparameter optimization, Random initialization, Wind power forecasting
0960-1481
Hanifi, Shahram
f15c3cbe-01ae-4540-ba04-868b18eb9fbe
Cammarono, Andrea
a2eadfda-5e98-4b85-a8fc-97d003bccc74
Zare-Behtash, Hossein
74be9b97-cb09-49c6-9f75-7ec58c0dd16c
Hanifi, Shahram
f15c3cbe-01ae-4540-ba04-868b18eb9fbe
Cammarono, Andrea
a2eadfda-5e98-4b85-a8fc-97d003bccc74
Zare-Behtash, Hossein
74be9b97-cb09-49c6-9f75-7ec58c0dd16c

Hanifi, Shahram, Cammarono, Andrea and Zare-Behtash, Hossein (2023) Advanced hyperparameter optimization of deep learning models for wind power prediction. Renewable Energy, 221, [119700]. (doi:10.1016/j.renene.2023.119700).

Record type: Article

Abstract

The uncertainty of wind power as the main obstacle of its integration into the power grid can be addressed by an accurate and efficient wind power forecast. Among the various wind power forecasting methods, machine learning (ML) algorithms, are recognized as a powerful wind power forecasting tool, however, their performance is highly dependent on the proper tuning of their hyperparameters. Common hyperparameter tuning methods such as grid search or random search are time-consuming, computationally expensive, and unreliable for complex models such as deep learning neural networks. Therefore, there is an urgent need for automatic methods to discover optimal hyperparameters for higher accuracy and efficiency of prediction models. In this study, a novel investigation is contributed to the field of wind power forecasting by a comprehensive comparison of three advanced techniques – Scikit-opt, Optuna, and Hyperopt – for hyperparameter optimization of Convolutional Neural Network (CNN) and Long Short-Term Memory Network (LSTM) models, a facet that, to our knowledge, has not been systematically explored in existing literature. The impact of these optimization techniques on the accuracy and efficiency of the CNN and LSTM models are assessed by comparing the root mean square error (RMSE) of the predictions and the required time to tune the models. The results show that the Optuna algorithm, using a Tree-structured Parzen Estimator (TPE) search method and Expected Improvement (EI) acquisition function, has the best efficiency for both CNN and LSTM models. In terms of accuracy, it is demonstrated that while for the CNN model all the optimization methods achieve similar performances, the LSTM model optimized by the Hyperopt algorithm, based on the annealing search method, results in the highest accuracy. In addition, for the first time in this research, the impact of the random initialization features on the performance of the forecasting models with neural networks is investigated. The proposed structures for deep learning models were examined to determine the most robust structure with the minimal sensitivity to the randomness. What we have discovered from the comparison of advanced hyperparameter optimization methods can be used by researchers to tune the time series-based forecasting models.

Text
1-s2.0-S0960148123016154-main - Version of Record
Available under License Creative Commons Attribution.
Download (2MB)

More information

Accepted/In Press date: 19 November 2023
e-pub ahead of print date: 2 December 2023
Published date: 8 December 2023
Keywords: Acquisition function, Bayesian optimization, Gaussian process, Hyperparameter optimization, Random initialization, Wind power forecasting

Identifiers

Local EPrints ID: 491076
URI: http://eprints.soton.ac.uk/id/eprint/491076
ISSN: 0960-1481
PURE UUID: 18570431-1d60-442e-bf84-da4936bd583d
ORCID for Hossein Zare-Behtash: ORCID iD orcid.org/0000-0002-4769-4076

Catalogue record

Date deposited: 11 Jun 2024 23:51
Last modified: 15 Jun 2024 02:11

Export record

Altmetrics

Contributors

Author: Shahram Hanifi
Author: Andrea Cammarono
Author: Hossein Zare-Behtash ORCID iD

Download statistics

Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.

View more statistics

Atom RSS 1.0 RSS 2.0

Contact ePrints Soton: eprints@soton.ac.uk

ePrints Soton supports OAI 2.0 with a base URL of http://eprints.soton.ac.uk/cgi/oai2

This repository has been built using EPrints software, developed at the University of Southampton, but available to everyone to use.

We use cookies to ensure that we give you the best experience on our website. If you continue without changing your settings, we will assume that you are happy to receive cookies on the University of Southampton website.

×