Advanced hyperparameter optimization of deep learning models for wind power prediction
Advanced hyperparameter optimization of deep learning models for wind power prediction
The uncertainty of wind power as the main obstacle of its integration into the power grid can be addressed by an accurate and efficient wind power forecast. Among the various wind power forecasting methods, machine learning (ML) algorithms, are recognized as a powerful wind power forecasting tool, however, their performance is highly dependent on the proper tuning of their hyperparameters. Common hyperparameter tuning methods such as grid search or random search are time-consuming, computationally expensive, and unreliable for complex models such as deep learning neural networks. Therefore, there is an urgent need for automatic methods to discover optimal hyperparameters for higher accuracy and efficiency of prediction models. In this study, a novel investigation is contributed to the field of wind power forecasting by a comprehensive comparison of three advanced techniques – Scikit-opt, Optuna, and Hyperopt – for hyperparameter optimization of Convolutional Neural Network (CNN) and Long Short-Term Memory Network (LSTM) models, a facet that, to our knowledge, has not been systematically explored in existing literature. The impact of these optimization techniques on the accuracy and efficiency of the CNN and LSTM models are assessed by comparing the root mean square error (RMSE) of the predictions and the required time to tune the models. The results show that the Optuna algorithm, using a Tree-structured Parzen Estimator (TPE) search method and Expected Improvement (EI) acquisition function, has the best efficiency for both CNN and LSTM models. In terms of accuracy, it is demonstrated that while for the CNN model all the optimization methods achieve similar performances, the LSTM model optimized by the Hyperopt algorithm, based on the annealing search method, results in the highest accuracy. In addition, for the first time in this research, the impact of the random initialization features on the performance of the forecasting models with neural networks is investigated. The proposed structures for deep learning models were examined to determine the most robust structure with the minimal sensitivity to the randomness. What we have discovered from the comparison of advanced hyperparameter optimization methods can be used by researchers to tune the time series-based forecasting models.
Acquisition function, Bayesian optimization, Gaussian process, Hyperparameter optimization, Random initialization, Wind power forecasting
Hanifi, Shahram
f15c3cbe-01ae-4540-ba04-868b18eb9fbe
Cammarono, Andrea
a2eadfda-5e98-4b85-a8fc-97d003bccc74
Zare-Behtash, Hossein
74be9b97-cb09-49c6-9f75-7ec58c0dd16c
8 December 2023
Hanifi, Shahram
f15c3cbe-01ae-4540-ba04-868b18eb9fbe
Cammarono, Andrea
a2eadfda-5e98-4b85-a8fc-97d003bccc74
Zare-Behtash, Hossein
74be9b97-cb09-49c6-9f75-7ec58c0dd16c
Hanifi, Shahram, Cammarono, Andrea and Zare-Behtash, Hossein
(2023)
Advanced hyperparameter optimization of deep learning models for wind power prediction.
Renewable Energy, 221, [119700].
(doi:10.1016/j.renene.2023.119700).
Abstract
The uncertainty of wind power as the main obstacle of its integration into the power grid can be addressed by an accurate and efficient wind power forecast. Among the various wind power forecasting methods, machine learning (ML) algorithms, are recognized as a powerful wind power forecasting tool, however, their performance is highly dependent on the proper tuning of their hyperparameters. Common hyperparameter tuning methods such as grid search or random search are time-consuming, computationally expensive, and unreliable for complex models such as deep learning neural networks. Therefore, there is an urgent need for automatic methods to discover optimal hyperparameters for higher accuracy and efficiency of prediction models. In this study, a novel investigation is contributed to the field of wind power forecasting by a comprehensive comparison of three advanced techniques – Scikit-opt, Optuna, and Hyperopt – for hyperparameter optimization of Convolutional Neural Network (CNN) and Long Short-Term Memory Network (LSTM) models, a facet that, to our knowledge, has not been systematically explored in existing literature. The impact of these optimization techniques on the accuracy and efficiency of the CNN and LSTM models are assessed by comparing the root mean square error (RMSE) of the predictions and the required time to tune the models. The results show that the Optuna algorithm, using a Tree-structured Parzen Estimator (TPE) search method and Expected Improvement (EI) acquisition function, has the best efficiency for both CNN and LSTM models. In terms of accuracy, it is demonstrated that while for the CNN model all the optimization methods achieve similar performances, the LSTM model optimized by the Hyperopt algorithm, based on the annealing search method, results in the highest accuracy. In addition, for the first time in this research, the impact of the random initialization features on the performance of the forecasting models with neural networks is investigated. The proposed structures for deep learning models were examined to determine the most robust structure with the minimal sensitivity to the randomness. What we have discovered from the comparison of advanced hyperparameter optimization methods can be used by researchers to tune the time series-based forecasting models.
Text
1-s2.0-S0960148123016154-main
- Version of Record
More information
Accepted/In Press date: 19 November 2023
e-pub ahead of print date: 2 December 2023
Published date: 8 December 2023
Keywords:
Acquisition function, Bayesian optimization, Gaussian process, Hyperparameter optimization, Random initialization, Wind power forecasting
Identifiers
Local EPrints ID: 491076
URI: http://eprints.soton.ac.uk/id/eprint/491076
ISSN: 0960-1481
PURE UUID: 18570431-1d60-442e-bf84-da4936bd583d
Catalogue record
Date deposited: 11 Jun 2024 23:51
Last modified: 15 Jun 2024 02:11
Export record
Altmetrics
Contributors
Author:
Shahram Hanifi
Author:
Andrea Cammarono
Author:
Hossein Zare-Behtash
Download statistics
Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.
View more statistics