Advancements in solar spectral irradiance modelling for photovoltaic systems: a machine learning approach utilising on-site data
Advancements in solar spectral irradiance modelling for photovoltaic systems: a machine learning approach utilising on-site data
Energy yield estimation for photovoltaics (PV) plays a crucial role in the growth of renewable energy. To reduce uncertainty in these estimations, having a spectral resolved irradiance is key. In the field of PV, radiative transfer models (RTMs) and spectroradiometers are commonly utilised to determine spectral solar irradiance, which is crucial for assessing spectral effects. However, these methodologies have inherent limitations; RTMs require precise and complex inputs of aerosol and meteorological data, while spectroradiometers entail significant costs. With the advancement of machine learning (ML) techniques, a data-driven spectral irradiance model is proposed in this study, which only requires the global horizontal irradiance (GHI) measured by pyranometer and the reference cell as input. Spectral data and meteorological data collected by Solar Energy Research Institute of Singapore (SERIS) at four sites across three continents are used for the training and testing of our models. We examined the viability on spectra modelling of three ML techniques including Long Short-Term Memory networks (LSTM), Random Forest (RF) algorithms and Extreme Gradient Boost (XGBoost). XGBoost achieves relatively good accuracy; additionally, the computational cost is much lower compared to LSTM and RF. The proposed ML model shows an overall R2 of 0.974 in comparison with 0.646 of the SMARTS model in the spectrum range 350.4–1052.4 nm. The ML models outperform the SMARTS model particularly under intermediate and overcast conditions. We have also shown that a model trained on data from a specific site cannot be effectively applied to other locations.
Zhang, Haoxiang
a65bbdf9-00a7-4f83-9c4b-d30db66cd471
Chaudhary, Sunny
25f0d213-03ef-4909-8cfc-29a8498aa28f
Rodríguez-Gallegos, Carlos D.
e4bdbae0-60fe-4f41-ab82-d58c6d3267e7
Rahman, Tasmiat
e7432efa-2683-484d-9ec6-2f9c568d30cd
12 June 2025
Zhang, Haoxiang
a65bbdf9-00a7-4f83-9c4b-d30db66cd471
Chaudhary, Sunny
25f0d213-03ef-4909-8cfc-29a8498aa28f
Rodríguez-Gallegos, Carlos D.
e4bdbae0-60fe-4f41-ab82-d58c6d3267e7
Rahman, Tasmiat
e7432efa-2683-484d-9ec6-2f9c568d30cd
Zhang, Haoxiang, Chaudhary, Sunny, Rodríguez-Gallegos, Carlos D. and Rahman, Tasmiat
(2025)
Advancements in solar spectral irradiance modelling for photovoltaic systems: a machine learning approach utilising on-site data.
Next Energy, 8, [100320].
(doi:10.1016/j.nxener.2025.100320).
Abstract
Energy yield estimation for photovoltaics (PV) plays a crucial role in the growth of renewable energy. To reduce uncertainty in these estimations, having a spectral resolved irradiance is key. In the field of PV, radiative transfer models (RTMs) and spectroradiometers are commonly utilised to determine spectral solar irradiance, which is crucial for assessing spectral effects. However, these methodologies have inherent limitations; RTMs require precise and complex inputs of aerosol and meteorological data, while spectroradiometers entail significant costs. With the advancement of machine learning (ML) techniques, a data-driven spectral irradiance model is proposed in this study, which only requires the global horizontal irradiance (GHI) measured by pyranometer and the reference cell as input. Spectral data and meteorological data collected by Solar Energy Research Institute of Singapore (SERIS) at four sites across three continents are used for the training and testing of our models. We examined the viability on spectra modelling of three ML techniques including Long Short-Term Memory networks (LSTM), Random Forest (RF) algorithms and Extreme Gradient Boost (XGBoost). XGBoost achieves relatively good accuracy; additionally, the computational cost is much lower compared to LSTM and RF. The proposed ML model shows an overall R2 of 0.974 in comparison with 0.646 of the SMARTS model in the spectrum range 350.4–1052.4 nm. The ML models outperform the SMARTS model particularly under intermediate and overcast conditions. We have also shown that a model trained on data from a specific site cannot be effectively applied to other locations.
Text
1-s2.0-S2949821X25000833-main
- Version of Record
More information
Accepted/In Press date: 12 May 2025
e-pub ahead of print date: 12 June 2025
Published date: 12 June 2025
Identifiers
Local EPrints ID: 504385
URI: http://eprints.soton.ac.uk/id/eprint/504385
ISSN: 2949-821X
PURE UUID: 5e6d3f55-b693-4001-9702-8d811088e00c
Catalogue record
Date deposited: 08 Sep 2025 17:02
Last modified: 17 Sep 2025 02:14
Export record
Altmetrics
Contributors
Author:
Haoxiang Zhang
Author:
Sunny Chaudhary
Author:
Carlos D. Rodríguez-Gallegos
Author:
Tasmiat Rahman
Download statistics
Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.
View more statistics