The University of Southampton
University of Southampton Institutional Repository

Advancements in solar spectral irradiance modelling for photovoltaic systems: a machine learning approach utilising on-site data

Advancements in solar spectral irradiance modelling for photovoltaic systems: a machine learning approach utilising on-site data
Advancements in solar spectral irradiance modelling for photovoltaic systems: a machine learning approach utilising on-site data
Energy yield estimation for photovoltaics (PV) plays a crucial role in the growth of renewable energy. To reduce uncertainty in these estimations, having a spectral resolved irradiance is key. In the field of PV, radiative transfer models (RTMs) and spectroradiometers are commonly utilised to determine spectral solar irradiance, which is crucial for assessing spectral effects. However, these methodologies have inherent limitations; RTMs require precise and complex inputs of aerosol and meteorological data, while spectroradiometers entail significant costs. With the advancement of machine learning (ML) techniques, a data-driven spectral irradiance model is proposed in this study, which only requires the global horizontal irradiance (GHI) measured by pyranometer and the reference cell as input. Spectral data and meteorological data collected by Solar Energy Research Institute of Singapore (SERIS) at four sites across three continents are used for the training and testing of our models. We examined the viability on spectra modelling of three ML techniques including Long Short-Term Memory networks (LSTM), Random Forest (RF) algorithms and Extreme Gradient Boost (XGBoost). XGBoost achieves relatively good accuracy; additionally, the computational cost is much lower compared to LSTM and RF. The proposed ML model shows an overall R2 of 0.974 in comparison with 0.646 of the SMARTS model in the spectrum range 350.4–1052.4 nm. The ML models outperform the SMARTS model particularly under intermediate and overcast conditions. We have also shown that a model trained on data from a specific site cannot be effectively applied to other locations.
2949-821X
Zhang, Haoxiang
a65bbdf9-00a7-4f83-9c4b-d30db66cd471
Chaudhary, Sunny
25f0d213-03ef-4909-8cfc-29a8498aa28f
Rodríguez-Gallegos, Carlos D.
e4bdbae0-60fe-4f41-ab82-d58c6d3267e7
Rahman, Tasmiat
e7432efa-2683-484d-9ec6-2f9c568d30cd
Zhang, Haoxiang
a65bbdf9-00a7-4f83-9c4b-d30db66cd471
Chaudhary, Sunny
25f0d213-03ef-4909-8cfc-29a8498aa28f
Rodríguez-Gallegos, Carlos D.
e4bdbae0-60fe-4f41-ab82-d58c6d3267e7
Rahman, Tasmiat
e7432efa-2683-484d-9ec6-2f9c568d30cd

Zhang, Haoxiang, Chaudhary, Sunny, Rodríguez-Gallegos, Carlos D. and Rahman, Tasmiat (2025) Advancements in solar spectral irradiance modelling for photovoltaic systems: a machine learning approach utilising on-site data. Next Energy, 8, [100320]. (doi:10.1016/j.nxener.2025.100320).

Record type: Article

Abstract

Energy yield estimation for photovoltaics (PV) plays a crucial role in the growth of renewable energy. To reduce uncertainty in these estimations, having a spectral resolved irradiance is key. In the field of PV, radiative transfer models (RTMs) and spectroradiometers are commonly utilised to determine spectral solar irradiance, which is crucial for assessing spectral effects. However, these methodologies have inherent limitations; RTMs require precise and complex inputs of aerosol and meteorological data, while spectroradiometers entail significant costs. With the advancement of machine learning (ML) techniques, a data-driven spectral irradiance model is proposed in this study, which only requires the global horizontal irradiance (GHI) measured by pyranometer and the reference cell as input. Spectral data and meteorological data collected by Solar Energy Research Institute of Singapore (SERIS) at four sites across three continents are used for the training and testing of our models. We examined the viability on spectra modelling of three ML techniques including Long Short-Term Memory networks (LSTM), Random Forest (RF) algorithms and Extreme Gradient Boost (XGBoost). XGBoost achieves relatively good accuracy; additionally, the computational cost is much lower compared to LSTM and RF. The proposed ML model shows an overall R2 of 0.974 in comparison with 0.646 of the SMARTS model in the spectrum range 350.4–1052.4 nm. The ML models outperform the SMARTS model particularly under intermediate and overcast conditions. We have also shown that a model trained on data from a specific site cannot be effectively applied to other locations.

Text
1-s2.0-S2949821X25000833-main - Version of Record
Available under License Creative Commons Attribution.
Download (8MB)

More information

Accepted/In Press date: 12 May 2025
e-pub ahead of print date: 12 June 2025
Published date: 12 June 2025

Identifiers

Local EPrints ID: 504385
URI: http://eprints.soton.ac.uk/id/eprint/504385
ISSN: 2949-821X
PURE UUID: 5e6d3f55-b693-4001-9702-8d811088e00c
ORCID for Sunny Chaudhary: ORCID iD orcid.org/0000-0003-2664-7083
ORCID for Tasmiat Rahman: ORCID iD orcid.org/0000-0002-6485-2128

Catalogue record

Date deposited: 08 Sep 2025 17:02
Last modified: 17 Sep 2025 02:14

Export record

Altmetrics

Contributors

Author: Haoxiang Zhang
Author: Sunny Chaudhary ORCID iD
Author: Carlos D. Rodríguez-Gallegos
Author: Tasmiat Rahman ORCID iD

Download statistics

Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.

View more statistics

Atom RSS 1.0 RSS 2.0

Contact ePrints Soton: eprints@soton.ac.uk

ePrints Soton supports OAI 2.0 with a base URL of http://eprints.soton.ac.uk/cgi/oai2

This repository has been built using EPrints software, developed at the University of Southampton, but available to everyone to use.

We use cookies to ensure that we give you the best experience on our website. If you continue without changing your settings, we will assume that you are happy to receive cookies on the University of Southampton website.

×