Interpretable XGBoost-based predictions of shear wave velocity from CPTu data
Interpretable XGBoost-based predictions of shear wave velocity from CPTu data
Accurate estimation of shear wave velocity (Vs) is critical for offshore geotechnical design, yet direct measurements remain sparse due to cost and logistical constraints. Empirical correlations of Vs from cone penetration test data are derived for specific conditions, thus introducing uncertainty when applied more generally. Machine learning (ML) based correlations have become popular, yet to date have prioritised accuracy over interpretability. To address this gap and enhance transparency, this study integrates the computationally efficient XGBoost technique with SHapley Additive exPlanations (SHAP) to predict Vs and attribute prediction contributions to individual input features. A combined open-source dataset of 7485 paired cone penetration test data with pore pressure measurement (CPTu) and Vs measurements was integrated and used for training and validation. SHAP analysis on a testing dataset of 1526 samples shows that depth, corrected cone resistance (qt), and sleeve friction (fs) are the most influential features, with depth increasing in importance when the Vs predicted deviates from the mean Vs in the training dataset. Compared to a widely used empirical correlation, the ML approach demonstrated superior accuracy across most cases, while also offering insight into decision-making logic. This study highlights the value of interpretable ML in offshore site investigations, with this specific CPTu-Vs interpretable ML model particularly relevant to bottom-fixed foundation designs for offshore wind developments, which are governed by stiffness criteria, in Vs data-limited projects.
Machine learning interpretability, Offshore wind, Seismic and standard piezocone tests, Shear wave velocity, XGBoost
Marín-Moreno, Héctor
e466cafd-bd5c-47a1-8522-e6938e7086a4
Willis, James
03095bd6-11a8-4f84-8e48-5b6009184faf
Zhang, Yuting
821b7687-fe98-4525-b641-2ea503797319
Gourvenec, Susan
6ff91ad8-1a91-42fe-a3f4-1b5d6f5ce0b8
Marín-Moreno, Héctor
e466cafd-bd5c-47a1-8522-e6938e7086a4
Willis, James
03095bd6-11a8-4f84-8e48-5b6009184faf
Zhang, Yuting
821b7687-fe98-4525-b641-2ea503797319
Gourvenec, Susan
6ff91ad8-1a91-42fe-a3f4-1b5d6f5ce0b8
Marín-Moreno, Héctor, Willis, James, Zhang, Yuting and Gourvenec, Susan
(2025)
Interpretable XGBoost-based predictions of shear wave velocity from CPTu data.
Marine Geophysical Research, 47 (1), [5].
(doi:10.1007/s11001-025-09602-6).
Abstract
Accurate estimation of shear wave velocity (Vs) is critical for offshore geotechnical design, yet direct measurements remain sparse due to cost and logistical constraints. Empirical correlations of Vs from cone penetration test data are derived for specific conditions, thus introducing uncertainty when applied more generally. Machine learning (ML) based correlations have become popular, yet to date have prioritised accuracy over interpretability. To address this gap and enhance transparency, this study integrates the computationally efficient XGBoost technique with SHapley Additive exPlanations (SHAP) to predict Vs and attribute prediction contributions to individual input features. A combined open-source dataset of 7485 paired cone penetration test data with pore pressure measurement (CPTu) and Vs measurements was integrated and used for training and validation. SHAP analysis on a testing dataset of 1526 samples shows that depth, corrected cone resistance (qt), and sleeve friction (fs) are the most influential features, with depth increasing in importance when the Vs predicted deviates from the mean Vs in the training dataset. Compared to a widely used empirical correlation, the ML approach demonstrated superior accuracy across most cases, while also offering insight into decision-making logic. This study highlights the value of interpretable ML in offshore site investigations, with this specific CPTu-Vs interpretable ML model particularly relevant to bottom-fixed foundation designs for offshore wind developments, which are governed by stiffness criteria, in Vs data-limited projects.
Text
s11001-025-09602-6
- Version of Record
More information
e-pub ahead of print date: 20 December 2025
Keywords:
Machine learning interpretability, Offshore wind, Seismic and standard piezocone tests, Shear wave velocity, XGBoost
Identifiers
Local EPrints ID: 509016
URI: http://eprints.soton.ac.uk/id/eprint/509016
ISSN: 0025-3235
PURE UUID: a74a6ee4-ad22-4b37-bcb6-5061b98cc33e
Catalogue record
Date deposited: 10 Feb 2026 17:34
Last modified: 11 Feb 2026 03:16
Export record
Altmetrics
Contributors
Author:
Héctor Marín-Moreno
Author:
James Willis
Author:
Yuting Zhang
Download statistics
Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.
View more statistics