The University of Southampton
University of Southampton Institutional Repository

Interpretable XGBoost-based predictions of shear wave velocity from CPTu data

Interpretable XGBoost-based predictions of shear wave velocity from CPTu data
Interpretable XGBoost-based predictions of shear wave velocity from CPTu data
Accurate estimation of shear wave velocity (Vs) is critical for offshore geotechnical design, yet direct measurements remain sparse due to cost and logistical constraints. Empirical correlations of Vs from cone penetration test data are derived for specific conditions, thus introducing uncertainty when applied more generally. Machine learning (ML) based correlations have become popular, yet to date have prioritised accuracy over interpretability. To address this gap and enhance transparency, this study integrates the computationally efficient XGBoost technique with SHapley Additive exPlanations (SHAP) to predict Vs and attribute prediction contributions to individual input features. A combined open-source dataset of 7485 paired cone penetration test data with pore pressure measurement (CPTu) and Vs measurements was integrated and used for training and validation. SHAP analysis on a testing dataset of 1526 samples shows that depth, corrected cone resistance (qt), and sleeve friction (fs) are the most influential features, with depth increasing in importance when the Vs predicted deviates from the mean Vs in the training dataset. Compared to a widely used empirical correlation, the ML approach demonstrated superior accuracy across most cases, while also offering insight into decision-making logic. This study highlights the value of interpretable ML in offshore site investigations, with this specific CPTu-Vs interpretable ML model particularly relevant to bottom-fixed foundation designs for offshore wind developments, which are governed by stiffness criteria, in Vs data-limited projects.

Machine learning interpretability, Offshore wind, Seismic and standard piezocone tests, Shear wave velocity, XGBoost
0025-3235
Marín-Moreno, Héctor
e466cafd-bd5c-47a1-8522-e6938e7086a4
Willis, James
03095bd6-11a8-4f84-8e48-5b6009184faf
Zhang, Yuting
821b7687-fe98-4525-b641-2ea503797319
Gourvenec, Susan
6ff91ad8-1a91-42fe-a3f4-1b5d6f5ce0b8
Marín-Moreno, Héctor
e466cafd-bd5c-47a1-8522-e6938e7086a4
Willis, James
03095bd6-11a8-4f84-8e48-5b6009184faf
Zhang, Yuting
821b7687-fe98-4525-b641-2ea503797319
Gourvenec, Susan
6ff91ad8-1a91-42fe-a3f4-1b5d6f5ce0b8

Marín-Moreno, Héctor, Willis, James, Zhang, Yuting and Gourvenec, Susan (2025) Interpretable XGBoost-based predictions of shear wave velocity from CPTu data. Marine Geophysical Research, 47 (1), [5]. (doi:10.1007/s11001-025-09602-6).

Record type: Article

Abstract

Accurate estimation of shear wave velocity (Vs) is critical for offshore geotechnical design, yet direct measurements remain sparse due to cost and logistical constraints. Empirical correlations of Vs from cone penetration test data are derived for specific conditions, thus introducing uncertainty when applied more generally. Machine learning (ML) based correlations have become popular, yet to date have prioritised accuracy over interpretability. To address this gap and enhance transparency, this study integrates the computationally efficient XGBoost technique with SHapley Additive exPlanations (SHAP) to predict Vs and attribute prediction contributions to individual input features. A combined open-source dataset of 7485 paired cone penetration test data with pore pressure measurement (CPTu) and Vs measurements was integrated and used for training and validation. SHAP analysis on a testing dataset of 1526 samples shows that depth, corrected cone resistance (qt), and sleeve friction (fs) are the most influential features, with depth increasing in importance when the Vs predicted deviates from the mean Vs in the training dataset. Compared to a widely used empirical correlation, the ML approach demonstrated superior accuracy across most cases, while also offering insight into decision-making logic. This study highlights the value of interpretable ML in offshore site investigations, with this specific CPTu-Vs interpretable ML model particularly relevant to bottom-fixed foundation designs for offshore wind developments, which are governed by stiffness criteria, in Vs data-limited projects.

Text
s11001-025-09602-6 - Version of Record
Available under License Creative Commons Attribution.
Download (5MB)

More information

e-pub ahead of print date: 20 December 2025
Keywords: Machine learning interpretability, Offshore wind, Seismic and standard piezocone tests, Shear wave velocity, XGBoost

Identifiers

Local EPrints ID: 509016
URI: http://eprints.soton.ac.uk/id/eprint/509016
ISSN: 0025-3235
PURE UUID: a74a6ee4-ad22-4b37-bcb6-5061b98cc33e
ORCID for Héctor Marín-Moreno: ORCID iD orcid.org/0000-0002-3412-1359
ORCID for Yuting Zhang: ORCID iD orcid.org/0000-0002-5683-7286
ORCID for Susan Gourvenec: ORCID iD orcid.org/0000-0002-2628-7914

Catalogue record

Date deposited: 10 Feb 2026 17:34
Last modified: 11 Feb 2026 03:16

Export record

Altmetrics

Contributors

Author: Héctor Marín-Moreno ORCID iD
Author: James Willis
Author: Yuting Zhang ORCID iD
Author: Susan Gourvenec ORCID iD

Download statistics

Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.

View more statistics

Atom RSS 1.0 RSS 2.0

Contact ePrints Soton: eprints@soton.ac.uk

ePrints Soton supports OAI 2.0 with a base URL of http://eprints.soton.ac.uk/cgi/oai2

This repository has been built using EPrints software, developed at the University of Southampton, but available to everyone to use.

We use cookies to ensure that we give you the best experience on our website. If you continue without changing your settings, we will assume that you are happy to receive cookies on the University of Southampton website.

×