Advancing V estimation from CPTu for engineering practice: a data-driven approach
Advancing V estimation from CPTu for engineering practice: a data-driven approach
Shear wave velocity, V s, is a critical parameter for offshore site characterisation to estimate the small strain shear modulus, which is essential for subsequent geotechnical designs. Direct measurements of V s are often sparse due to time and resource constraints, while indirect estimations of V s based on empirical correlations can exhibit significant errors. This study presents the performance of 125 models with various combinations of standard piezocone tests (CPTu) input features (e.g., depth, z; sleeve friction resistance, f s; corrected cone tip resistance, q t; and pore pressure at the shoulder of the cone, u 2), CPTu and V s data pairing methods, and prediction techniques (support vector regression (SVR), random forest regression (RFR), extreme gradient boosting regression (XGBR), deep neural network (DNN) and multiple linear regression (MLR)). To do this, we compile a seismic piezocone test (SCPTu) database from onshore and offshore sites across the globe (Netherlands, Austria, Germany, Nepal, and Taipei) and consider five different methods for pairing CPTu data (resolution of 0.02 m) and V s data (resolution of 0.5 m and 1 m depending on the dataset). Two cases consider the more conventional downsampling of CPTu data to V s data. The remaining three methods consider augmented V s data to the resolution of CPTu measurements, to fully utilise all the CPTu data. Results indicate that data augmentation enhances predictive performance. Incorporating pore pressure as an input feature also improves model performance, particularly in cemented materials such as chalk. In contrast, the derived features have a negligible influence. The recommended model combines a DNN with four directly measured CPTu parameters (z,f s,q t,and u 2), and uses an augmentation method that assumes constant V s values within each V s interval. This model achieves a mean absolute error (MAE) of 37.3 m/s and a coefficient of determination (R 2) of 0.59.
Integrated geoscience, Machine learning techniques, Seismic piezocone tests (SCPTu), Shear wave velocity, Standard piezocone tests (CPTu)
Zhang, Yuting
821b7687-fe98-4525-b641-2ea503797319
Marín-Moreno, Héctor
e466cafd-bd5c-47a1-8522-e6938e7086a4
Gourvenec, Susan
6ff91ad8-1a91-42fe-a3f4-1b5d6f5ce0b8
1 February 2026
Zhang, Yuting
821b7687-fe98-4525-b641-2ea503797319
Marín-Moreno, Héctor
e466cafd-bd5c-47a1-8522-e6938e7086a4
Gourvenec, Susan
6ff91ad8-1a91-42fe-a3f4-1b5d6f5ce0b8
Zhang, Yuting, Marín-Moreno, Héctor and Gourvenec, Susan
(2026)
Advancing V estimation from CPTu for engineering practice: a data-driven approach.
Soil Dynamics and Earthquake Engineering, 201, [109972].
(doi:10.1016/j.soildyn.2025.109972).
Abstract
Shear wave velocity, V s, is a critical parameter for offshore site characterisation to estimate the small strain shear modulus, which is essential for subsequent geotechnical designs. Direct measurements of V s are often sparse due to time and resource constraints, while indirect estimations of V s based on empirical correlations can exhibit significant errors. This study presents the performance of 125 models with various combinations of standard piezocone tests (CPTu) input features (e.g., depth, z; sleeve friction resistance, f s; corrected cone tip resistance, q t; and pore pressure at the shoulder of the cone, u 2), CPTu and V s data pairing methods, and prediction techniques (support vector regression (SVR), random forest regression (RFR), extreme gradient boosting regression (XGBR), deep neural network (DNN) and multiple linear regression (MLR)). To do this, we compile a seismic piezocone test (SCPTu) database from onshore and offshore sites across the globe (Netherlands, Austria, Germany, Nepal, and Taipei) and consider five different methods for pairing CPTu data (resolution of 0.02 m) and V s data (resolution of 0.5 m and 1 m depending on the dataset). Two cases consider the more conventional downsampling of CPTu data to V s data. The remaining three methods consider augmented V s data to the resolution of CPTu measurements, to fully utilise all the CPTu data. Results indicate that data augmentation enhances predictive performance. Incorporating pore pressure as an input feature also improves model performance, particularly in cemented materials such as chalk. In contrast, the derived features have a negligible influence. The recommended model combines a DNN with four directly measured CPTu parameters (z,f s,q t,and u 2), and uses an augmentation method that assumes constant V s values within each V s interval. This model achieves a mean absolute error (MAE) of 37.3 m/s and a coefficient of determination (R 2) of 0.59.
Text
1-s2.0-S0267726125007663-main
- Version of Record
More information
Accepted/In Press date: 18 November 2025
e-pub ahead of print date: 22 November 2025
Published date: 1 February 2026
Additional Information:
Publisher Copyright:
© 2025 The Authors.
Keywords:
Integrated geoscience, Machine learning techniques, Seismic piezocone tests (SCPTu), Shear wave velocity, Standard piezocone tests (CPTu)
Identifiers
Local EPrints ID: 507513
URI: http://eprints.soton.ac.uk/id/eprint/507513
ISSN: 0267-7261
PURE UUID: dd752b3d-148f-4e58-a3d5-a7e8e52b6de2
Catalogue record
Date deposited: 10 Dec 2025 17:55
Last modified: 11 Dec 2025 03:14
Export record
Altmetrics
Contributors
Author:
Yuting Zhang
Author:
Héctor Marín-Moreno
Download statistics
Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.
View more statistics