# README File for Scripts Supporting Thesis **Title:** *Data-Driven Modelling of Nonlinear Aerodynamics in High-Speed Aircraft Using Machine Learning* **Dataset DOI:** 10.5258/SOTON/D3670 **Author:** Gabriele Immordino, University of Southampton **ORCID ID:** 0000-0003-2718-0120 **Awarded by:** University of Southampton **Date of Award:** 2025 **Data Collection Period:** September 2021 – September 2025 **Location:** Zurich University of Applied Sciences (ZHAW), Switzerland **Licence:** Creative Commons Attribution (CC-BY) **Funding:** Digitalization Initiative of the Zurich Higher Education Institutions (DIZH) from Zurich University of Applied Sciences (ZHAW), grant 9710.Z.12.P.0003.05 --- ## -------------------- ## DATA & FILE OVERVIEW ## -------------------- This repository provides the **scripts** used to generate the results of the thesis. **Folder structure:** * **Chapter\_4** – *Parametric Nonlinear Volterra Series* * Scripts for kernel extraction from CFD and synthetic systems. * ML-based kernel prediction (FCNN, GPR). * Flutter boundary estimation, including multi-fidelity co-Kriging. * Includes: * Synthetic test case. * 2D CFD test case. * 3D CFD test case. * **Chapter\_5** – *Multi-Fidelity Bayesian Neural Network* * Scripts for probabilistic load prediction in the transonic regime. * Multi-fidelity integration (low, mid, high) with transfer learning. * Bayesian neural networks with uncertainty quantification. * **Chapter\_6** – *Deep Learning for Steady Transonic Flowfields* * Fully connected neural networks for surface field prediction. * Applications to aeroelastic twist and uncertainty quantification. * Evaluation on BSCW, ONERA M6, NASA CRM. * **Chapter\_7** – *Graph-Convolutional Autoencoder (Steady Flowfields)* * GCN-based autoencoder for steady aerodynamic fields on unstructured meshes. * Physics-informed loss and multi-resolution pooling. * **Chapter\_8** – *Spatio-Temporal GCN (Unsteady Flowfields)* * Implements feedforward and ARMA-based STGCN architectures. * Predicts unsteady pressure fields from motion histories. * Includes preprocessing, dimensionality reduction, and autoencoder pretraining pipelines. * **Chapter\_9** – *Graph-Convolutional Autoencoder for Modal Space* * GCN autoencoder for parametrically deformed wing shapes. * Incorporates structural mode amplitudes in the input. * Includes modal sampling, MWLS pooling, and physics-informed training. **Relationship between files:** Each folder corresponds to one thesis chapter and contains notebooks and Python modules to reproduce the relevant experiments. --- ## -------------------------- ## METHODOLOGICAL INFORMATION ## -------------------------- **Approach per chapter:** * *Chapter 4*: Volterra kernel identification (linear/quadratic), ML-based prediction, flutter analysis, and multi-fidelity fusion. * *Chapter 5*: Bayesian neural networks with transfer learning, uncertainty quantification, and Co-Kriging comparison. * *Chapter 6*: Dense FCNNs for steady aerodynamic field regression, with extensions to aeroelastic deformation and uncertainty propagation. * *Chapter 7*: Autoencoder GCN with hierarchical pooling/unpooling, moment-regularised loss, and Bayesian hyperparameter optimisation. * *Chapter 8*: Spatio-temporal GCNs (feedforward/ARMA) with windowed CFD data; integrates motion states with graph-encoded CP fields. * *Chapter 9*: GCN autoencoder on modal-deformed geometries, with MWLS pooling and physics-informed objectives. **Software requirements:** * **General:** Python ≥ 3.9, NumPy, SciPy, Matplotlib, Pandas. * **Machine Learning:** TensorFlow/Keras, PyTorch, PyTorch Geometric, Optuna, GPy. **Workflow (generalised):** 1. **Data preparation:** Preprocessing of CFD outputs (normalisation, mesh connectivity, pressure gradients, modal coordinates). 2. **Model definition:** Notebooks and Python modules specify neural architectures and loss functions. 3. **Training:** Models trained with hyperparameter tuning (Optuna, Keras-tuner). 4. **Evaluation:** Predictions validated against reference CFD; results visualised with provided plotting utilities. **Quality assurance:** * CFD data checked for convergence; unstable snapshots excluded. * ML models benchmarked against baselines (Proper Orthogonal Decomposition with Interpolation (POD+I), Dynamic Mode Decomposition with control (DMDc), Co-Kriging). * Scripts tested on GPU-enabled environments (NVIDIA RTX A4000).