Bayesian design for calibration of physical models
Bayesian design for calibration of physical models
We often want to learn about physical processes that are described by complex nonlinear mathematical models implemented as computer simulators. To use a simulator to make predictions about the real physical process, it is necessary to first perform calibration; that is, to use data obtained from a physical experiment to make inference about unknown parameters whilst acknowledging discrepancies between the simulator and reality. The computational expense of many simulators makes calibration challenging. Thus, usually in calibration, we use a computationally cheaper approximation to the simulator, often referred to as an emulator, constructed by fitting a statistical model to the results of a relatively small computer experiment. Although there is a substantial literature on the choice of the design of the computer experiment, the problem of designing the physical experiment in calibration is much less well-studied. This thesis is concerned with methodology for Bayesian optimal designs for the physical experiment when the aim is estimation of the unknown parameters in the simulator.
Optimal Bayesian design for most realistic statistical models, including those incorporating expensive computer simulators, is complicated by the need to numerically approximate an analytically intractable expected utility; for example, the expected gain in Shannon information from the prior to posterior distribution. The standard approximation method is "double-loop" Monte Carlo integration using nested sampling from the prior distribution. Although this method is easy to implement, it produces biased approximations and is computationally expensive. For the Shannon information gain utility, we propose new approximation methods which combine features of importance sampling and Laplace approximations.
These approximations are then used within an optimisation algorithm to find optimal designs for three problems: (i) estimation of the parameters in a nonlinear regression model; (ii) parameter estimation for a misspecified regression model subject to discrepancy; and (iii) estimation of the calibration parameters for a computational expensive simulator. Through examples, we demonstrate the advantages of this combination of methodology over existing methods.
University of Southampton
Englezou, Yiolanda
49a1a99a-d7f4-4816-850a-ebcf64f12c9e
July 2018
Englezou, Yiolanda
49a1a99a-d7f4-4816-850a-ebcf64f12c9e
Woods, David
ae21f7e2-29d9-4f55-98a2-639c5e44c79c
Englezou, Yiolanda
(2018)
Bayesian design for calibration of physical models.
University of Southampton, Doctoral Thesis, 242pp.
Record type:
Thesis
(Doctoral)
Abstract
We often want to learn about physical processes that are described by complex nonlinear mathematical models implemented as computer simulators. To use a simulator to make predictions about the real physical process, it is necessary to first perform calibration; that is, to use data obtained from a physical experiment to make inference about unknown parameters whilst acknowledging discrepancies between the simulator and reality. The computational expense of many simulators makes calibration challenging. Thus, usually in calibration, we use a computationally cheaper approximation to the simulator, often referred to as an emulator, constructed by fitting a statistical model to the results of a relatively small computer experiment. Although there is a substantial literature on the choice of the design of the computer experiment, the problem of designing the physical experiment in calibration is much less well-studied. This thesis is concerned with methodology for Bayesian optimal designs for the physical experiment when the aim is estimation of the unknown parameters in the simulator.
Optimal Bayesian design for most realistic statistical models, including those incorporating expensive computer simulators, is complicated by the need to numerically approximate an analytically intractable expected utility; for example, the expected gain in Shannon information from the prior to posterior distribution. The standard approximation method is "double-loop" Monte Carlo integration using nested sampling from the prior distribution. Although this method is easy to implement, it produces biased approximations and is computationally expensive. For the Shannon information gain utility, we propose new approximation methods which combine features of importance sampling and Laplace approximations.
These approximations are then used within an optimisation algorithm to find optimal designs for three problems: (i) estimation of the parameters in a nonlinear regression model; (ii) parameter estimation for a misspecified regression model subject to discrepancy; and (iii) estimation of the calibration parameters for a computational expensive simulator. Through examples, we demonstrate the advantages of this combination of methodology over existing methods.
Text
Final PhD thesis YEnglezou
- Version of Record
More information
Published date: July 2018
Identifiers
Local EPrints ID: 427145
URI: http://eprints.soton.ac.uk/id/eprint/427145
PURE UUID: e1554737-e2c9-44dd-8528-0d70a7b658ac
Catalogue record
Date deposited: 03 Jan 2019 17:30
Last modified: 16 Mar 2024 03:15
Export record
Contributors
Author:
Yiolanda Englezou
Download statistics
Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.
View more statistics