Using derivative information in the statistical analysis of computer models
Stephenson, Gemma
1f755d21-7dc3-4e58-942f-47e4d7f197a8
July 2010
Challenor, Peter
a7e71e56-8391-442c-b140-6e4b90c33547
Stephenson, Gemma (2010) Using derivative information in the statistical analysis of computer models. University of Southampton, School of Ocean and Earth Science, Doctoral Thesis, 212pp.
Record type: Thesis (Doctoral)
Abstract
Complex deterministic models are an important tool for studying a wide range of systems. Often, though, such models are too computationally expensive to run as many times as an analysis requires. In this case, one option is to build a Gaussian process emulator, which acts as a surrogate and enables fast prediction of the model output at specified input configurations. Derivative information may be available, either from running an appropriate adjoint model or from some previously performed analysis. An emulator would likely benefit from the inclusion of this derivative information. Whether further efficiency is achieved, however, depends on the relation between the computational cost of obtaining the derivatives and the value of the derivative information in the emulator. In our examples, derivatives are more valuable in models with shorter correlation lengths, and emulators without derivatives tend to require twice as many model runs as emulators with derivatives to achieve similar predictive performance. We conclude that an optimal solution is likely to be a hybrid design consisting of adjoint runs in some parts of the input space and standard model runs in others.
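As a rough illustration of how derivative information can enter an emulator, the sketch below fits a one-dimensional, zero-mean Gaussian process to a toy function twice: once from function values alone, and once from function values plus derivatives at the same design points. The squared-exponential covariance, the fixed hyperparameters, the toy function sin(x), and all function and variable names are assumptions made for this example and are not taken from the thesis.

```python
import numpy as np


def se_kernel(a, b, variance=1.0, length=1.0):
    """Squared-exponential covariance and pairwise differences a_i - b_j."""
    r = a[:, None] - b[None, :]
    return variance * np.exp(-0.5 * (r / length) ** 2), r


def emulator_mean(x_train, y_train, x_new, dy_train=None,
                  variance=1.0, length=1.0, jitter=1e-8):
    """Posterior mean of a zero-mean GP, optionally conditioned on derivatives.

    Hypothetical helper for illustration; hyperparameters are fixed, not estimated.
    """
    k_ff, r = se_kernel(x_train, x_train, variance, length)
    ks_f, rs = se_kernel(x_new, x_train, variance, length)
    if dy_train is None:
        cov, cross, targets = k_ff, ks_f, y_train
    else:
        # cov(f(x_i), f'(x_j)) is the kernel differentiated in its second argument.
        k_fd = k_ff * r / length**2
        # cov(f'(x_i), f'(x_j)) is the mixed second derivative of the kernel.
        k_dd = k_ff * (1.0 / length**2 - r**2 / length**4)
        cov = np.block([[k_ff, k_fd], [k_fd.T, k_dd]])
        cross = np.hstack([ks_f, ks_f * rs / length**2])
        targets = np.concatenate([y_train, dy_train])
    # A small jitter on the diagonal keeps the linear solve numerically stable.
    cov = cov + jitter * np.eye(cov.shape[0])
    return cross @ np.linalg.solve(cov, targets)


# Toy "model": f(x) = sin(x), with derivative cos(x), at four design points.
x_train = np.linspace(0.0, 2.0 * np.pi, 4)
x_new = np.linspace(0.0, 2.0 * np.pi, 50)
truth = np.sin(x_new)

pred_without = emulator_mean(x_train, np.sin(x_train), x_new)
pred_with = emulator_mean(x_train, np.sin(x_train), x_new,
                          dy_train=np.cos(x_train))

rmse_without = np.sqrt(np.mean((pred_without - truth) ** 2))
rmse_with = np.sqrt(np.mean((pred_with - truth) ** 2))
print("RMSE without derivatives:", rmse_without)
print("RMSE with derivatives:   ", rmse_with)
```

Comparing the two errors at a fixed number of design points gives only a feel for the value of the derivatives in this toy setting; it says nothing about the cost of obtaining them, which is the other half of the trade-off described above.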
Knowledge of the derivatives of complex models can add greatly to their utility, for example in sensitivity analysis or data assimilation. One way of generating such derivatives, as suggested above, is to code an adjoint model. Even with automatic differentiation software, this remains a complex task, and the adjoint model, once written, is computationally more demanding.
We suggest an alternative method for generating partial derivatives of complex model output with respect to model inputs. We propose the use of a Gaussian process emulator which, as long as the model is suitable for emulation, can be used to estimate derivatives even when no derivative information is known a priori. We present encouraging results which show how an emulator of derivatives could reduce the demand for writing and running adjoint models, using both toy models and the climate model C-GOLDSTEIN.
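One way to picture estimating derivatives from an emulator built without any derivative data is to differentiate the emulator's posterior mean with respect to the input. The sketch below does this for a one-dimensional, zero-mean Gaussian process; the squared-exponential covariance, the fixed hyperparameters, the toy function sin(x), and every name in the code are assumptions for illustration, not the thesis's implementation.

```python
import numpy as np


def se_kernel(a, b, variance=1.0, length=1.0):
    """Squared-exponential covariance and pairwise differences a_i - b_j."""
    r = a[:, None] - b[None, :]
    return variance * np.exp(-0.5 * (r / length) ** 2), r


def emulator_derivative(x_train, y_train, x_new,
                        variance=1.0, length=1.0, jitter=1e-8):
    """Derivative of a zero-mean GP posterior mean, from function values only.

    Hypothetical helper for illustration; hyperparameters are fixed, not estimated.
    """
    k_train, _ = se_kernel(x_train, x_train, variance, length)
    weights = np.linalg.solve(k_train + jitter * np.eye(len(x_train)), y_train)
    k_new, r_new = se_kernel(x_new, x_train, variance, length)
    # Differentiate the kernel with respect to the prediction input x_new.
    dk_new = -k_new * r_new / length**2
    return dk_new @ weights


# Toy "model": f(x) = sin(x); only function values are used for training.
x_train = np.linspace(0.0, 2.0 * np.pi, 10)
y_train = np.sin(x_train)

# Check the estimate against the known derivative cos(x) away from the edges.
x_new = np.linspace(1.0, 5.0, 9)
estimated = emulator_derivative(x_train, y_train, x_new)
max_abs_error = np.max(np.abs(estimated - np.cos(x_new)))
print("Largest absolute error in the estimated derivative:", max_abs_error)
```

Because differentiation is a linear operation, the derivative of a Gaussian process is itself a Gaussian process, so the same construction could in principle also yield an uncertainty for the derivative estimate; only the mean is shown here.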
Text
GEMMAthesis.pdf
- Other
More information
Published date: July 2010
Organisations: University of Southampton
Identifiers
Local EPrints ID: 169027
URI: http://eprints.soton.ac.uk/id/eprint/169027
PURE UUID: 588ecd76-19ce-4fb1-ad5b-d43439cce50c
Catalogue record
Date deposited: 08 Dec 2010 14:13
Last modified: 14 Mar 2024 02:19
Contributors
Author: Gemma Stephenson
Thesis advisor: Peter Challenor