The University of Southampton
University of Southampton Institutional Repository

Bayesian analysis of daily maximum ozone levels

Bayesian analysis of daily maximum ozone levels
Bayesian analysis of daily maximum ozone levels
Ground level ozone is one of the six criteria primary pollutants that is monitored by the United States Environmental Protection Agency. Statistical methods are increasingly being used to model ground level ozone concentration data. This thesis is motivated by the need to perform practical data analysis, and to develop methods for modelling of ozone concentration data observed over a vast study region in the eastern United States (US.

For the purposes of analysis, we use two space-time modelling strategies: the dynamic linear models (DLM) and the auto-regressive (AR) models and obtain predictions and forecasts for set aside validation data. These methods are developed under the Bayesian paradigm and MCMC sampling techniques are used to explore the posterior and predictive distributions. Particularly, for analysis, we use a subset data set from the state of New York to illustrate the methods. Both the DLM and AR modelling approaches are compared in detail using the predictive and forecast distributions induced by them. The comparisons are facilitated by a number of theoretical results. These show better properties for the AR models under some conditions, which have been shown to hold for the real life example that we considered. To address the challenge of modelling large dimensional spatio-temporal ozone concentration data, we adopt Gaussian predictive processes (GPP) technique and propose a rich hierarchical spatio-temporal AR model. The important utility of this method lies in the ability to predict the primary ozone standard at any given location for the modelled period from 1997-2006 in the eastern US. Different sensitivity analyses are performed, and, in addition, hold-out data sets are used for model validation. Specifically, this new modelling approach has been illustrated for evaluating meteorologically adjusted trends in the primary ozone standard in the eastern US over the 10 year period. This helps in understanding spatial patterns and trends in ozone levels, which in turn will help in evaluating emission reduction policies that directly affect many industries. Forecasting of ozone levels is also an important problem in air pollution monitoring. We compare different spatio-temporal models for their forecasting abilities. The GPP based models provide the best forecast for set aside validation data. In addition, in this thesis we use computer simulation model output as an explanatory variable for modelling the observed ozone data.

Thus, the proposed methods can also be seen as a spatio-temporal downscaler model for incorporating output from numerical models, where the grid-level output from numerical models is used as a covariate in the point level model for observed data. This type of space and time varying covariate information enriches the regression settings like the methods used in this thesis. Currently there is no package available that can fit space-time environmental data using Bayesian hierarchical spatio-temporal models. In this thesis we, therefore, develop a software package named spTimer in R. The spTimer package with its ability to fit, predict and forecast using a number of Bayesian hierarchical space-time models can be used for modelling a wide variety of large space-time environmental data. This package is built in C language to be computationally efficient. However, this C-code is hidden from the user and the methods can be implemented by anyone familiar with the R language. This thesis can be extended in several ways for example, for multivariate data, for non-Gaussian first stage data, and for data observed in environmental monitoring of stream networks.
Bakar, K.S.
85166621-61c6-4d76-a274-3f3b57a36f56
Bakar, K.S.
85166621-61c6-4d76-a274-3f3b57a36f56
Sahu, Sujit K.
33f1386d-6d73-4b60-a796-d626721f72bf

Bakar, K.S. (2012) Bayesian analysis of daily maximum ozone levels. University of Southampton, Mathematics, Doctoral Thesis, 212pp.

Record type: Thesis (Doctoral)

Abstract

Ground level ozone is one of the six criteria primary pollutants that is monitored by the United States Environmental Protection Agency. Statistical methods are increasingly being used to model ground level ozone concentration data. This thesis is motivated by the need to perform practical data analysis, and to develop methods for modelling of ozone concentration data observed over a vast study region in the eastern United States (US.

For the purposes of analysis, we use two space-time modelling strategies: the dynamic linear models (DLM) and the auto-regressive (AR) models and obtain predictions and forecasts for set aside validation data. These methods are developed under the Bayesian paradigm and MCMC sampling techniques are used to explore the posterior and predictive distributions. Particularly, for analysis, we use a subset data set from the state of New York to illustrate the methods. Both the DLM and AR modelling approaches are compared in detail using the predictive and forecast distributions induced by them. The comparisons are facilitated by a number of theoretical results. These show better properties for the AR models under some conditions, which have been shown to hold for the real life example that we considered. To address the challenge of modelling large dimensional spatio-temporal ozone concentration data, we adopt Gaussian predictive processes (GPP) technique and propose a rich hierarchical spatio-temporal AR model. The important utility of this method lies in the ability to predict the primary ozone standard at any given location for the modelled period from 1997-2006 in the eastern US. Different sensitivity analyses are performed, and, in addition, hold-out data sets are used for model validation. Specifically, this new modelling approach has been illustrated for evaluating meteorologically adjusted trends in the primary ozone standard in the eastern US over the 10 year period. This helps in understanding spatial patterns and trends in ozone levels, which in turn will help in evaluating emission reduction policies that directly affect many industries. Forecasting of ozone levels is also an important problem in air pollution monitoring. We compare different spatio-temporal models for their forecasting abilities. The GPP based models provide the best forecast for set aside validation data. In addition, in this thesis we use computer simulation model output as an explanatory variable for modelling the observed ozone data.

Thus, the proposed methods can also be seen as a spatio-temporal downscaler model for incorporating output from numerical models, where the grid-level output from numerical models is used as a covariate in the point level model for observed data. This type of space and time varying covariate information enriches the regression settings like the methods used in this thesis. Currently there is no package available that can fit space-time environmental data using Bayesian hierarchical spatio-temporal models. In this thesis we, therefore, develop a software package named spTimer in R. The spTimer package with its ability to fit, predict and forecast using a number of Bayesian hierarchical space-time models can be used for modelling a wide variety of large space-time environmental data. This package is built in C language to be computationally efficient. However, this C-code is hidden from the user and the methods can be implemented by anyone familiar with the R language. This thesis can be extended in several ways for example, for multivariate data, for non-Gaussian first stage data, and for data observed in environmental monitoring of stream networks.

Text
Thesis_KS-Bakar.pdf - Other
Download (8MB)

More information

Published date: June 2012
Organisations: University of Southampton, Mathematical Sciences

Identifiers

Local EPrints ID: 340039
URI: http://eprints.soton.ac.uk/id/eprint/340039
PURE UUID: f174b98d-b41d-41f5-9254-61a32c8313cf
ORCID for Sujit K. Sahu: ORCID iD orcid.org/0000-0003-2315-3598

Catalogue record

Date deposited: 03 Oct 2012 15:42
Last modified: 15 Mar 2024 03:05

Export record

Contributors

Author: K.S. Bakar
Thesis advisor: Sujit K. Sahu ORCID iD

Download statistics

Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.

View more statistics

Atom RSS 1.0 RSS 2.0

Contact ePrints Soton: eprints@soton.ac.uk

ePrints Soton supports OAI 2.0 with a base URL of http://eprints.soton.ac.uk/cgi/oai2

This repository has been built using EPrints software, developed at the University of Southampton, but available to everyone to use.

We use cookies to ensure that we give you the best experience on our website. If you continue without changing your settings, we will assume that you are happy to receive cookies on the University of Southampton website.

×