Calibrated computation-aware Gaussian processes
Calibrated computation-aware Gaussian processes
Gaussian processes are notorious for scaling cubically with the size of the training set, preventing application to very large regression problems. Computation-aware Gaussian processes (CAGPs) tackle this scaling issue by exploiting probabilistic linear solvers to reduce complexity, widening the posterior with additional computational uncertainty due to reduced computation. However, the most commonly used CAGP framework results in (sometimes dramatically) conservative uncertainty quantification, making the posterior unrealistic in practice. In this work, we prove that if the utilised probabilistic linear solver is calibrated, in a rigorous statistical sense, then so too is the induced CAGP. We thus propose a new CAGP framework, CAGP-GS, based on using Gauss-Seidel iterations for the underlying probabilistic linear solver. CAGP-GS performs favourably compared to existing approaches when the test set is low-dimensional and few iterations are performed. We test the calibratedness on a synthetic problem, and compare the performance to existing approaches on a large-scale global temperature regression problem.
stat.ML, cs.LG, cs.NA, math.NA
Hegde, Disha
5e7d8e1b-5b2a-4828-9e49-42e9e94c9725
Adil, Mohamed
07685c7f-be20-416e-af93-184f4d2ee3d5
Cockayne, Jon
da87c8b2-fafb-4856-938d-50be8f0e4a5b
11 October 2024
Hegde, Disha
5e7d8e1b-5b2a-4828-9e49-42e9e94c9725
Adil, Mohamed
07685c7f-be20-416e-af93-184f4d2ee3d5
Cockayne, Jon
da87c8b2-fafb-4856-938d-50be8f0e4a5b
[Unknown type: UNSPECIFIED]
Abstract
Gaussian processes are notorious for scaling cubically with the size of the training set, preventing application to very large regression problems. Computation-aware Gaussian processes (CAGPs) tackle this scaling issue by exploiting probabilistic linear solvers to reduce complexity, widening the posterior with additional computational uncertainty due to reduced computation. However, the most commonly used CAGP framework results in (sometimes dramatically) conservative uncertainty quantification, making the posterior unrealistic in practice. In this work, we prove that if the utilised probabilistic linear solver is calibrated, in a rigorous statistical sense, then so too is the induced CAGP. We thus propose a new CAGP framework, CAGP-GS, based on using Gauss-Seidel iterations for the underlying probabilistic linear solver. CAGP-GS performs favourably compared to existing approaches when the test set is low-dimensional and few iterations are performed. We test the calibratedness on a synthetic problem, and compare the performance to existing approaches on a large-scale global temperature regression problem.
Text
2410.08796v1
- Author's Original
More information
Published date: 11 October 2024
Keywords:
stat.ML, cs.LG, cs.NA, math.NA
Identifiers
Local EPrints ID: 496515
URI: http://eprints.soton.ac.uk/id/eprint/496515
PURE UUID: 918d4ae4-cedd-4693-aec7-b89a04f279ea
Catalogue record
Date deposited: 17 Dec 2024 17:41
Last modified: 18 Dec 2024 03:13
Export record
Altmetrics
Contributors
Author:
Disha Hegde
Author:
Mohamed Adil
Download statistics
Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.
View more statistics