Fitting limit lines (envelope curves) to spreads of geoenvironmental data
Fitting limit lines (envelope curves) to spreads of geoenvironmental data
Geoscientists frequently are interested in defining the overall trend in x-y data clouds using techniques such as least squares regression. Yet often the sample data exhibits considerable spread of y-values for given x-values, which is itself of interest. In some cases the data may exhibit a distinct visual upper (or lower) ‘limit’ to a broad spread of y-values for a given x-value, defined by a marked reduction in concentration of y-values. As a function of x-value, the locus of this “limit” defines a “limit line”, with no (or few) points lying above (or below) it. Despite numerous examples of such situations in geoscience, there has been little consideration within the general geoenvironmental literature of methods used to define limit lines (sometimes termed ‘envelope curves’ when they enclose all data of interest). In this work, methods to fit limit lines are reviewed. Many commonly applied methods are ad-hoc and statistically not well founded, often because the data sample available is small and noisy. Other methods are considered which correspond to specific statistical models offering more objective and reproducible estimation. The strengths and weaknesses of methods are considered by application to real geoscience data sets. Wider adoption of statistical models would enhance confidence in the utility of fitted limits and promote statistical developments in limit fitting methodologies which are likely to be transformative in the interpretation of limits. Supplements, a spreadsheet and references to software are provided for ready application by geoscientists.
Limit lines; Envelope curves; Trimming method; Quantile regression; Non-parametric maximum likelihood methods.
Carling, Paul
8d252dd9-3c88-4803-81cc-c2ec4c6fa687
Jonathan, Philip
1b1bcf63-c449-46ae-b5ae-2c5faa623229
Su, Teng
d360de58-24cc-4bf4-b5bf-48dc47dac4e1
Carling, Paul
8d252dd9-3c88-4803-81cc-c2ec4c6fa687
Jonathan, Philip
1b1bcf63-c449-46ae-b5ae-2c5faa623229
Su, Teng
d360de58-24cc-4bf4-b5bf-48dc47dac4e1
Carling, Paul, Jonathan, Philip and Su, Teng
(2021)
Fitting limit lines (envelope curves) to spreads of geoenvironmental data.
Progress in Physical Geography.
(In Press)
Abstract
Geoscientists frequently are interested in defining the overall trend in x-y data clouds using techniques such as least squares regression. Yet often the sample data exhibits considerable spread of y-values for given x-values, which is itself of interest. In some cases the data may exhibit a distinct visual upper (or lower) ‘limit’ to a broad spread of y-values for a given x-value, defined by a marked reduction in concentration of y-values. As a function of x-value, the locus of this “limit” defines a “limit line”, with no (or few) points lying above (or below) it. Despite numerous examples of such situations in geoscience, there has been little consideration within the general geoenvironmental literature of methods used to define limit lines (sometimes termed ‘envelope curves’ when they enclose all data of interest). In this work, methods to fit limit lines are reviewed. Many commonly applied methods are ad-hoc and statistically not well founded, often because the data sample available is small and noisy. Other methods are considered which correspond to specific statistical models offering more objective and reproducible estimation. The strengths and weaknesses of methods are considered by application to real geoscience data sets. Wider adoption of statistical models would enhance confidence in the utility of fitted limits and promote statistical developments in limit fitting methodologies which are likely to be transformative in the interpretation of limits. Supplements, a spreadsheet and references to software are provided for ready application by geoscientists.
Text
Accepted manuscript with figures
- Accepted Manuscript
More information
Accepted/In Press date: 14 October 2021
Keywords:
Limit lines; Envelope curves; Trimming method; Quantile regression; Non-parametric maximum likelihood methods.
Identifiers
Local EPrints ID: 452024
URI: http://eprints.soton.ac.uk/id/eprint/452024
ISSN: 0309-1333
PURE UUID: fb77803d-39af-4f09-9e16-f65bc4623ece
Catalogue record
Date deposited: 09 Nov 2021 17:30
Last modified: 16 Mar 2024 14:24
Export record
Contributors
Author:
Philip Jonathan
Author:
Teng Su
Download statistics
Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.
View more statistics