Fitting insurance and economic data with outliers: a flexible approach based on finite mixtures of contaminated gamma distributions
Fitting insurance and economic data with outliers: a flexible approach based on finite mixtures of contaminated gamma distributions
Insurance and economic data are frequently characterized by positivity, skewness, leptokurtosis, and multi-modality; although many parametric models have been used in the literature, often these peculiarities call for more flexible approaches. Here, we propose a finite mixture of contaminated gamma distributions that provides a better characterization of data. It is placed in between parametric and non-parametric density estimation and strikes a balance between these alternatives, as a large class of densities can be implemented. We adopt a maximum likelihood approach to estimate the model parameters, providing the likelihood and the expected-maximization algorithm implemented to estimate all unknown parameters. We apply our approach to an artificial dataset and to two well-known datasets as the workers compensation data and the healthcare expenditure data taken from the medical expenditure panel survey. The Value-at-Risk is evaluated and comparisons with other benchmark models are provided.
Contaminated distributions, finite mixtures, gamma distribution, insurance claims, robust estimation
1-22
Punzo, Antonio
1138a0c8-cc0b-4f02-8409-957de3bd1fed
Mazza, Angelo
01e61663-b39b-452e-a076-475a50fa0af6
Maruotti, Antonello
7096256c-fa1b-4cc1-9ca4-1a60cc3ee12e
Punzo, Antonio
1138a0c8-cc0b-4f02-8409-957de3bd1fed
Mazza, Angelo
01e61663-b39b-452e-a076-475a50fa0af6
Maruotti, Antonello
7096256c-fa1b-4cc1-9ca4-1a60cc3ee12e
Punzo, Antonio, Mazza, Angelo and Maruotti, Antonello
(2018)
Fitting insurance and economic data with outliers: a flexible approach based on finite mixtures of contaminated gamma distributions.
Journal of Applied Statistics, .
(doi:10.1080/02664763.2018.1428288).
Abstract
Insurance and economic data are frequently characterized by positivity, skewness, leptokurtosis, and multi-modality; although many parametric models have been used in the literature, often these peculiarities call for more flexible approaches. Here, we propose a finite mixture of contaminated gamma distributions that provides a better characterization of data. It is placed in between parametric and non-parametric density estimation and strikes a balance between these alternatives, as a large class of densities can be implemented. We adopt a maximum likelihood approach to estimate the model parameters, providing the likelihood and the expected-maximization algorithm implemented to estimate all unknown parameters. We apply our approach to an artificial dataset and to two well-known datasets as the workers compensation data and the healthcare expenditure data taken from the medical expenditure panel survey. The Value-at-Risk is evaluated and comparisons with other benchmark models are provided.
This record has no associated files available for download.
More information
Accepted/In Press date: 10 January 2018
e-pub ahead of print date: 28 January 2018
Keywords:
Contaminated distributions, finite mixtures, gamma distribution, insurance claims, robust estimation
Identifiers
Local EPrints ID: 417949
URI: http://eprints.soton.ac.uk/id/eprint/417949
ISSN: 0266-4763
PURE UUID: 98616a1c-41a6-4368-ad43-b1abb55037f2
Catalogue record
Date deposited: 19 Feb 2018 17:30
Last modified: 15 Mar 2024 18:26
Export record
Altmetrics
Contributors
Author:
Antonio Punzo
Author:
Angelo Mazza
Author:
Antonello Maruotti
Download statistics
Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.
View more statistics