A robust profit measure for binary classification model evaluation
A robust profit measure for binary classification model evaluation
Using profit-based evaluation measures is a necessity in business-oriented contexts, as they aid companies in making cost-optimal decisions. Among the measures that effectively include the true nature of costs and benefits in binary classification, the expected maximum profit (EMP) has been used successfully for churn prediction and credit scoring, and defined in general for binary classification problems. However, despite its competitive results against the most frequently used measures, the EMP relies on a fixed probability distribution of costs and benefits, the range of which in real applications is not entirely known. In this paper, we propose to extend this measure by adding random shocks to these distributions. We call this new measure the R-EMP, following the convention of the analogous EMP measure. Our metric adds a stochastic component to each point of the cost-benefit distributions, assuming that costs and benefits have a fixed probability, but its distribution range is subject to an external shock, which can be different for each cost or benefit. The experimental set-up is focused on a credit scoring application using a dataset of a Chilean financial institution, with the attribute selection for a logistic regression being accomplished using the AUC, EMP, H-measure, and R-EMP as the selection criteria. The results indicate that the R-EMP measure is the most robust metric for achieving the greatest profit for the company under uncertain external conditions.
Supervised Binary Classification, Business analytics, Performance measures, Profit-driven analytics
154-160
Garrido, Franco
4f863fe0-af98-4550-96ca-405d3c5827e9
Verbeke, Wouter
6805dfd7-ecc9-4746-8e64-dd7eee6e3e3e
Bravo, Cristian
b22c4145-644e-40ee-85d8-431c59c3c71b
February 2018
Garrido, Franco
4f863fe0-af98-4550-96ca-405d3c5827e9
Verbeke, Wouter
6805dfd7-ecc9-4746-8e64-dd7eee6e3e3e
Bravo, Cristian
b22c4145-644e-40ee-85d8-431c59c3c71b
Garrido, Franco, Verbeke, Wouter and Bravo, Cristian
(2018)
A robust profit measure for binary classification model evaluation.
Expert Systems with Applications, 92, .
(doi:10.1016/j.eswa.2017.09.045).
Abstract
Using profit-based evaluation measures is a necessity in business-oriented contexts, as they aid companies in making cost-optimal decisions. Among the measures that effectively include the true nature of costs and benefits in binary classification, the expected maximum profit (EMP) has been used successfully for churn prediction and credit scoring, and defined in general for binary classification problems. However, despite its competitive results against the most frequently used measures, the EMP relies on a fixed probability distribution of costs and benefits, the range of which in real applications is not entirely known. In this paper, we propose to extend this measure by adding random shocks to these distributions. We call this new measure the R-EMP, following the convention of the analogous EMP measure. Our metric adds a stochastic component to each point of the cost-benefit distributions, assuming that costs and benefits have a fixed probability, but its distribution range is subject to an external shock, which can be different for each cost or benefit. The experimental set-up is focused on a credit scoring application using a dataset of a Chilean financial institution, with the attribute selection for a logistic regression being accomplished using the AUC, EMP, H-measure, and R-EMP as the selection criteria. The results indicate that the R-EMP measure is the most robust metric for achieving the greatest profit for the company under uncertain external conditions.
Text
REMP_Self_Archive
- Accepted Manuscript
More information
Accepted/In Press date: 17 September 2017
e-pub ahead of print date: 19 September 2017
Published date: February 2018
Keywords:
Supervised Binary Classification, Business analytics, Performance measures, Profit-driven analytics
Identifiers
Local EPrints ID: 414261
URI: http://eprints.soton.ac.uk/id/eprint/414261
ISSN: 0957-4174
PURE UUID: 3065eba8-e14a-4992-a9a3-87e527d1d09f
Catalogue record
Date deposited: 21 Sep 2017 16:31
Last modified: 16 Mar 2024 05:45
Export record
Altmetrics
Contributors
Author:
Franco Garrido
Author:
Wouter Verbeke
Download statistics
Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.
View more statistics