From linear to non-linear kernel based classifiers for bankruptcy prediction
From linear to non-linear kernel based classifiers for bankruptcy prediction
Bankruptcy prediction has been a topic of research for decades, both within the financial and the academic world. The implementations of international financial and accounting standards, such as Basel II and IFRS, as well as the recent credit crisis, have accentuated this topic even further. This paper describes both regularized and non-linear kernel variants of traditional discriminant analysis techniques, such as logistic regression, Fisher discriminant analysis (FDA) and quadratic discriminant analysis (QDA). Next to a systematic description of these variants, we contribute to the literature by introducing kernel QDA and providing a comprehensive benchmarking study of these classification techniques and their regularized and kernel versions for bankruptcy prediction using 10 real-life data sets. Performance is compared in terms of binary classification accuracy, relevant for evaluating yes/no credit decisions and in terms of classification accuracy, relevant for pricing differentiated credit granting. The results clearly indicate the significant improvement for kernel variants in both percentage correctly classified (PCC) test instances and area under the ROC curve (AUC), and indicate that bankruptcy problems are weakly non-linear. On average, the best performance is achieved by LSSVM, closely followed by kernel quadratic discriminant analysis. Given the high impact of small improvements in performance, we show the relevance and importance of considering kernel techniques within this setting. Further experiments with backwards input selection improve our results even further. Finally, we experimentally investigate the relative ranking of the different categories of variables: liquidity, solvency, profitability and various, and as such provide new insights into the relative importance of these categories for predicting financial distress.
kernel classifiers, least squares-support vector machine, bankruptcy prediction
2955-2970
Van Gestel, Tony
e917bd96-d291-4132-958b-e54cb1b9eaf9
Baesens, Bart
f7c6496b-aa7f-4026-8616-ca61d9e216f0
Martens, David
42e7e141-fb3d-4ead-8e3a-96b39bab65f9
October 2010
Van Gestel, Tony
e917bd96-d291-4132-958b-e54cb1b9eaf9
Baesens, Bart
f7c6496b-aa7f-4026-8616-ca61d9e216f0
Martens, David
42e7e141-fb3d-4ead-8e3a-96b39bab65f9
Van Gestel, Tony, Baesens, Bart and Martens, David
(2010)
From linear to non-linear kernel based classifiers for bankruptcy prediction.
[in special issue: 10th Brazilian Symposium on Neural Networks (SBRN2008)]
Neurocomputing, 73 (16-18), .
(doi:10.1016/j.neucom.2010.07.002).
Abstract
Bankruptcy prediction has been a topic of research for decades, both within the financial and the academic world. The implementations of international financial and accounting standards, such as Basel II and IFRS, as well as the recent credit crisis, have accentuated this topic even further. This paper describes both regularized and non-linear kernel variants of traditional discriminant analysis techniques, such as logistic regression, Fisher discriminant analysis (FDA) and quadratic discriminant analysis (QDA). Next to a systematic description of these variants, we contribute to the literature by introducing kernel QDA and providing a comprehensive benchmarking study of these classification techniques and their regularized and kernel versions for bankruptcy prediction using 10 real-life data sets. Performance is compared in terms of binary classification accuracy, relevant for evaluating yes/no credit decisions and in terms of classification accuracy, relevant for pricing differentiated credit granting. The results clearly indicate the significant improvement for kernel variants in both percentage correctly classified (PCC) test instances and area under the ROC curve (AUC), and indicate that bankruptcy problems are weakly non-linear. On average, the best performance is achieved by LSSVM, closely followed by kernel quadratic discriminant analysis. Given the high impact of small improvements in performance, we show the relevance and importance of considering kernel techniques within this setting. Further experiments with backwards input selection improve our results even further. Finally, we experimentally investigate the relative ranking of the different categories of variables: liquidity, solvency, profitability and various, and as such provide new insights into the relative importance of these categories for predicting financial distress.
This record has no associated files available for download.
More information
Published date: October 2010
Keywords:
kernel classifiers, least squares-support vector machine, bankruptcy prediction
Organisations:
Southampton Business School
Identifiers
Local EPrints ID: 336477
URI: http://eprints.soton.ac.uk/id/eprint/336477
ISSN: 0925-2312
PURE UUID: 2575a715-dd82-4df5-9f7a-eef595384e6e
Catalogue record
Date deposited: 27 Mar 2012 13:01
Last modified: 15 Mar 2024 03:20
Export record
Altmetrics
Contributors
Author:
Tony Van Gestel
Author:
David Martens
Download statistics
Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.
View more statistics