The University of Southampton
University of Southampton Institutional Repository

Comprehensible credit scoring models using rule extraction from support vector machines

Comprehensible credit scoring models using rule extraction from support vector machines
Comprehensible credit scoring models using rule extraction from support vector machines
In recent years, support vector machines (SVMs) were successfully applied to a wide range of applications. However, since the classifier is described as a complex mathematical function, it is rather incomprehensible for humans. This opacity property prevents them from being used in many real-life applications where both accuracy and comprehensibility are required, such as medical diagnosis and credit risk evaluation. To overcome this limitation, rules can be extracted from the trained SVM that are interpretable by humans and keep as much of the accuracy of the SVM as possible. In this paper, we will provide an overview of the recently proposed rule extraction techniques for SVMs and introduce two others taken from the artificial neural networks domain, being Trepan and G-REX. The described techniques are compared using publicly available datasets, such as Ripley’s synthetic dataset and the multi-class iris dataset. We will also look at medical diagnosis and credit scoring where comprehensibility is a key requirement and even a regulatory recommendation. Our experiments show that the SVM rule extraction techniques lose only a small percentage in performance compared to SVMs and therefore rank at the top of comprehensible classification techniques.
credit scoring, classification, support vector machine, rule extraction
0377-2217
1466-1476
Martens, David
42e7e141-fb3d-4ead-8e3a-96b39bab65f9
Baesens, Bart
f7c6496b-aa7f-4026-8616-ca61d9e216f0
Van Gestel, Tony
e917bd96-d291-4132-958b-e54cb1b9eaf9
Vanthienen, Jan
6f3d818f-0fce-46fa-966b-160e645caf6d
Martens, David
42e7e141-fb3d-4ead-8e3a-96b39bab65f9
Baesens, Bart
f7c6496b-aa7f-4026-8616-ca61d9e216f0
Van Gestel, Tony
e917bd96-d291-4132-958b-e54cb1b9eaf9
Vanthienen, Jan
6f3d818f-0fce-46fa-966b-160e645caf6d

Martens, David, Baesens, Bart, Van Gestel, Tony and Vanthienen, Jan (2007) Comprehensible credit scoring models using rule extraction from support vector machines. European Journal of Operational Research, 183 (3), 1466-1476. (doi:10.1016/j.ejor.2006.04.051).

Record type: Article

Abstract

In recent years, support vector machines (SVMs) were successfully applied to a wide range of applications. However, since the classifier is described as a complex mathematical function, it is rather incomprehensible for humans. This opacity property prevents them from being used in many real-life applications where both accuracy and comprehensibility are required, such as medical diagnosis and credit risk evaluation. To overcome this limitation, rules can be extracted from the trained SVM that are interpretable by humans and keep as much of the accuracy of the SVM as possible. In this paper, we will provide an overview of the recently proposed rule extraction techniques for SVMs and introduce two others taken from the artificial neural networks domain, being Trepan and G-REX. The described techniques are compared using publicly available datasets, such as Ripley’s synthetic dataset and the multi-class iris dataset. We will also look at medical diagnosis and credit scoring where comprehensibility is a key requirement and even a regulatory recommendation. Our experiments show that the SVM rule extraction techniques lose only a small percentage in performance compared to SVMs and therefore rank at the top of comprehensible classification techniques.

This record has no associated files available for download.

More information

Published date: 16 December 2007
Keywords: credit scoring, classification, support vector machine, rule extraction

Identifiers

Local EPrints ID: 51706
URI: http://eprints.soton.ac.uk/id/eprint/51706
ISSN: 0377-2217
PURE UUID: 04bfd76d-c655-4702-8fae-bef4fe0b26f1
ORCID for Bart Baesens: ORCID iD orcid.org/0000-0002-5831-5668

Catalogue record

Date deposited: 06 Jun 2008
Last modified: 16 Mar 2024 03:39

Export record

Altmetrics

Contributors

Author: David Martens
Author: Bart Baesens ORCID iD
Author: Tony Van Gestel
Author: Jan Vanthienen

Download statistics

Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.

View more statistics

Atom RSS 1.0 RSS 2.0

Contact ePrints Soton: eprints@soton.ac.uk

ePrints Soton supports OAI 2.0 with a base URL of http://eprints.soton.ac.uk/cgi/oai2

This repository has been built using EPrints software, developed at the University of Southampton, but available to everyone to use.

We use cookies to ensure that we give you the best experience on our website. If you continue without changing your settings, we will assume that you are happy to receive cookies on the University of Southampton website.

×