Decision diagrams in machine learning: an empirical study on real-life credit-risk data
Decision diagrams in machine learning: an empirical study on real-life credit-risk data
Decision trees are a widely used knowledge representation in machine learning. However, one of their main drawbacks is the inherent replication of isomorphic subtrees, as a result of which the produced classifiers might become too large to be comprehensible by the human experts that have to validate them. Alternatively, decision diagrams, a generalization of decision trees taking on the form of a rooted, acyclic digraph instead of a tree, have occasionally been suggested as a potentially more compact representation.
Their application in machine learning has nonetheless been criticized, because the theoretical size advantages of subgraph sharing did not always directly materialize in the relatively scarce reported experiments on real-world data. Therefore, in this paper, starting from a series of rule sets extracted from three real-life credit-scoring data sets, we will empirically assess to what extent decision diagrams are able to provide a compact visual description. Furthermore, we will investigate the practical impact of finding a good attribute ordering on the achieved size savings.
decision diagrams, data mining, credit scoring
257-264
Mues, Christophe
07438e46-bad6-48ba-8f56-f945bc2ff934
Baesens, Bart
f7c6496b-aa7f-4026-8616-ca61d9e216f0
Files, Craig M.
0de9b5d6-01f0-4d78-b318-a039f400612a
Vanthienen, Jan
6f3d818f-0fce-46fa-966b-160e645caf6d
2004
Mues, Christophe
07438e46-bad6-48ba-8f56-f945bc2ff934
Baesens, Bart
f7c6496b-aa7f-4026-8616-ca61d9e216f0
Files, Craig M.
0de9b5d6-01f0-4d78-b318-a039f400612a
Vanthienen, Jan
6f3d818f-0fce-46fa-966b-160e645caf6d
Mues, Christophe, Baesens, Bart, Files, Craig M. and Vanthienen, Jan
(2004)
Decision diagrams in machine learning: an empirical study on real-life credit-risk data.
Expert Systems with Applications, 27 (2), .
(doi:10.1016/j.eswa.2004.02.001).
Abstract
Decision trees are a widely used knowledge representation in machine learning. However, one of their main drawbacks is the inherent replication of isomorphic subtrees, as a result of which the produced classifiers might become too large to be comprehensible by the human experts that have to validate them. Alternatively, decision diagrams, a generalization of decision trees taking on the form of a rooted, acyclic digraph instead of a tree, have occasionally been suggested as a potentially more compact representation.
Their application in machine learning has nonetheless been criticized, because the theoretical size advantages of subgraph sharing did not always directly materialize in the relatively scarce reported experiments on real-world data. Therefore, in this paper, starting from a series of rule sets extracted from three real-life credit-scoring data sets, we will empirically assess to what extent decision diagrams are able to provide a compact visual description. Furthermore, we will investigate the practical impact of finding a good attribute ordering on the achieved size savings.
This record has no associated files available for download.
More information
Published date: 2004
Keywords:
decision diagrams, data mining, credit scoring
Identifiers
Local EPrints ID: 35973
URI: http://eprints.soton.ac.uk/id/eprint/35973
ISSN: 0957-4174
PURE UUID: 82b88cb5-4c9d-4e8c-88a5-39d2c2556340
Catalogue record
Date deposited: 23 May 2006
Last modified: 16 Mar 2024 03:40
Export record
Altmetrics
Contributors
Author:
Craig M. Files
Author:
Jan Vanthienen
Download statistics
Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.
View more statistics