An empirical evaluation of the comprehensibility of decision table, tree and rule based predictive models
An empirical evaluation of the comprehensibility of decision table, tree and rule based predictive models
An important objective of data mining is the development of predictive models. Based on a number of observations, a model is constructed that allows the analysts to provide classifications or predictions for new observations. Currently, most research focuses on improving the accuracy or precision of these models and comparatively little research has been undertaken to increase their comprehensibility to the analyst or end-user. This is mainly due to the subjective nature of ‘comprehensibility’, which depends on many factors outside the model, such as the user's experience and his/her prior knowledge. Despite this influence of the observer, some representation formats are generally considered to be more easily interpretable than others. In this paper, an empirical study is presented which investigates the suitability of a number of alternative representation formats for classification when interpretability is a key requirement. The formats under consideration are decision tables, (binary) decision trees, propositional rules, and oblique rules. An end-user experiment was designed to test the accuracy, response time, and answer confidence for a set of problem-solving tasks involving the former representations. Analysis of the results reveals that decision tables perform significantly better on all three criteria, while post-test voting also reveals a clear preference of users for decision tables in terms of ease of use.
141-154
Huysmans, Johan
0a2bb876-e5bc-42c5-bcf1-18f089a6eec3
Dejaeger, Karel
491728b5-118c-4661-b003-3787037c8f2d
Mues, Christophe
07438e46-bad6-48ba-8f56-f945bc2ff934
Vanthienen, Jan
6f3d818f-0fce-46fa-966b-160e645caf6d
Baesens, Bart
f7c6496b-aa7f-4026-8616-ca61d9e216f0
2011
Huysmans, Johan
0a2bb876-e5bc-42c5-bcf1-18f089a6eec3
Dejaeger, Karel
491728b5-118c-4661-b003-3787037c8f2d
Mues, Christophe
07438e46-bad6-48ba-8f56-f945bc2ff934
Vanthienen, Jan
6f3d818f-0fce-46fa-966b-160e645caf6d
Baesens, Bart
f7c6496b-aa7f-4026-8616-ca61d9e216f0
Huysmans, Johan, Dejaeger, Karel, Mues, Christophe, Vanthienen, Jan and Baesens, Bart
(2011)
An empirical evaluation of the comprehensibility of decision table, tree and rule based predictive models.
Decision Support Systems, 51 (1), .
(doi:10.1016/j.dss.2010.12.003).
Abstract
An important objective of data mining is the development of predictive models. Based on a number of observations, a model is constructed that allows the analysts to provide classifications or predictions for new observations. Currently, most research focuses on improving the accuracy or precision of these models and comparatively little research has been undertaken to increase their comprehensibility to the analyst or end-user. This is mainly due to the subjective nature of ‘comprehensibility’, which depends on many factors outside the model, such as the user's experience and his/her prior knowledge. Despite this influence of the observer, some representation formats are generally considered to be more easily interpretable than others. In this paper, an empirical study is presented which investigates the suitability of a number of alternative representation formats for classification when interpretability is a key requirement. The formats under consideration are decision tables, (binary) decision trees, propositional rules, and oblique rules. An end-user experiment was designed to test the accuracy, response time, and answer confidence for a set of problem-solving tasks involving the former representations. Analysis of the results reveals that decision tables perform significantly better on all three criteria, while post-test voting also reveals a clear preference of users for decision tables in terms of ease of use.
This record has no associated files available for download.
More information
Published date: 2011
Organisations:
Management
Identifiers
Local EPrints ID: 169001
URI: http://eprints.soton.ac.uk/id/eprint/169001
ISSN: 0167-9236
PURE UUID: a45c01c0-4611-480e-9a89-9c3449aa1bc3
Catalogue record
Date deposited: 16 Dec 2010 15:07
Last modified: 14 Mar 2024 02:49
Export record
Altmetrics
Contributors
Author:
Johan Huysmans
Author:
Karel Dejaeger
Author:
Jan Vanthienen
Download statistics
Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.
View more statistics