The University of Southampton
University of Southampton Institutional Repository

An empirical evaluation of the comprehensibility of decision table, tree and rule based predictive models

An empirical evaluation of the comprehensibility of decision table, tree and rule based predictive models
An empirical evaluation of the comprehensibility of decision table, tree and rule based predictive models
An important objective of data mining is the development of predictive models. Based on a number of observations, a model is constructed that allows the analysts to provide classifications or predictions for new observations. Currently, most research focuses on improving the accuracy or precision of these models and comparatively little research has been undertaken to increase their comprehensibility to the analyst or end-user. This is mainly due to the subjective nature of ‘comprehensibility’, which depends on many factors outside the model, such as the user's experience and his/her prior knowledge. Despite this influence of the observer, some representation formats are generally considered to be more easily interpretable than others. In this paper, an empirical study is presented which investigates the suitability of a number of alternative representation formats for classification when interpretability is a key requirement. The formats under consideration are decision tables, (binary) decision trees, propositional rules, and oblique rules. An end-user experiment was designed to test the accuracy, response time, and answer confidence for a set of problem-solving tasks involving the former representations. Analysis of the results reveals that decision tables perform significantly better on all three criteria, while post-test voting also reveals a clear preference of users for decision tables in terms of ease of use.

0167-9236
141-154
Huysmans, Johan
0a2bb876-e5bc-42c5-bcf1-18f089a6eec3
Dejaeger, Karel
491728b5-118c-4661-b003-3787037c8f2d
Mues, Christophe
07438e46-bad6-48ba-8f56-f945bc2ff934
Vanthienen, Jan
6f3d818f-0fce-46fa-966b-160e645caf6d
Baesens, Bart
f7c6496b-aa7f-4026-8616-ca61d9e216f0
Huysmans, Johan
0a2bb876-e5bc-42c5-bcf1-18f089a6eec3
Dejaeger, Karel
491728b5-118c-4661-b003-3787037c8f2d
Mues, Christophe
07438e46-bad6-48ba-8f56-f945bc2ff934
Vanthienen, Jan
6f3d818f-0fce-46fa-966b-160e645caf6d
Baesens, Bart
f7c6496b-aa7f-4026-8616-ca61d9e216f0

Huysmans, Johan, Dejaeger, Karel, Mues, Christophe, Vanthienen, Jan and Baesens, Bart (2011) An empirical evaluation of the comprehensibility of decision table, tree and rule based predictive models. Decision Support Systems, 51 (1), 141-154. (doi:10.1016/j.dss.2010.12.003).

Record type: Article

Abstract

An important objective of data mining is the development of predictive models. Based on a number of observations, a model is constructed that allows the analysts to provide classifications or predictions for new observations. Currently, most research focuses on improving the accuracy or precision of these models and comparatively little research has been undertaken to increase their comprehensibility to the analyst or end-user. This is mainly due to the subjective nature of ‘comprehensibility’, which depends on many factors outside the model, such as the user's experience and his/her prior knowledge. Despite this influence of the observer, some representation formats are generally considered to be more easily interpretable than others. In this paper, an empirical study is presented which investigates the suitability of a number of alternative representation formats for classification when interpretability is a key requirement. The formats under consideration are decision tables, (binary) decision trees, propositional rules, and oblique rules. An end-user experiment was designed to test the accuracy, response time, and answer confidence for a set of problem-solving tasks involving the former representations. Analysis of the results reveals that decision tables perform significantly better on all three criteria, while post-test voting also reveals a clear preference of users for decision tables in terms of ease of use.

This record has no associated files available for download.

More information

Published date: 2011
Organisations: Management

Identifiers

Local EPrints ID: 169001
URI: http://eprints.soton.ac.uk/id/eprint/169001
ISSN: 0167-9236
PURE UUID: a45c01c0-4611-480e-9a89-9c3449aa1bc3
ORCID for Christophe Mues: ORCID iD orcid.org/0000-0002-6289-5490
ORCID for Bart Baesens: ORCID iD orcid.org/0000-0002-5831-5668

Catalogue record

Date deposited: 16 Dec 2010 15:07
Last modified: 14 Mar 2024 02:49

Export record

Altmetrics

Contributors

Author: Johan Huysmans
Author: Karel Dejaeger
Author: Christophe Mues ORCID iD
Author: Jan Vanthienen
Author: Bart Baesens ORCID iD

Download statistics

Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.

View more statistics

Atom RSS 1.0 RSS 2.0

Contact ePrints Soton: eprints@soton.ac.uk

ePrints Soton supports OAI 2.0 with a base URL of http://eprints.soton.ac.uk/cgi/oai2

This repository has been built using EPrints software, developed at the University of Southampton, but available to everyone to use.

We use cookies to ensure that we give you the best experience on our website. If you continue without changing your settings, we will assume that you are happy to receive cookies on the University of Southampton website.

×