The University of Southampton
University of Southampton Institutional Repository

Domain knowledge integration in data mining using decision tables: case studies in churn prediction

Domain knowledge integration in data mining using decision tables: case studies in churn prediction
Domain knowledge integration in data mining using decision tables: case studies in churn prediction
Companies' interest in customer relationship modelling and key issues such as customer lifetime value and churn has substantially increased over the years. However, the complexity of building, interpreting and applying these models creates obstacles for their implementation. The main contribution of this paper is to show how domain knowledge can be incorporated in the data mining process for churn prediction, viz. through the evaluation of coefficient signs in a logistic regression model, and secondly, by analysing a decision table (DT) extracted from a decision tree or rule-based classifier. An algorithm to check DTs for violations of monotonicity constraints is presented, which involves the repeated application of condition reordering and table contraction to detect counter-intuitive patterns. Both approaches are applied to two telecom data sets to empirically demonstrate how domain knowledge can be used to ensure the interpretability of the resulting models.
domain knowledge, data mining, churn, decision tables
0160-5682
1096-1106
Lima, E.
238ecc65-2bc4-4826-91d9-b712baaf0cd2
Mues, C.
07438e46-bad6-48ba-8f56-f945bc2ff934
Baesens, B.
f7c6496b-aa7f-4026-8616-ca61d9e216f0
Lima, E.
238ecc65-2bc4-4826-91d9-b712baaf0cd2
Mues, C.
07438e46-bad6-48ba-8f56-f945bc2ff934
Baesens, B.
f7c6496b-aa7f-4026-8616-ca61d9e216f0

Lima, E., Mues, C. and Baesens, B. (2009) Domain knowledge integration in data mining using decision tables: case studies in churn prediction. [in special issue: Data Mining and Operational Research: Techniques and Applications] Journal of the Operational Research Society, 60 (8), 1096-1106. (doi:10.1057/jors.2008.161).

Record type: Article

Abstract

Companies' interest in customer relationship modelling and key issues such as customer lifetime value and churn has substantially increased over the years. However, the complexity of building, interpreting and applying these models creates obstacles for their implementation. The main contribution of this paper is to show how domain knowledge can be incorporated in the data mining process for churn prediction, viz. through the evaluation of coefficient signs in a logistic regression model, and secondly, by analysing a decision table (DT) extracted from a decision tree or rule-based classifier. An algorithm to check DTs for violations of monotonicity constraints is presented, which involves the repeated application of condition reordering and table contraction to detect counter-intuitive patterns. Both approaches are applied to two telecom data sets to empirically demonstrate how domain knowledge can be used to ensure the interpretability of the resulting models.

This record has no associated files available for download.

More information

e-pub ahead of print date: 18 February 2009
Published date: 29 August 2009
Keywords: domain knowledge, data mining, churn, decision tables
Organisations: Management

Identifiers

Local EPrints ID: 152557
URI: http://eprints.soton.ac.uk/id/eprint/152557
ISSN: 0160-5682
PURE UUID: a4cb245d-8a66-4b45-8a24-99363bf91a56
ORCID for C. Mues: ORCID iD orcid.org/0000-0002-6289-5490
ORCID for B. Baesens: ORCID iD orcid.org/0000-0002-5831-5668

Catalogue record

Date deposited: 14 May 2010 15:10
Last modified: 14 Mar 2024 02:49

Export record

Altmetrics

Contributors

Author: E. Lima
Author: C. Mues ORCID iD
Author: B. Baesens ORCID iD

Download statistics

Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.

View more statistics

Atom RSS 1.0 RSS 2.0

Contact ePrints Soton: eprints@soton.ac.uk

ePrints Soton supports OAI 2.0 with a base URL of http://eprints.soton.ac.uk/cgi/oai2

This repository has been built using EPrints software, developed at the University of Southampton, but available to everyone to use.

We use cookies to ensure that we give you the best experience on our website. If you continue without changing your settings, we will assume that you are happy to receive cookies on the University of Southampton website.

×