Domain knowledge integration in data mining using decision tables: case studies in churn prediction
Domain knowledge integration in data mining using decision tables: case studies in churn prediction
Companies' interest in customer relationship modelling and key issues such as customer lifetime value and churn has substantially increased over the years. However, the complexity of building, interpreting and applying these models creates obstacles for their implementation. The main contribution of this paper is to show how domain knowledge can be incorporated in the data mining process for churn prediction, viz. through the evaluation of coefficient signs in a logistic regression model, and secondly, by analysing a decision table (DT) extracted from a decision tree or rule-based classifier. An algorithm to check DTs for violations of monotonicity constraints is presented, which involves the repeated application of condition reordering and table contraction to detect counter-intuitive patterns. Both approaches are applied to two telecom data sets to empirically demonstrate how domain knowledge can be used to ensure the interpretability of the resulting models.
domain knowledge, data mining, churn, decision tables
1096-1106
Lima, E.
238ecc65-2bc4-4826-91d9-b712baaf0cd2
Mues, C.
07438e46-bad6-48ba-8f56-f945bc2ff934
Baesens, B.
f7c6496b-aa7f-4026-8616-ca61d9e216f0
29 August 2009
Lima, E.
238ecc65-2bc4-4826-91d9-b712baaf0cd2
Mues, C.
07438e46-bad6-48ba-8f56-f945bc2ff934
Baesens, B.
f7c6496b-aa7f-4026-8616-ca61d9e216f0
Lima, E., Mues, C. and Baesens, B.
(2009)
Domain knowledge integration in data mining using decision tables: case studies in churn prediction.
[in special issue: Data Mining and Operational Research: Techniques and Applications]
Journal of the Operational Research Society, 60 (8), .
(doi:10.1057/jors.2008.161).
Abstract
Companies' interest in customer relationship modelling and key issues such as customer lifetime value and churn has substantially increased over the years. However, the complexity of building, interpreting and applying these models creates obstacles for their implementation. The main contribution of this paper is to show how domain knowledge can be incorporated in the data mining process for churn prediction, viz. through the evaluation of coefficient signs in a logistic regression model, and secondly, by analysing a decision table (DT) extracted from a decision tree or rule-based classifier. An algorithm to check DTs for violations of monotonicity constraints is presented, which involves the repeated application of condition reordering and table contraction to detect counter-intuitive patterns. Both approaches are applied to two telecom data sets to empirically demonstrate how domain knowledge can be used to ensure the interpretability of the resulting models.
This record has no associated files available for download.
More information
e-pub ahead of print date: 18 February 2009
Published date: 29 August 2009
Keywords:
domain knowledge, data mining, churn, decision tables
Organisations:
Management
Identifiers
Local EPrints ID: 152557
URI: http://eprints.soton.ac.uk/id/eprint/152557
ISSN: 0160-5682
PURE UUID: a4cb245d-8a66-4b45-8a24-99363bf91a56
Catalogue record
Date deposited: 14 May 2010 15:10
Last modified: 14 Mar 2024 02:49
Export record
Altmetrics
Contributors
Author:
E. Lima
Download statistics
Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.
View more statistics