Benchmarking classification models for software defect prediction: a proposed framework and novel findings
Benchmarking classification models for software defect prediction: a proposed framework and novel findings
Software defect prediction strives to improve software quality and testing efficiency by constructing predictive classification models from code attributes to enable a timely identification of fault-prone modules. Several classification models have been evaluated for this task. However, due to inconsistent findings regarding the superiority of one classifier over another and the usefulness of metric-based classification in general, more research is needed to improve convergence across studies and further advance confidence in experimental results. We consider three potential sources for bias: comparing classifiers over one or a small number of proprietary datasets, relying on accuracy indicators that are conceptually inappropriate for software defect prediction and cross-study comparisons, and finally, limited use of statistical testing procedures to secure empirical findings. To remedy these problems, a framework for comparative software defect prediction experiments is proposed and applied in a large-scale empirical comparison of 22 classifiers over ten public domain datasets from the NASA Metrics Data repository. Our results indicate that the importance of the particular classification algorithm may have been overestimated in previous research since no significant performance differences could be detected among the top-17 classifiers.
complexity measures, data mining, formal methods, statistical methods
485-496
Lessmann, Stefan
3b9f8133-67bb-4bcc-9183-e1a5db294b01
Baesens, Bart
f7c6496b-aa7f-4026-8616-ca61d9e216f0
Mues, Christophe
07438e46-bad6-48ba-8f56-f945bc2ff934
Pietsch, Swantje
606014c4-78a5-4686-b614-79d748abb241
July 2008
Lessmann, Stefan
3b9f8133-67bb-4bcc-9183-e1a5db294b01
Baesens, Bart
f7c6496b-aa7f-4026-8616-ca61d9e216f0
Mues, Christophe
07438e46-bad6-48ba-8f56-f945bc2ff934
Pietsch, Swantje
606014c4-78a5-4686-b614-79d748abb241
Lessmann, Stefan, Baesens, Bart, Mues, Christophe and Pietsch, Swantje
(2008)
Benchmarking classification models for software defect prediction: a proposed framework and novel findings.
IEEE Transactions on Software Engineering, 34 (4), .
(doi:10.1109/TSE.2008.35).
Abstract
Software defect prediction strives to improve software quality and testing efficiency by constructing predictive classification models from code attributes to enable a timely identification of fault-prone modules. Several classification models have been evaluated for this task. However, due to inconsistent findings regarding the superiority of one classifier over another and the usefulness of metric-based classification in general, more research is needed to improve convergence across studies and further advance confidence in experimental results. We consider three potential sources for bias: comparing classifiers over one or a small number of proprietary datasets, relying on accuracy indicators that are conceptually inappropriate for software defect prediction and cross-study comparisons, and finally, limited use of statistical testing procedures to secure empirical findings. To remedy these problems, a framework for comparative software defect prediction experiments is proposed and applied in a large-scale empirical comparison of 22 classifiers over ten public domain datasets from the NASA Metrics Data repository. Our results indicate that the importance of the particular classification algorithm may have been overestimated in previous research since no significant performance differences could be detected among the top-17 classifiers.
This record has no associated files available for download.
More information
e-pub ahead of print date: 23 May 2008
Published date: July 2008
Keywords:
complexity measures, data mining, formal methods, statistical methods
Organisations:
Management
Identifiers
Local EPrints ID: 63006
URI: http://eprints.soton.ac.uk/id/eprint/63006
PURE UUID: bb7eb90c-683c-48b2-aa8f-ffbc3b288fd8
Catalogue record
Date deposited: 14 Oct 2008
Last modified: 16 Mar 2024 03:40
Export record
Altmetrics
Contributors
Author:
Stefan Lessmann
Author:
Swantje Pietsch
Download statistics
Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.
View more statistics