Probability density function estimation based over-sampling for imbalanced two-class problems
Probability density function estimation based over-sampling for imbalanced two-class problems
A novel probability density function (PDF) estimation based over-sampling approach is proposed for two-class imbalanced classification problems. The Parzen-window kernel function is applied to estimate the PDF of the positive class, from which synthetic instances are generated as additional training data to re-balance the class distribution. Utilising the re-balanced over-sampled training data, a radial basis function (RBF) classifier is constructed by applying an orthogonal forward regression, in which the classifier's structure and the parameters of RBF kernels are determined using a particle swarm optimisation algorithm based on the criterion of minimising the leave-one-out misclassification rate. The effectiveness of the proposed approach is demonstrated by an empirical study on several imbalanced data sets.
Gao, Ming
954671c3-4167-48ad-a948-26fa5755cee3
Hong, Xia
e6551bb3-fbc0-4990-935e-43b706d8c679
Chen, Sheng
9310a111-f79a-48b8-98c7-383ca93cbb80
Harris, Chris J.
c4fd3763-7b3f-4db1-9ca3-5501080f797a
2012
Gao, Ming
954671c3-4167-48ad-a948-26fa5755cee3
Hong, Xia
e6551bb3-fbc0-4990-935e-43b706d8c679
Chen, Sheng
9310a111-f79a-48b8-98c7-383ca93cbb80
Harris, Chris J.
c4fd3763-7b3f-4db1-9ca3-5501080f797a
Gao, Ming, Hong, Xia, Chen, Sheng and Harris, Chris J.
(2012)
Probability density function estimation based over-sampling for imbalanced two-class problems.
International Joint Conference on Neural Networks, Australia.
10 - 15 Jun 2012.
Record type:
Conference or Workshop Item
(Paper)
Abstract
A novel probability density function (PDF) estimation based over-sampling approach is proposed for two-class imbalanced classification problems. The Parzen-window kernel function is applied to estimate the PDF of the positive class, from which synthetic instances are generated as additional training data to re-balance the class distribution. Utilising the re-balanced over-sampled training data, a radial basis function (RBF) classifier is constructed by applying an orthogonal forward regression, in which the classifier's structure and the parameters of RBF kernels are determined using a particle swarm optimisation algorithm based on the criterion of minimising the leave-one-out misclassification rate. The effectiveness of the proposed approach is demonstrated by an empirical study on several imbalanced data sets.
Text
ijcnn2012-id5.pdf
- Version of Record
Text
P-ijcnn2012-5.pdf
- Other
More information
Published date: 2012
Venue - Dates:
International Joint Conference on Neural Networks, Australia, 2012-06-10 - 2012-06-15
Organisations:
Southampton Wireless Group
Identifiers
Local EPrints ID: 338823
URI: http://eprints.soton.ac.uk/id/eprint/338823
PURE UUID: f1960391-a1ca-477b-ac06-a4da913e82b3
Catalogue record
Date deposited: 17 May 2012 15:11
Last modified: 09 Dec 2019 20:07
Export record
Contributors
Author:
Ming Gao
Author:
Xia Hong
Author:
Sheng Chen
Author:
Chris J. Harris
University divisions
Download statistics
Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.
View more statistics