An efficient and versatile variational method for high-dimensional data classification
An efficient and versatile variational method for high-dimensional data classification
High-dimensional data classification is a fundamental task in machine learning and imaging science. In this paper, we propose an efficient and versatile multi-class semi-supervised classification method for classifying high-dimensional data and unstructured point clouds. To begin with, a warm initialization is generated by using a fuzzy classification method such as the standard support vector machine or random labeling. Then an unconstraint convex variational model is proposed to purify and smooth the initialization, followed by a step which is to project the smoothed partition obtained previously to a binary partition. These steps can be repeated, with the latest result as a new initialization, to keep improving the classification quality. We show that the convex model of the smoothing step has a unique solution and can be solved by a specifically designed primal–dual algorithm whose convergence is guaranteed. We test our method and compare it with the state-of-the-art methods on several benchmark data sets. Thorough experimental results demonstrate that our method is superior in both the classification accuracy and computation speed for high-dimensional data and point clouds.
Graph Laplacian, Point cloud classification, Semi-supervised clustering, Variational methods
Cai, Xiaohao
de483445-45e9-4b21-a4e8-b0427fc72cee
Chan, Raymond H.
1898019c-f54f-4b64-807d-d1335e76fd38
Xie, Xiaoyu
3923dab9-5693-49c5-a6f4-e0d79704b4df
Zeng, Tieyong
d84ccff4-4679-4f4c-81e9-39a155b0ec0e
1 August 2024
Cai, Xiaohao
de483445-45e9-4b21-a4e8-b0427fc72cee
Chan, Raymond H.
1898019c-f54f-4b64-807d-d1335e76fd38
Xie, Xiaoyu
3923dab9-5693-49c5-a6f4-e0d79704b4df
Zeng, Tieyong
d84ccff4-4679-4f4c-81e9-39a155b0ec0e
Cai, Xiaohao, Chan, Raymond H., Xie, Xiaoyu and Zeng, Tieyong
(2024)
An efficient and versatile variational method for high-dimensional data classification.
Journal of Scientific Computing, 100 (3), [81].
(doi:10.1007/s10915-024-02644-9).
Abstract
High-dimensional data classification is a fundamental task in machine learning and imaging science. In this paper, we propose an efficient and versatile multi-class semi-supervised classification method for classifying high-dimensional data and unstructured point clouds. To begin with, a warm initialization is generated by using a fuzzy classification method such as the standard support vector machine or random labeling. Then an unconstraint convex variational model is proposed to purify and smooth the initialization, followed by a step which is to project the smoothed partition obtained previously to a binary partition. These steps can be repeated, with the latest result as a new initialization, to keep improving the classification quality. We show that the convex model of the smoothing step has a unique solution and can be solved by a specifically designed primal–dual algorithm whose convergence is guaranteed. We test our method and compare it with the state-of-the-art methods on several benchmark data sets. Thorough experimental results demonstrate that our method is superior in both the classification accuracy and computation speed for high-dimensional data and point clouds.
Text
s10915-024-02644-9
- Version of Record
More information
Accepted/In Press date: 24 July 2024
Published date: 1 August 2024
Keywords:
Graph Laplacian, Point cloud classification, Semi-supervised clustering, Variational methods
Identifiers
Local EPrints ID: 493545
URI: http://eprints.soton.ac.uk/id/eprint/493545
ISSN: 0885-7474
PURE UUID: d62370df-62f8-4fbe-a860-809a6d5cc9b0
Catalogue record
Date deposited: 05 Sep 2024 17:14
Last modified: 06 Sep 2024 01:59
Export record
Altmetrics
Contributors
Author:
Xiaohao Cai
Author:
Raymond H. Chan
Author:
Xiaoyu Xie
Author:
Tieyong Zeng
Download statistics
Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.
View more statistics