Sum-of-norms regularized nonnegative matrix factorization
Sum-of-norms regularized nonnegative matrix factorization
When applying nonnegative matrix factorization (NMF), generally the rank parameter is unknown. Such rank in NMF, called the nonnegative rank, is usually estimated heuristically since computing the exact value of it is NP-hard. In this work, we propose an approximation method to estimate such rank while solving NMF on-the-fly. We use sum-of-norm (SON), a group-lasso structure that encourages pairwise similarity, to reduce the rank of a factor matrix where the rank is overestimated at the beginning. On various datasets, SON-NMF is able to reveal the correct nonnegative rank of the data without any prior knowledge nor tuning. SON-NMF is a nonconvx nonsmmoth non-separable non-proximable problem, solving it is nontrivial. First, as rank estimation in NMF is NP-hard, the proposed approach does not enjoy a lower computational complexity. Using a graph-theoretic argument, we prove that the complexity of the SON-NMF is almost irreducible. Second, the per-iteration cost of any algorithm solving SON-NMF is possibly high, which motivated us to propose a first-order BCD algorithm to approximately solve SON-NMF with a low per-iteration cost, in which we do so by the proximal average operator. Lastly, we propose a simple greedy method for post-processing. SON-NMF exhibits favourable features for applications. Beside the ability to automatically estimate the rank from data, SON-NMF can deal with rank-deficient data matrix, can detect weak component with small energy. Furthermore, on the application of hyperspectral imaging, SON-NMF handle the issue of spectral variability naturally.
cs.LG, math.OC, stat.ML
Ang, Andersen
ed509ecd-39a3-4887-a709-339fdaded867
Hamed, Waqas Bin
4f8a8f76-daa8-4bcf-9cde-cb80470e811a
Sterck, Hans De
2ed04478-7382-446f-93a7-6ce8462049eb
30 June 2024
Ang, Andersen
ed509ecd-39a3-4887-a709-339fdaded867
Hamed, Waqas Bin
4f8a8f76-daa8-4bcf-9cde-cb80470e811a
Sterck, Hans De
2ed04478-7382-446f-93a7-6ce8462049eb
[Unknown type: UNSPECIFIED]
Abstract
When applying nonnegative matrix factorization (NMF), generally the rank parameter is unknown. Such rank in NMF, called the nonnegative rank, is usually estimated heuristically since computing the exact value of it is NP-hard. In this work, we propose an approximation method to estimate such rank while solving NMF on-the-fly. We use sum-of-norm (SON), a group-lasso structure that encourages pairwise similarity, to reduce the rank of a factor matrix where the rank is overestimated at the beginning. On various datasets, SON-NMF is able to reveal the correct nonnegative rank of the data without any prior knowledge nor tuning. SON-NMF is a nonconvx nonsmmoth non-separable non-proximable problem, solving it is nontrivial. First, as rank estimation in NMF is NP-hard, the proposed approach does not enjoy a lower computational complexity. Using a graph-theoretic argument, we prove that the complexity of the SON-NMF is almost irreducible. Second, the per-iteration cost of any algorithm solving SON-NMF is possibly high, which motivated us to propose a first-order BCD algorithm to approximately solve SON-NMF with a low per-iteration cost, in which we do so by the proximal average operator. Lastly, we propose a simple greedy method for post-processing. SON-NMF exhibits favourable features for applications. Beside the ability to automatically estimate the rank from data, SON-NMF can deal with rank-deficient data matrix, can detect weak component with small energy. Furthermore, on the application of hyperspectral imaging, SON-NMF handle the issue of spectral variability naturally.
Text
2407.00706v1
- Author's Original
More information
Published date: 30 June 2024
Additional Information:
12 figures
Keywords:
cs.LG, math.OC, stat.ML
Identifiers
Local EPrints ID: 495942
URI: http://eprints.soton.ac.uk/id/eprint/495942
PURE UUID: 69b3474f-c18f-45bf-94bb-dc358e4a1b99
Catalogue record
Date deposited: 28 Nov 2024 17:30
Last modified: 30 Nov 2024 03:12
Export record
Altmetrics
Contributors
Author:
Andersen Ang
Author:
Waqas Bin Hamed
Author:
Hans De Sterck
Download statistics
Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.
View more statistics