The University of Southampton
University of Southampton Institutional Repository

Sum-of-norms regularized nonnegative matrix factorization

Sum-of-norms regularized nonnegative matrix factorization
Sum-of-norms regularized nonnegative matrix factorization
When applying nonnegative matrix factorization (NMF), generally the rank parameter is unknown. Such rank in NMF, called the nonnegative rank, is usually estimated heuristically since computing the exact value of it is NP-hard. In this work, we propose an approximation method to estimate such rank while solving NMF on-the-fly. We use sum-of-norm (SON), a group-lasso structure that encourages pairwise similarity, to reduce the rank of a factor matrix where the rank is overestimated at the beginning. On various datasets, SON-NMF is able to reveal the correct nonnegative rank of the data without any prior knowledge nor tuning. SON-NMF is a nonconvx nonsmmoth non-separable non-proximable problem, solving it is nontrivial. First, as rank estimation in NMF is NP-hard, the proposed approach does not enjoy a lower computational complexity. Using a graph-theoretic argument, we prove that the complexity of the SON-NMF is almost irreducible. Second, the per-iteration cost of any algorithm solving SON-NMF is possibly high, which motivated us to propose a first-order BCD algorithm to approximately solve SON-NMF with a low per-iteration cost, in which we do so by the proximal average operator. Lastly, we propose a simple greedy method for post-processing. SON-NMF exhibits favourable features for applications. Beside the ability to automatically estimate the rank from data, SON-NMF can deal with rank-deficient data matrix, can detect weak component with small energy. Furthermore, on the application of hyperspectral imaging, SON-NMF handle the issue of spectral variability naturally.
cs.LG, math.OC, stat.ML
arXiv
Ang, Andersen
ed509ecd-39a3-4887-a709-339fdaded867
Hamed, Waqas Bin
4f8a8f76-daa8-4bcf-9cde-cb80470e811a
Sterck, Hans De
2ed04478-7382-446f-93a7-6ce8462049eb
Ang, Andersen
ed509ecd-39a3-4887-a709-339fdaded867
Hamed, Waqas Bin
4f8a8f76-daa8-4bcf-9cde-cb80470e811a
Sterck, Hans De
2ed04478-7382-446f-93a7-6ce8462049eb

[Unknown type: UNSPECIFIED]

Record type: UNSPECIFIED

Abstract

When applying nonnegative matrix factorization (NMF), generally the rank parameter is unknown. Such rank in NMF, called the nonnegative rank, is usually estimated heuristically since computing the exact value of it is NP-hard. In this work, we propose an approximation method to estimate such rank while solving NMF on-the-fly. We use sum-of-norm (SON), a group-lasso structure that encourages pairwise similarity, to reduce the rank of a factor matrix where the rank is overestimated at the beginning. On various datasets, SON-NMF is able to reveal the correct nonnegative rank of the data without any prior knowledge nor tuning. SON-NMF is a nonconvx nonsmmoth non-separable non-proximable problem, solving it is nontrivial. First, as rank estimation in NMF is NP-hard, the proposed approach does not enjoy a lower computational complexity. Using a graph-theoretic argument, we prove that the complexity of the SON-NMF is almost irreducible. Second, the per-iteration cost of any algorithm solving SON-NMF is possibly high, which motivated us to propose a first-order BCD algorithm to approximately solve SON-NMF with a low per-iteration cost, in which we do so by the proximal average operator. Lastly, we propose a simple greedy method for post-processing. SON-NMF exhibits favourable features for applications. Beside the ability to automatically estimate the rank from data, SON-NMF can deal with rank-deficient data matrix, can detect weak component with small energy. Furthermore, on the application of hyperspectral imaging, SON-NMF handle the issue of spectral variability naturally.

Text
2407.00706v1 - Author's Original
Available under License Creative Commons Attribution.
Download (17MB)

More information

Published date: 30 June 2024
Additional Information: 12 figures
Keywords: cs.LG, math.OC, stat.ML

Identifiers

Local EPrints ID: 495942
URI: http://eprints.soton.ac.uk/id/eprint/495942
PURE UUID: 69b3474f-c18f-45bf-94bb-dc358e4a1b99
ORCID for Andersen Ang: ORCID iD orcid.org/0000-0002-8330-758X

Catalogue record

Date deposited: 28 Nov 2024 17:30
Last modified: 30 Nov 2024 03:12

Export record

Altmetrics

Contributors

Author: Andersen Ang ORCID iD
Author: Waqas Bin Hamed
Author: Hans De Sterck

Download statistics

Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.

View more statistics

Atom RSS 1.0 RSS 2.0

Contact ePrints Soton: eprints@soton.ac.uk

ePrints Soton supports OAI 2.0 with a base URL of http://eprints.soton.ac.uk/cgi/oai2

This repository has been built using EPrints software, developed at the University of Southampton, but available to everyone to use.

We use cookies to ensure that we give you the best experience on our website. If you continue without changing your settings, we will assume that you are happy to receive cookies on the University of Southampton website.

×