The University of Southampton
University of Southampton Institutional Repository

Sparse non-negative matrix factorization strategy for cochlear implants

Sparse non-negative matrix factorization strategy for cochlear implants
Sparse non-negative matrix factorization strategy for cochlear implants
Current cochlear implant (CI) strategies carry speech information via the waveform envelope in frequency subbands. CIs require efficient speech processing to maximize information transfer to the brain, especially in background noise, where the speech envelope is not robust to noise interference. In such conditions, the envelope, after decomposition into frequency bands, may be enhanced by sparse transformations, such as nonnegative matrix factorization (NMF). Here, a novel CI processing algorithm is described, which works by applying NMF to the envelope matrix (envelopogram) of 22 frequency channels in order to improve performance in noisy environments. It is evaluated for speech in eight-talker babble noise. The critical sparsity constraint parameter was first tuned using objective measures and then evaluated with subjective speech perception experiments for both normal hearing and CI subjects. Results from vocoder simulations with 10 normal hearing subjects showed that the algorithm significantly enhances speech intelligibility with the selected sparsity constraints. Results from eight CI subjects showed no significant overall improvement compared with the standard advanced combination encoder algorithm, but a trend toward improvement of word identification of about 10 percentage points at +15 dB signal-to-noise ratio (SNR) was observed in the eight CI subjects. Additionally, a considerable reduction of the spread of speech perception performance from 40% to 93% for advanced combination encoder to 80% to 100% for the suggested NMF coding strategy was observed.
2331-2165
Hu, Hongmei
619a5602-4865-4100-9be9-f31572a0953d
Lutman, Mark E.
Ewert, Stephan D.
0d066c6b-cde8-47c4-ba4d-b32b11a58876
Li, Guoping
b791b5c0-52cb-4311-b0de-3d6b2f289835
Bleeck, Stefan
c888ccba-e64c-47bf-b8fa-a687e87ec16c
Hu, Hongmei
619a5602-4865-4100-9be9-f31572a0953d
Lutman, Mark E.
Ewert, Stephan D.
0d066c6b-cde8-47c4-ba4d-b32b11a58876
Li, Guoping
b791b5c0-52cb-4311-b0de-3d6b2f289835
Bleeck, Stefan
c888ccba-e64c-47bf-b8fa-a687e87ec16c

Hu, Hongmei, Lutman, Mark E., Ewert, Stephan D., Li, Guoping and Bleeck, Stefan (2015) Sparse non-negative matrix factorization strategy for cochlear implants. Trends in Hearing, 19. (doi:10.1177/2331216515616941).

Record type: Article

Abstract

Current cochlear implant (CI) strategies carry speech information via the waveform envelope in frequency subbands. CIs require efficient speech processing to maximize information transfer to the brain, especially in background noise, where the speech envelope is not robust to noise interference. In such conditions, the envelope, after decomposition into frequency bands, may be enhanced by sparse transformations, such as nonnegative matrix factorization (NMF). Here, a novel CI processing algorithm is described, which works by applying NMF to the envelope matrix (envelopogram) of 22 frequency channels in order to improve performance in noisy environments. It is evaluated for speech in eight-talker babble noise. The critical sparsity constraint parameter was first tuned using objective measures and then evaluated with subjective speech perception experiments for both normal hearing and CI subjects. Results from vocoder simulations with 10 normal hearing subjects showed that the algorithm significantly enhances speech intelligibility with the selected sparsity constraints. Results from eight CI subjects showed no significant overall improvement compared with the standard advanced combination encoder algorithm, but a trend toward improvement of word identification of about 10 percentage points at +15 dB signal-to-noise ratio (SNR) was observed in the eight CI subjects. Additionally, a considerable reduction of the spread of speech perception performance from 40% to 93% for advanced combination encoder to 80% to 100% for the suggested NMF coding strategy was observed.

This record has no associated files available for download.

More information

e-pub ahead of print date: 16 December 2015
Published date: 29 December 2015
Organisations: Human Sciences Group, Inst. Sound & Vibration Research

Identifiers

Local EPrints ID: 407136
URI: http://eprints.soton.ac.uk/id/eprint/407136
ISSN: 2331-2165
PURE UUID: cedc1753-fca3-45f3-aeb7-3ff42b7326f2
ORCID for Stefan Bleeck: ORCID iD orcid.org/0000-0003-4378-3394

Catalogue record

Date deposited: 30 Mar 2017 01:07
Last modified: 16 Mar 2024 03:49

Export record

Altmetrics

Contributors

Author: Hongmei Hu
Author: Mark E. Lutman
Author: Stephan D. Ewert
Author: Guoping Li
Author: Stefan Bleeck ORCID iD

Download statistics

Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.

View more statistics

Atom RSS 1.0 RSS 2.0

Contact ePrints Soton: eprints@soton.ac.uk

ePrints Soton supports OAI 2.0 with a base URL of http://eprints.soton.ac.uk/cgi/oai2

This repository has been built using EPrints software, developed at the University of Southampton, but available to everyone to use.

We use cookies to ensure that we give you the best experience on our website. If you continue without changing your settings, we will assume that you are happy to receive cookies on the University of Southampton website.

×