The University of Southampton
University of Southampton Institutional Repository

Speech enhancement based on neural networks improves speech intelligibility in noise for cochlear implant users

Speech enhancement based on neural networks improves speech intelligibility in noise for cochlear implant users
Speech enhancement based on neural networks improves speech intelligibility in noise for cochlear implant users
Speech understanding in noisy environments is still one of the major challenges for cochlear implant(CI) users in everyday life. We evaluated a speech enhancement algorithm based on neural networks(NNSE) for improving speech intelligibility in noise for CI users. The algorithm decomposes the noisy speech signal into time-frequency units, extracts a set of auditory-inspired features and feeds them to the neural network to produce an estimation of which frequency channels contain more perceptually important information (higher signal-to-noise ratio, SNR). This estimate is used to attenuate noise-dominated and retain speech-dominated CI channels for electrical stimulation, as in traditional n-of-m CI coding strategies. The proposed algorithm was evaluated by measuring the speech-in-noise performance of 14 CI users using three types of background noise. Two NNSE algorithms were compared: a speaker-dependent algorithm, that was trained on the target speaker used for testing, and a speaker-independent algorithm, that was trained on different speakers. Significant improvements in the intelligibility of speech in stationary and fluctuating noises were found relative to the unprocessed condition for the speaker-dependent algorithm in all noise types and for the speaker-independent algorithm in 2 out of 3 noise types. The NNSE algorithms used noise-specific neural networks that generalized to novel segments of the same noise type and worked over a range of SNRs. The proposed algorithm has the potential to improve the intelligibility of speech in noise for CI users while meeting the requirements of low computational complexity and processing delay for application in CI devices.
Cochlear implants, Noise reduction, Speech enhancement, Machine learning, Neural networks
0378-5955
183-194
Goehring, Tobias
11007d58-6905-451e-aa60-1e1ea681f15a
Bolner, Federico
37211af2-a077-42f7-aaec-21be5db4158d
Monaghan, Jessica J. M.
c6e0821f-a660-4f07-85ac-66033f0e0b44
van Dijk, Bas
d3a89823-6a40-4856-8f94-45e73f0cae3a
Zarowski, Andrzej
d8a3360c-5041-498f-a7fb-de6964b50243
Bleeck, Stefan
c888ccba-e64c-47bf-b8fa-a687e87ec16c
Goehring, Tobias
11007d58-6905-451e-aa60-1e1ea681f15a
Bolner, Federico
37211af2-a077-42f7-aaec-21be5db4158d
Monaghan, Jessica J. M.
c6e0821f-a660-4f07-85ac-66033f0e0b44
van Dijk, Bas
d3a89823-6a40-4856-8f94-45e73f0cae3a
Zarowski, Andrzej
d8a3360c-5041-498f-a7fb-de6964b50243
Bleeck, Stefan
c888ccba-e64c-47bf-b8fa-a687e87ec16c

Goehring, Tobias, Bolner, Federico and Monaghan, Jessica J. M. et al. (2017) Speech enhancement based on neural networks improves speech intelligibility in noise for cochlear implant users. Hearing Research, 344, 183-194. (doi:10.1016/j.heares.2016.11.012).

Record type: Article

Abstract

Speech understanding in noisy environments is still one of the major challenges for cochlear implant(CI) users in everyday life. We evaluated a speech enhancement algorithm based on neural networks(NNSE) for improving speech intelligibility in noise for CI users. The algorithm decomposes the noisy speech signal into time-frequency units, extracts a set of auditory-inspired features and feeds them to the neural network to produce an estimation of which frequency channels contain more perceptually important information (higher signal-to-noise ratio, SNR). This estimate is used to attenuate noise-dominated and retain speech-dominated CI channels for electrical stimulation, as in traditional n-of-m CI coding strategies. The proposed algorithm was evaluated by measuring the speech-in-noise performance of 14 CI users using three types of background noise. Two NNSE algorithms were compared: a speaker-dependent algorithm, that was trained on the target speaker used for testing, and a speaker-independent algorithm, that was trained on different speakers. Significant improvements in the intelligibility of speech in stationary and fluctuating noises were found relative to the unprocessed condition for the speaker-dependent algorithm in all noise types and for the speaker-independent algorithm in 2 out of 3 noise types. The NNSE algorithms used noise-specific neural networks that generalized to novel segments of the same noise type and worked over a range of SNRs. The proposed algorithm has the potential to improve the intelligibility of speech in noise for CI users while meeting the requirements of low computational complexity and processing delay for application in CI devices.

Text
HEARES_2016.pdf copy.pdf - Accepted Manuscript
Download (3MB)

More information

Accepted/In Press date: 20 November 2016
e-pub ahead of print date: 30 November 2016
Published date: February 2017
Keywords: Cochlear implants, Noise reduction, Speech enhancement, Machine learning, Neural networks
Organisations: Human Sciences Group

Identifiers

Local EPrints ID: 403269
URI: http://eprints.soton.ac.uk/id/eprint/403269
ISSN: 0378-5955
PURE UUID: 606bb854-1704-49c4-949a-a32d68f40c03
ORCID for Stefan Bleeck: ORCID iD orcid.org/0000-0003-4378-3394

Catalogue record

Date deposited: 29 Nov 2016 14:00
Last modified: 16 Mar 2024 03:49

Export record

Altmetrics

Contributors

Author: Tobias Goehring
Author: Federico Bolner
Author: Jessica J. M. Monaghan
Author: Bas van Dijk
Author: Andrzej Zarowski
Author: Stefan Bleeck ORCID iD

Download statistics

Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.

View more statistics

Atom RSS 1.0 RSS 2.0

Contact ePrints Soton: eprints@soton.ac.uk

ePrints Soton supports OAI 2.0 with a base URL of http://eprints.soton.ac.uk/cgi/oai2

This repository has been built using EPrints software, developed at the University of Southampton, but available to everyone to use.

We use cookies to ensure that we give you the best experience on our website. If you continue without changing your settings, we will assume that you are happy to receive cookies on the University of Southampton website.

×