Auditory inspired machine learning techniques can improve speech intelligibility and quality for hearing-impaired listeners

Machine-learning based approaches to speech enhancement have recently shown great promise for improving speech intelligibility for hearing-impaired listeners. Here, the performance of three machine-learning algorithms and one classical algorithm, Wiener filtering, was compared. Two algorithms based on neural networks were examined, one using a previously reported feature set and one using a feature set derived from an auditory model. The third machine-learning approach was a dictionary-based sparse-coding algorithm. Speech intelligibility and quality scores were obtained for participants with mild-to-moderate hearing impairments listening to sentences in speech-shaped noise and multi-talker babble following processing with the algorithms. Intelligibility and quality scores were significantly improved by each of the three machine-learning approaches, but not by the classical approach. The largest improvements for both speech intelligibility and quality were found by implementing a neural network using the feature set based on auditory modeling. Furthermore, neural network based techniques appeared more promising than dictionary-based, sparse coding in terms of performance and ease of implementation.

10.1121/1.4977197

1985-1998

Monaghan, Jessica J.M.

c6e0821f-a660-4f07-85ac-66033f0e0b44

Goehring, Tobias

11007d58-6905-451e-aa60-1e1ea681f15a

Yang, Xin

f6a01867-23d6-4548-8a1c-d4e76042f94e

Bolner, Federico

37211af2-a077-42f7-aaec-21be5db4158d

Wang, Shangqiguo

8aacacea-70cb-4de4-82c4-67619ea01f36

Wright, Matthew

b7209187-993d-4f18-8003-9f41aaf88abf

Bleeck, Stefan

c888ccba-e64c-47bf-b8fa-a687e87ec16c

March 2017

Monaghan, Jessica J.M.

c6e0821f-a660-4f07-85ac-66033f0e0b44

Goehring, Tobias

11007d58-6905-451e-aa60-1e1ea681f15a

Yang, Xin

f6a01867-23d6-4548-8a1c-d4e76042f94e

Bolner, Federico

37211af2-a077-42f7-aaec-21be5db4158d

Wang, Shangqiguo

8aacacea-70cb-4de4-82c4-67619ea01f36

Wright, Matthew

b7209187-993d-4f18-8003-9f41aaf88abf

Bleeck, Stefan

c888ccba-e64c-47bf-b8fa-a687e87ec16c

Monaghan, Jessica J.M., Goehring, Tobias, Yang, Xin, Bolner, Federico, Wang, Shangqiguo, Wright, Matthew and Bleeck, Stefan (2017) Auditory inspired machine learning techniques can improve speech intelligibility and quality for hearing-impaired listeners. The Journal of The Acoustical Society of America, 141 (3), 1985-1998. (doi:10.1121/1.4977197).

Record type: Article

Abstract

Text

ML_enhancement paper submitted - Accepted Manuscript

Available under License University of Southampton Accepted Manuscript Licence.

Download (143kB)

Text

pdf_archiveJASMANvol_141iss_31985_1 - Version of Record

Restricted to Repository staff only

Request a copy

Text

-pdf_archive-JASMAN-vol_141-iss_3-1985_1 - Version of Record

Restricted to Repository staff only

Request a copy

More information

Accepted/In Press date: 9 February 2017

e-pub ahead of print date: 22 March 2017

Published date: March 2017

Organisations: Human Sciences Group, Acoustics Group, Faculty Hub, Education Hub

Identifiers

Local EPrints ID: 407875

URI: http://eprints.soton.ac.uk/id/eprint/407875

DOI: doi:10.1121/1.4977197

PURE UUID: d3d7b581-26e2-4aa6-9eb2-20c19f4bad84

ORCID for Matthew Wright:

orcid.org/0000-0001-9393-4918

ORCID for Stefan Bleeck:

orcid.org/0000-0003-4378-3394

Catalogue record

Date deposited: 27 Apr 2017 01:12

Last modified: 16 Mar 2024 05:12

Export record

Altmetrics

Share this record

Share this on Facebook Share this on Twitter Share this on Weibo

Contributors

Author: Jessica J.M. Monaghan

Author: Tobias Goehring

Author: Xin Yang

Author: Federico Bolner

Author: Shangqiguo Wang

Author: Matthew Wright

Author: Stefan Bleeck

Download statistics

Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.

View more statistics

Library staff additional information