The University of Southampton
University of Southampton Institutional Repository

Auditory inspired machine learning techniques can improve speech intelligibility and quality for hearing-impaired listeners

Auditory inspired machine learning techniques can improve speech intelligibility and quality for hearing-impaired listeners
Auditory inspired machine learning techniques can improve speech intelligibility and quality for hearing-impaired listeners
Machine-learning based approaches to speech enhancement have recently shown great promise for improving speech intelligibility for hearing-impaired listeners. Here, the performance of three machine-learning algorithms and one classical algorithm, Wiener filtering, was compared. Two algorithms based on neural networks were examined, one using a previously reported feature set and one using a feature set derived from an auditory model. The third machine-learning approach was a dictionary-based sparse-coding algorithm. Speech intelligibility and quality scores were obtained for participants with mild-to-moderate hearing impairments listening to sentences in speech-shaped noise and multi-talker babble following processing with the algorithms. Intelligibility and quality scores were significantly improved by each of the three machine-learning approaches, but not by the classical approach. The largest improvements for both speech intelligibility and quality were found by implementing a neural network using the feature set based on auditory modeling. Furthermore, neural network based techniques appeared more promising than dictionary-based, sparse coding in terms of performance and ease of implementation.
1985-1998
Monaghan, Jessica J.M.
c6e0821f-a660-4f07-85ac-66033f0e0b44
Goehring, Tobias
11007d58-6905-451e-aa60-1e1ea681f15a
Yang, Xin
f6a01867-23d6-4548-8a1c-d4e76042f94e
Bolner, Federico
37211af2-a077-42f7-aaec-21be5db4158d
Wang, Shangqiguo
8aacacea-70cb-4de4-82c4-67619ea01f36
Wright, Matthew
b7209187-993d-4f18-8003-9f41aaf88abf
Bleeck, Stefan
c888ccba-e64c-47bf-b8fa-a687e87ec16c
Monaghan, Jessica J.M.
c6e0821f-a660-4f07-85ac-66033f0e0b44
Goehring, Tobias
11007d58-6905-451e-aa60-1e1ea681f15a
Yang, Xin
f6a01867-23d6-4548-8a1c-d4e76042f94e
Bolner, Federico
37211af2-a077-42f7-aaec-21be5db4158d
Wang, Shangqiguo
8aacacea-70cb-4de4-82c4-67619ea01f36
Wright, Matthew
b7209187-993d-4f18-8003-9f41aaf88abf
Bleeck, Stefan
c888ccba-e64c-47bf-b8fa-a687e87ec16c

Monaghan, Jessica J.M., Goehring, Tobias, Yang, Xin, Bolner, Federico, Wang, Shangqiguo, Wright, Matthew and Bleeck, Stefan (2017) Auditory inspired machine learning techniques can improve speech intelligibility and quality for hearing-impaired listeners. The Journal of The Acoustical Society of America, 141 (3), 1985-1998. (doi:10.1121/1.4977197).

Record type: Article

Abstract

Machine-learning based approaches to speech enhancement have recently shown great promise for improving speech intelligibility for hearing-impaired listeners. Here, the performance of three machine-learning algorithms and one classical algorithm, Wiener filtering, was compared. Two algorithms based on neural networks were examined, one using a previously reported feature set and one using a feature set derived from an auditory model. The third machine-learning approach was a dictionary-based sparse-coding algorithm. Speech intelligibility and quality scores were obtained for participants with mild-to-moderate hearing impairments listening to sentences in speech-shaped noise and multi-talker babble following processing with the algorithms. Intelligibility and quality scores were significantly improved by each of the three machine-learning approaches, but not by the classical approach. The largest improvements for both speech intelligibility and quality were found by implementing a neural network using the feature set based on auditory modeling. Furthermore, neural network based techniques appeared more promising than dictionary-based, sparse coding in terms of performance and ease of implementation.

Text
ML_enhancement paper submitted - Accepted Manuscript
Download (143kB)
Text
pdf_archiveJASMANvol_141iss_31985_1 - Version of Record
Restricted to Repository staff only
Request a copy
Text
-pdf_archive-JASMAN-vol_141-iss_3-1985_1 - Version of Record
Restricted to Repository staff only
Request a copy

More information

Accepted/In Press date: 9 February 2017
e-pub ahead of print date: 22 March 2017
Published date: March 2017
Organisations: Human Sciences Group, Acoustics Group, Faculty Hub, Education Hub

Identifiers

Local EPrints ID: 407875
URI: http://eprints.soton.ac.uk/id/eprint/407875
PURE UUID: d3d7b581-26e2-4aa6-9eb2-20c19f4bad84
ORCID for Matthew Wright: ORCID iD orcid.org/0000-0001-9393-4918
ORCID for Stefan Bleeck: ORCID iD orcid.org/0000-0003-4378-3394

Catalogue record

Date deposited: 27 Apr 2017 01:12
Last modified: 16 Mar 2024 05:12

Export record

Altmetrics

Contributors

Author: Jessica J.M. Monaghan
Author: Tobias Goehring
Author: Xin Yang
Author: Federico Bolner
Author: Shangqiguo Wang
Author: Matthew Wright ORCID iD
Author: Stefan Bleeck ORCID iD

Download statistics

Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.

View more statistics

Atom RSS 1.0 RSS 2.0

Contact ePrints Soton: eprints@soton.ac.uk

ePrints Soton supports OAI 2.0 with a base URL of http://eprints.soton.ac.uk/cgi/oai2

This repository has been built using EPrints software, developed at the University of Southampton, but available to everyone to use.

We use cookies to ensure that we give you the best experience on our website. If you continue without changing your settings, we will assume that you are happy to receive cookies on the University of Southampton website.

×