Auditory inspired machine learning techniques can improve speech intelligibility and quality for hearing-impaired listeners
Auditory inspired machine learning techniques can improve speech intelligibility and quality for hearing-impaired listeners
Machine-learning based approaches to speech enhancement have recently shown great promise for improving speech intelligibility for hearing-impaired listeners. Here, the performance of three machine-learning algorithms and one classical algorithm, Wiener filtering, was compared. Two algorithms based on neural networks were examined, one using a previously reported feature set and one using a feature set derived from an auditory model. The third machine-learning approach was a dictionary-based sparse-coding algorithm. Speech intelligibility and quality scores were obtained for participants with mild-to-moderate hearing impairments listening to sentences in speech-shaped noise and multi-talker babble following processing with the algorithms. Intelligibility and quality scores were significantly improved by each of the three machine-learning approaches, but not by the classical approach. The largest improvements for both speech intelligibility and quality were found by implementing a neural network using the feature set based on auditory modeling. Furthermore, neural network based techniques appeared more promising than dictionary-based, sparse coding in terms of performance and ease of implementation.
1985-1998
Monaghan, Jessica J.M.
c6e0821f-a660-4f07-85ac-66033f0e0b44
Goehring, Tobias
11007d58-6905-451e-aa60-1e1ea681f15a
Yang, Xin
f6a01867-23d6-4548-8a1c-d4e76042f94e
Bolner, Federico
37211af2-a077-42f7-aaec-21be5db4158d
Wang, Shangqiguo
8aacacea-70cb-4de4-82c4-67619ea01f36
Wright, Matthew
b7209187-993d-4f18-8003-9f41aaf88abf
Bleeck, Stefan
c888ccba-e64c-47bf-b8fa-a687e87ec16c
March 2017
Monaghan, Jessica J.M.
c6e0821f-a660-4f07-85ac-66033f0e0b44
Goehring, Tobias
11007d58-6905-451e-aa60-1e1ea681f15a
Yang, Xin
f6a01867-23d6-4548-8a1c-d4e76042f94e
Bolner, Federico
37211af2-a077-42f7-aaec-21be5db4158d
Wang, Shangqiguo
8aacacea-70cb-4de4-82c4-67619ea01f36
Wright, Matthew
b7209187-993d-4f18-8003-9f41aaf88abf
Bleeck, Stefan
c888ccba-e64c-47bf-b8fa-a687e87ec16c
Monaghan, Jessica J.M., Goehring, Tobias, Yang, Xin, Bolner, Federico, Wang, Shangqiguo, Wright, Matthew and Bleeck, Stefan
(2017)
Auditory inspired machine learning techniques can improve speech intelligibility and quality for hearing-impaired listeners.
The Journal of The Acoustical Society of America, 141 (3), .
(doi:10.1121/1.4977197).
Abstract
Machine-learning based approaches to speech enhancement have recently shown great promise for improving speech intelligibility for hearing-impaired listeners. Here, the performance of three machine-learning algorithms and one classical algorithm, Wiener filtering, was compared. Two algorithms based on neural networks were examined, one using a previously reported feature set and one using a feature set derived from an auditory model. The third machine-learning approach was a dictionary-based sparse-coding algorithm. Speech intelligibility and quality scores were obtained for participants with mild-to-moderate hearing impairments listening to sentences in speech-shaped noise and multi-talker babble following processing with the algorithms. Intelligibility and quality scores were significantly improved by each of the three machine-learning approaches, but not by the classical approach. The largest improvements for both speech intelligibility and quality were found by implementing a neural network using the feature set based on auditory modeling. Furthermore, neural network based techniques appeared more promising than dictionary-based, sparse coding in terms of performance and ease of implementation.
Text
ML_enhancement paper submitted
- Accepted Manuscript
Text
pdf_archiveJASMANvol_141iss_31985_1
- Version of Record
Restricted to Repository staff only
Request a copy
Text
-pdf_archive-JASMAN-vol_141-iss_3-1985_1
- Version of Record
Restricted to Repository staff only
Request a copy
More information
Accepted/In Press date: 9 February 2017
e-pub ahead of print date: 22 March 2017
Published date: March 2017
Organisations:
Human Sciences Group, Acoustics Group, Faculty Hub, Education Hub
Identifiers
Local EPrints ID: 407875
URI: http://eprints.soton.ac.uk/id/eprint/407875
PURE UUID: d3d7b581-26e2-4aa6-9eb2-20c19f4bad84
Catalogue record
Date deposited: 27 Apr 2017 01:12
Last modified: 16 Mar 2024 05:12
Export record
Altmetrics
Contributors
Author:
Jessica J.M. Monaghan
Author:
Tobias Goehring
Author:
Xin Yang
Author:
Federico Bolner
Author:
Shangqiguo Wang
Download statistics
Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.
View more statistics