The University of Southampton
University of Southampton Institutional Repository

Relationship between speech recognition in noise and sparseness

Relationship between speech recognition in noise and sparseness
Relationship between speech recognition in noise and sparseness
Established methods for predicting speech recognition in noise require knowledge of clean speech signals, placing limitations on their application. The study evaluates an alternative approach based on characteristics of noisy speech, specifically its sparseness as represented by the statistic kurtosis. Design: Experiments 1 and 2 involved acoustic analysis of vowel-consonant-vowel (VCV) syllables in babble noise, comparing kurtosis, glimpsing areas, and extended speech intelligibility index (ESII) of noisy speech signals with one another and with pre-existing speech recognition scores. Experiment 3 manipulated kurtosis of VCV syllables and investigated effects on speech recognition scores in normal-hearing listeners. Study sample: Pre-existing speech recognition data for Experiments 1 and 2; seven normal-hearing participants for Experiment 3. Results: Experiments 1 and 2 demonstrated that kurtosis calculated in the time-domain from noisy speech is highly correlated (r > 0.98) with established prediction models: glimpsing and ESII. All three measures predicted speech recognition scores well. The final experiment showed a clear monotonic relationship between speech recognition scores and kurtosis. Conclusions: Speech recognition performance in noise is closely related to the sparseness (kurtosis) of the noisy speech signal, at least for the types of speech and noise used here and for listeners with normal hearing


glimpsing, babble, speech intelligibility index, cocktail party effect
75-82
Li, Guoping
b791b5c0-52cb-4311-b0de-3d6b2f289835
Lutman, Mark E.
Wang, Shouyan
fa12f1bf-cac9-4118-abdd-9d52f235b05c
Bleeck, Stefan
c888ccba-e64c-47bf-b8fa-a687e87ec16c
Li, Guoping
b791b5c0-52cb-4311-b0de-3d6b2f289835
Lutman, Mark E.
Wang, Shouyan
fa12f1bf-cac9-4118-abdd-9d52f235b05c
Bleeck, Stefan
c888ccba-e64c-47bf-b8fa-a687e87ec16c

Li, Guoping, Lutman, Mark E., Wang, Shouyan and Bleeck, Stefan (2012) Relationship between speech recognition in noise and sparseness. International Journal of Audiology, 51 (2), 75-82. (doi:10.3109/14992027.2011.625984). (PMID:22107445)

Record type: Article

Abstract

Established methods for predicting speech recognition in noise require knowledge of clean speech signals, placing limitations on their application. The study evaluates an alternative approach based on characteristics of noisy speech, specifically its sparseness as represented by the statistic kurtosis. Design: Experiments 1 and 2 involved acoustic analysis of vowel-consonant-vowel (VCV) syllables in babble noise, comparing kurtosis, glimpsing areas, and extended speech intelligibility index (ESII) of noisy speech signals with one another and with pre-existing speech recognition scores. Experiment 3 manipulated kurtosis of VCV syllables and investigated effects on speech recognition scores in normal-hearing listeners. Study sample: Pre-existing speech recognition data for Experiments 1 and 2; seven normal-hearing participants for Experiment 3. Results: Experiments 1 and 2 demonstrated that kurtosis calculated in the time-domain from noisy speech is highly correlated (r > 0.98) with established prediction models: glimpsing and ESII. All three measures predicted speech recognition scores well. The final experiment showed a clear monotonic relationship between speech recognition scores and kurtosis. Conclusions: Speech recognition performance in noise is closely related to the sparseness (kurtosis) of the noisy speech signal, at least for the types of speech and noise used here and for listeners with normal hearing


This record has no associated files available for download.

More information

e-pub ahead of print date: 22 November 2011
Published date: February 2012
Keywords: glimpsing, babble, speech intelligibility index, cocktail party effect
Organisations: Human Sciences Group

Identifiers

Local EPrints ID: 334268
URI: http://eprints.soton.ac.uk/id/eprint/334268
PURE UUID: d77c9cc1-41eb-40a8-ae42-4023c1f23298
ORCID for Stefan Bleeck: ORCID iD orcid.org/0000-0003-4378-3394

Catalogue record

Date deposited: 06 Mar 2012 14:53
Last modified: 15 Mar 2024 03:25

Export record

Altmetrics

Contributors

Author: Guoping Li
Author: Mark E. Lutman
Author: Shouyan Wang
Author: Stefan Bleeck ORCID iD

Download statistics

Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.

View more statistics

Atom RSS 1.0 RSS 2.0

Contact ePrints Soton: eprints@soton.ac.uk

ePrints Soton supports OAI 2.0 with a base URL of http://eprints.soton.ac.uk/cgi/oai2

This repository has been built using EPrints software, developed at the University of Southampton, but available to everyone to use.

We use cookies to ensure that we give you the best experience on our website. If you continue without changing your settings, we will assume that you are happy to receive cookies on the University of Southampton website.

×