The University of Southampton
University of Southampton Institutional Repository

Sparseness and speech perception in noise

Sparseness and speech perception in noise
Sparseness and speech perception in noise
Can we model speech recognition in noise by exploring higher order statistics of the combined signal? How will changes in these statistics affect speech perception in noise? This study addresses these questions in two experiments. One investigated the relationship between an established "glimpsing" model and the fourth order statistic, kurtosis. The glimpsing model [1] proposes that listeners can explore the local speech-to-noise ratio (SNR) in short time segments (glimpses) and focus on areas where SNR is high. Results showed that there is a very high correlation between percentages of glimpsing area and kurtosis (r = 0.99;p < 0.01), suggesting that kurtosis can serve as a simpler index for measuring glimpsing. The experiment also examined the association between kurtosis and recognition of nonsense words (vowel-consonant-vowel, VCV) in babble modulated noise, also showing very high correlation (r = 0.97;p < 0.01). Another separate study focused on the relationship of sparseness to speech recognition score for VCV words in natural babble noise made of 100 people talking simultaneously [2]. Results show that there is also high correlation between kurtosis and speech recognition score with this noise. Logistic regression analysis to obtain the kurtosis for 50% correct showed this was achieved at a kurtosis of approximately 1.0
Li, Guoping
b791b5c0-52cb-4311-b0de-3d6b2f289835
Lutman, Mark E.
Li, Guoping
b791b5c0-52cb-4311-b0de-3d6b2f289835
Lutman, Mark E.

Li, Guoping and Lutman, Mark E. (2006) Sparseness and speech perception in noise. Ninth International Conference on Spoken Language Processing ?Interspeech - ICSLP), Pittsburgh, United States. 17 - 21 Sep 2006.

Record type: Conference or Workshop Item (Paper)

Abstract

Can we model speech recognition in noise by exploring higher order statistics of the combined signal? How will changes in these statistics affect speech perception in noise? This study addresses these questions in two experiments. One investigated the relationship between an established "glimpsing" model and the fourth order statistic, kurtosis. The glimpsing model [1] proposes that listeners can explore the local speech-to-noise ratio (SNR) in short time segments (glimpses) and focus on areas where SNR is high. Results showed that there is a very high correlation between percentages of glimpsing area and kurtosis (r = 0.99;p < 0.01), suggesting that kurtosis can serve as a simpler index for measuring glimpsing. The experiment also examined the association between kurtosis and recognition of nonsense words (vowel-consonant-vowel, VCV) in babble modulated noise, also showing very high correlation (r = 0.97;p < 0.01). Another separate study focused on the relationship of sparseness to speech recognition score for VCV words in natural babble noise made of 100 people talking simultaneously [2]. Results show that there is also high correlation between kurtosis and speech recognition score with this noise. Logistic regression analysis to obtain the kurtosis for 50% correct showed this was achieved at a kurtosis of approximately 1.0

Text
sparseness_and_speech_perception_in_noise.pdf - Other
Download (457kB)

More information

e-pub ahead of print date: September 2006
Venue - Dates: Ninth International Conference on Spoken Language Processing ?Interspeech - ICSLP), Pittsburgh, United States, 2006-09-17 - 2006-09-21

Identifiers

Local EPrints ID: 188337
URI: http://eprints.soton.ac.uk/id/eprint/188337
PURE UUID: 8582a89a-9a9f-49e2-a88a-3a5fd3177d8a

Catalogue record

Date deposited: 01 Jun 2011 08:31
Last modified: 14 Mar 2024 03:30

Export record

Contributors

Author: Guoping Li
Author: Mark E. Lutman

Download statistics

Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.

View more statistics

Atom RSS 1.0 RSS 2.0

Contact ePrints Soton: eprints@soton.ac.uk

ePrints Soton supports OAI 2.0 with a base URL of http://eprints.soton.ac.uk/cgi/oai2

This repository has been built using EPrints software, developed at the University of Southampton, but available to everyone to use.

We use cookies to ensure that we give you the best experience on our website. If you continue without changing your settings, we will assume that you are happy to receive cookies on the University of Southampton website.

×