Sparseness and speech perception in noise
Sparseness and speech perception in noise
Can we model speech recognition in noise by exploring higher order statistics of the combined signal? How will changes in these statistics affect speech perception in noise? This study addresses these questions in two experiments. One investigated the relationship between an established "glimpsing" model and the fourth order statistic, kurtosis. The glimpsing model [1] proposes that listeners can explore the local speech-to-noise ratio (SNR) in short time segments (glimpses) and focus on areas where SNR is high. Results showed that there is a very high correlation between percentages of glimpsing area and kurtosis (r = 0.99;p < 0.01), suggesting that kurtosis can serve as a simpler index for measuring glimpsing. The experiment also examined the association between kurtosis and recognition of nonsense words (vowel-consonant-vowel, VCV) in babble modulated noise, also showing very high correlation (r = 0.97;p < 0.01). Another separate study focused on the relationship of sparseness to speech recognition score for VCV words in natural babble noise made of 100 people talking simultaneously [2]. Results show that there is also high correlation between kurtosis and speech recognition score with this noise. Logistic regression analysis to obtain the kurtosis for 50% correct showed this was achieved at a kurtosis of approximately 1.0
Li, Guoping
b791b5c0-52cb-4311-b0de-3d6b2f289835
Li, Guoping
b791b5c0-52cb-4311-b0de-3d6b2f289835
Li, Guoping and Lutman, Mark E.
(2006)
Sparseness and speech perception in noise.
Ninth International Conference on Spoken Language Processing ?Interspeech - ICSLP), Pittsburgh, United States.
17 - 21 Sep 2006.
Record type:
Conference or Workshop Item
(Paper)
Abstract
Can we model speech recognition in noise by exploring higher order statistics of the combined signal? How will changes in these statistics affect speech perception in noise? This study addresses these questions in two experiments. One investigated the relationship between an established "glimpsing" model and the fourth order statistic, kurtosis. The glimpsing model [1] proposes that listeners can explore the local speech-to-noise ratio (SNR) in short time segments (glimpses) and focus on areas where SNR is high. Results showed that there is a very high correlation between percentages of glimpsing area and kurtosis (r = 0.99;p < 0.01), suggesting that kurtosis can serve as a simpler index for measuring glimpsing. The experiment also examined the association between kurtosis and recognition of nonsense words (vowel-consonant-vowel, VCV) in babble modulated noise, also showing very high correlation (r = 0.97;p < 0.01). Another separate study focused on the relationship of sparseness to speech recognition score for VCV words in natural babble noise made of 100 people talking simultaneously [2]. Results show that there is also high correlation between kurtosis and speech recognition score with this noise. Logistic regression analysis to obtain the kurtosis for 50% correct showed this was achieved at a kurtosis of approximately 1.0
Text
sparseness_and_speech_perception_in_noise.pdf
- Other
More information
e-pub ahead of print date: September 2006
Venue - Dates:
Ninth International Conference on Spoken Language Processing ?Interspeech - ICSLP), Pittsburgh, United States, 2006-09-17 - 2006-09-21
Identifiers
Local EPrints ID: 188337
URI: http://eprints.soton.ac.uk/id/eprint/188337
PURE UUID: 8582a89a-9a9f-49e2-a88a-3a5fd3177d8a
Catalogue record
Date deposited: 01 Jun 2011 08:31
Last modified: 14 Mar 2024 03:30
Export record
Contributors
Author:
Guoping Li
Author:
Mark E. Lutman
Download statistics
Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.
View more statistics