The University of Southampton
University of Southampton Institutional Repository

Auditory cortical responses as an objective predictor of speech-in-noise performance

Auditory cortical responses as an objective predictor of speech-in-noise performance
Auditory cortical responses as an objective predictor of speech-in-noise performance
The cortical entrainment of speech envelope is a phenomenon where the electrical activity in the brain fluctuates with the change in stimulus intensity, i.e., the stimulus envelope. While clearly speech stimuli are closer to everyday conversation, it is not clear whether the cortical entrainment of speech envelope closely reflects speech intelligibility in the brain more than cortical responses to other types of sound. The overall goal of this thesis is to assess the applicability of cortical responses to speech and non-speech sound for use as a predictor of behavioural speech intelligibility in normal hearing people. Initial works were carried out to explore how to best measure and detect cortical responses to continuous speech.

In the first study cortical responses to continuous speech were stronger when additional pauses were inserted into the stimulus, and this consequently led to a greater the number of detected responses. This also demonstrate that the amount of pauses could introduce a comparison bias as they have different envelopes, it was therefore decided that the continuous speech and non-speech stimuli used in the third study would comprise identical envelope shapes. In the second study, the backward linear modelling (reconstructing speech envelope from cortical responses) was able to detect greater number of significant responses to continuous speech than the forward linear modelling (predicting cortical responses from speech envelope), however, the number of detected responses was lower than the cortical auditory evoked potentials (CAEP) using Hotelling’s T-squared. This suggests that evoked responses to continuous speech are harder to detect than more standard evoked responses. In the third study, behavioural speech recognition scores and objective measures showed significant correlation and it was possible to utilise cortical responses to continuous speech, modulated noise, and /da/ as objective measures to predict speech intelligibility in some subjects. However, the reliability of response measurement was poor, consequently the speech intelligibility prediction was inaccurate in many individuals.

With regards to the main goal of this thesis, all types of cortical responses showed expected monotonic relationship with the stimulus signal-to-noise ratio on a group level, cortical responses to continuous speech were worse in predicting the speech intelligibility in individuals than other types of responses. This gives empirical evidence to show that measurement of cortical responses to continuous speech through the linear modelling approach with scalp responses measurement are less reliable compared to cortical responses to less natural speech stimuli at an individual level. For normal hearing people, a measurement that simply confirms audibility (not necessarily speech intelligibility or comprehension) may be sufficient to predict speech-in-noise test performance.
University of Southampton
Deoisres, Suwijak
e454d5e5-c7c3-4027-bf4e-203c65a4e6cf
Deoisres, Suwijak
e454d5e5-c7c3-4027-bf4e-203c65a4e6cf
Simpson, David
53674880-f381-4cc9-8505-6a97eeac3c2a
Bell, Steven
91de0801-d2b7-44ba-8e8e-523e672aed8a

Deoisres, Suwijak (2023) Auditory cortical responses as an objective predictor of speech-in-noise performance. University of Southampton, Doctoral Thesis, 130pp.

Record type: Thesis (Doctoral)

Abstract

The cortical entrainment of speech envelope is a phenomenon where the electrical activity in the brain fluctuates with the change in stimulus intensity, i.e., the stimulus envelope. While clearly speech stimuli are closer to everyday conversation, it is not clear whether the cortical entrainment of speech envelope closely reflects speech intelligibility in the brain more than cortical responses to other types of sound. The overall goal of this thesis is to assess the applicability of cortical responses to speech and non-speech sound for use as a predictor of behavioural speech intelligibility in normal hearing people. Initial works were carried out to explore how to best measure and detect cortical responses to continuous speech.

In the first study cortical responses to continuous speech were stronger when additional pauses were inserted into the stimulus, and this consequently led to a greater the number of detected responses. This also demonstrate that the amount of pauses could introduce a comparison bias as they have different envelopes, it was therefore decided that the continuous speech and non-speech stimuli used in the third study would comprise identical envelope shapes. In the second study, the backward linear modelling (reconstructing speech envelope from cortical responses) was able to detect greater number of significant responses to continuous speech than the forward linear modelling (predicting cortical responses from speech envelope), however, the number of detected responses was lower than the cortical auditory evoked potentials (CAEP) using Hotelling’s T-squared. This suggests that evoked responses to continuous speech are harder to detect than more standard evoked responses. In the third study, behavioural speech recognition scores and objective measures showed significant correlation and it was possible to utilise cortical responses to continuous speech, modulated noise, and /da/ as objective measures to predict speech intelligibility in some subjects. However, the reliability of response measurement was poor, consequently the speech intelligibility prediction was inaccurate in many individuals.

With regards to the main goal of this thesis, all types of cortical responses showed expected monotonic relationship with the stimulus signal-to-noise ratio on a group level, cortical responses to continuous speech were worse in predicting the speech intelligibility in individuals than other types of responses. This gives empirical evidence to show that measurement of cortical responses to continuous speech through the linear modelling approach with scalp responses measurement are less reliable compared to cortical responses to less natural speech stimuli at an individual level. For normal hearing people, a measurement that simply confirms audibility (not necessarily speech intelligibility or comprehension) may be sufficient to predict speech-in-noise test performance.

Text
Deoisres_Suwijak_PhdThesis_2023 - Version of Record
Available under License University of Southampton Thesis Licence.
Download (9MB)
Text
Final-thesis-submission-Examination-Mr-Suwijak-Deoisres
Restricted to Repository staff only

More information

Published date: 20 April 2023

Identifiers

Local EPrints ID: 476400
URI: http://eprints.soton.ac.uk/id/eprint/476400
PURE UUID: 008405b2-b983-4d2c-83b0-11a0d646dbc2
ORCID for Suwijak Deoisres: ORCID iD orcid.org/0000-0003-1384-3632
ORCID for David Simpson: ORCID iD orcid.org/0000-0001-9072-5088

Catalogue record

Date deposited: 20 Apr 2023 16:35
Last modified: 17 Mar 2024 02:56

Export record

Contributors

Author: Suwijak Deoisres ORCID iD
Thesis advisor: David Simpson ORCID iD
Thesis advisor: Steven Bell

Download statistics

Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.

View more statistics

Atom RSS 1.0 RSS 2.0

Contact ePrints Soton: eprints@soton.ac.uk

ePrints Soton supports OAI 2.0 with a base URL of http://eprints.soton.ac.uk/cgi/oai2

This repository has been built using EPrints software, developed at the University of Southampton, but available to everyone to use.

We use cookies to ensure that we give you the best experience on our website. If you continue without changing your settings, we will assume that you are happy to receive cookies on the University of Southampton website.

×