The University of Southampton
University of Southampton Institutional Repository

Modelling the effect of cochlear implant filterbank characteristics on speech perception

Chowdhury, Shibasis
c976020b-13ed-443e-b9a6-fd0cec88c4fa
Verschuur, C.A.
5e15ee1c-3a44-4dbe-ad43-ec3b50111e41

Chowdhury, Shibasis (2013) Modelling the effect of cochlear implant filterbank characteristics on speech perception. University of Southampton, Faculty of Engineering and the Environment, Doctoral Thesis, 257pp.

Record type: Thesis (Doctoral)

Abstract

The characteristics of a cochlear implant (CI) filterbank determine how spectral and temporal information is coded. Hence, it is important to optimise the filterbank parameters to maximise benefit for CI users. The present thesis aimed to model how manipulating the filterbank analysis length and the assignment of spectral channels may affect CI speech perception, using CI acoustical simulation techniques. Investigations were carried out to study the efficacy of providing additional spectral information in low and/or mid frequency channels using a longer filterbank analysis window, with respect to CI processed speech perception in various types of background noise. However, increasing the filterbank analysis length has an associated trade-off: a reduction in temporal information. Only a few CI acoustic simulation studies have modelled the characteristics of the FFT filterbank, the most commonly used filterbank in commercial CI processors. An initial experiment was carried out to validate the CI acoustical simulation technique used in the present thesis, which implemented an FFT filterbank analysis. Next, the effect of the reduction in temporal information with increasing FFT analysis window length was studied. A filterbank with a 16 ms analysis window, without the implementation of its finer spectral coding abilities, performed marginally worse than one with a 4 ms analysis window in a sentence recognition test. When the finer spectral coding abilities of the filterbank with a 16 ms analysis window were implemented, the results showed that CI processed speech perception in noise could be significantly improved if additional spectral information is provided in the low and mid frequencies. The assignment of additional spectral channels to the low and mid frequencies led to a corresponding reduction in spectral channels assigned to high frequencies.
However, no detrimental effect on speech perception was observed as long as at least two spectral channels represented information above 3 kHz. The assignment of additional low and mid frequency spectral channels also led to significant levels of spectral shift.
The significant benefits from additional low and mid frequency information, however, were lost when the effects of spectral shift were introduced in acute experiments, without any training or acclimatisation period. The findings of the present thesis highlight that a longer filterbank analysis window, such as 16 ms, may be implemented in CI devices without fear of a perceptual cost due to a reduction in temporal information, at least for tasks that do not require talker separation. Providing additional low and mid frequency spectral information with a longer filterbank analysis window has the potential to improve CI speech perception. However, to obtain these potential benefits, the effects of spectral shift must be overcome. The findings of this thesis, however, need to be interpreted in light of the limitations of CI acoustical simulation experiments.
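
The trade-off between analysis window length and spectral resolution mentioned in the abstract can be illustrated with a minimal sketch. Only the 4 ms and 16 ms window lengths come from the abstract; the 16 kHz sampling rate and the function name `fft_window_properties` are assumptions made for illustration and are not taken from the thesis.

```python
def fft_window_properties(window_ms, sample_rate_hz=16000):
    """Return (samples per window, FFT bin spacing in Hz).

    sample_rate_hz is an assumed value for illustration only.
    """
    # Number of samples covered by one analysis window
    n_samples = round(window_ms * sample_rate_hz / 1000)
    # Spacing between adjacent FFT bins when the FFT size equals the window size
    bin_spacing_hz = sample_rate_hz / n_samples
    return n_samples, bin_spacing_hz


for window_ms in (4, 16):
    n_samples, bin_spacing_hz = fft_window_properties(window_ms)
    print(f"{window_ms} ms window: {n_samples} samples, "
          f"{bin_spacing_hz} Hz bin spacing")
```

Under these assumptions, the bin spacing is the reciprocal of the window duration, so it does not depend on the assumed sampling rate: a window four times longer gives bins four times narrower, while each analysis frame spans four times as much signal and therefore resolves temporal changes more coarsely.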

Text
Shibasis Chowdhury_PhD E-Thesis_Modelling the Effects of Cochlear Implant Filterbank Characteristics on Speech Perception.pdf - Other
Restricted to Repository staff only

More information

Published date: August 2013
Organisations: University of Southampton, Inst. Sound & Vibration Research

Identifiers

Local EPrints ID: 366252
URI: http://eprints.soton.ac.uk/id/eprint/366252
PURE UUID: e4f545de-b85a-42c0-adfa-d3812c010d29

Catalogue record

Date deposited: 26 Jun 2014 11:57
Last modified: 14 Mar 2024 17:05

Contributors

Author: Shibasis Chowdhury
Thesis advisor: C.A. Verschuur
