Modelling the effect of cochlear implant filterbank characteristics on speech perception
Modelling the effect of cochlear implant filterbank characteristics on speech perception
The characteristics of a cochlear implant (CI) filterbank determine the coding of spectral and temporal information in it. Hence, it is important to optimise the filterbank parameters to achieve optimal benefit in CI users. The present thesis aimed at modelling how the manipulation of the filterbank analysis length and the assignment of spectral channels may effect CI speech perception, using CI acoustical simulation techniques. Investigations were carried out to study the efficacy of providing additional spectral information in low and/or mid frequency channels using a longer filterbank analysis window, with respect to CI processed speech perception in various types of background noise. However, the increase of filterbank analysis length has an associated trade-off, which is a reduction in temporal information. Only a few CI acoustic simulations studies have modelled the characteristics of the FFT filterbank, the most commonly used filterbank in commercial CI processors. An initial experiment was carried out to validate the CI acoustical simulation technique used in the present thesis that implemented an FFT filterbank analysis. Next, the effect of a reduction in temporal information with the increase of the FFT analysis window length was studied. A filterbank with 16 ms analysis window, without the implementation of its finer spectral coding abilities, performed marginally poorer to that of a 4 ms analysis window in a sentence recognition test. The finer spectral coding abilities of the filterbank with 16 ms analysis window, when implemented, revealed that CI processed speech perception in noise could be significantly improved if additional spectral information is provided in the low and mid frequencies. The assignment of additional spectral channels to the low and mid frequencies led to a corresponding reduction in spectral channels assigned to high frequencies. However, no detrimental effect in speech perception was observed as long as at least two spectral channels represented information above 3 kHz. The assignment of additionallow and mid frequency spectral channels also led to significant levels of spectral shift.
The significant benefits from additional low and mid frequency information, however, were lost when the effects of spectral shift were introduced in acute experiments, without any training or acclimatisation period. The findings of the present thesis highlight that a longer filterbank analysis, such as 16 ms, may be implemented in CI devices without the fear of any perceptual cost due to a reduction in temporal information, at least for tasks that do not require talker separation. Providing additional low and mid frequency spectral information with a longer filterbank analysis has the potential to improve CI speech perception. However, to obtain potential benefits, the effects of spectral shift should be overcome. The findings of this thesis, however, need to be interpreted considering the limitations of CI acoustical simulation experiments.
Chowdhury, Shibasis
c976020b-13ed-443e-b9a6-fd0cec88c4fa
August 2013
Chowdhury, Shibasis
c976020b-13ed-443e-b9a6-fd0cec88c4fa
Verschuur, C.A.
5e15ee1c-3a44-4dbe-ad43-ec3b50111e41
Chowdhury, Shibasis
(2013)
Modelling the effect of cochlear implant filterbank characteristics on speech perception.
University of Southampton, Faculty of Engineering and the Environment, Doctoral Thesis, 257pp.
Record type:
Thesis
(Doctoral)
Abstract
The characteristics of a cochlear implant (CI) filterbank determine the coding of spectral and temporal information in it. Hence, it is important to optimise the filterbank parameters to achieve optimal benefit in CI users. The present thesis aimed at modelling how the manipulation of the filterbank analysis length and the assignment of spectral channels may effect CI speech perception, using CI acoustical simulation techniques. Investigations were carried out to study the efficacy of providing additional spectral information in low and/or mid frequency channels using a longer filterbank analysis window, with respect to CI processed speech perception in various types of background noise. However, the increase of filterbank analysis length has an associated trade-off, which is a reduction in temporal information. Only a few CI acoustic simulations studies have modelled the characteristics of the FFT filterbank, the most commonly used filterbank in commercial CI processors. An initial experiment was carried out to validate the CI acoustical simulation technique used in the present thesis that implemented an FFT filterbank analysis. Next, the effect of a reduction in temporal information with the increase of the FFT analysis window length was studied. A filterbank with 16 ms analysis window, without the implementation of its finer spectral coding abilities, performed marginally poorer to that of a 4 ms analysis window in a sentence recognition test. The finer spectral coding abilities of the filterbank with 16 ms analysis window, when implemented, revealed that CI processed speech perception in noise could be significantly improved if additional spectral information is provided in the low and mid frequencies. The assignment of additional spectral channels to the low and mid frequencies led to a corresponding reduction in spectral channels assigned to high frequencies. However, no detrimental effect in speech perception was observed as long as at least two spectral channels represented information above 3 kHz. The assignment of additionallow and mid frequency spectral channels also led to significant levels of spectral shift.
The significant benefits from additional low and mid frequency information, however, were lost when the effects of spectral shift were introduced in acute experiments, without any training or acclimatisation period. The findings of the present thesis highlight that a longer filterbank analysis, such as 16 ms, may be implemented in CI devices without the fear of any perceptual cost due to a reduction in temporal information, at least for tasks that do not require talker separation. Providing additional low and mid frequency spectral information with a longer filterbank analysis has the potential to improve CI speech perception. However, to obtain potential benefits, the effects of spectral shift should be overcome. The findings of this thesis, however, need to be interpreted considering the limitations of CI acoustical simulation experiments.
Text
Shibasis Chowdhury_PhD E-Thesis_Modelling the Effects of Cochlear Implant Filterbank Characteristics on Speech Perception.pdf
- Other
Restricted to Repository staff only
More information
Published date: August 2013
Organisations:
University of Southampton, Inst. Sound & Vibration Research
Identifiers
Local EPrints ID: 366252
URI: http://eprints.soton.ac.uk/id/eprint/366252
PURE UUID: e4f545de-b85a-42c0-adfa-d3812c010d29
Catalogue record
Date deposited: 26 Jun 2014 11:57
Last modified: 14 Mar 2024 17:05
Export record
Contributors
Author:
Shibasis Chowdhury
Download statistics
Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.
View more statistics