The University of Southampton
University of Southampton Institutional Repository

Representations of fricatives in subcortical model responses: comparisons with human consonant perceptiona

Representations of fricatives in subcortical model responses: comparisons with human consonant perceptiona
Representations of fricatives in subcortical model responses: comparisons with human consonant perceptiona

Fricatives are obstruent sound contrasts made by airflow constrictions in the vocal tract that produce turbulence across the constriction or at a site downstream from the constriction. Fricatives exhibit significant intra/intersubject and contextual variability. Yet, fricatives are perceived with high accuracy. The current study investigated modeled neural responses to fricatives in the auditory nerve (AN) and inferior colliculus (IC) with the hypothesis that response profiles across populations of neurons provide robust correlates to consonant perception. Stimuli were 270 intervocalic fricatives (10 speakers × 9 fricatives × 3 utterances). Computational model response profiles had characteristic frequencies that were log-spaced from 125 Hz to 8 or 20 kHz to explore the impact of high-frequency responses. Confusion matrices generated by k-nearest-neighbor subspace classifiers were based on the profiles of average rates across characteristic frequencies as feature vectors. Model confusion matrices were compared with published behavioral data. The modeled AN and IC neural responses provided better predictions of behavioral accuracy than the stimulus spectra, and IC showed better accuracy than AN. Behavioral fricative accuracy was explained by modeled neural response profiles, whereas confusions were only partially explained. Extended frequencies improved accuracy based on the model IC, corroborating the importance of extended high frequencies in speech perception.

0001-4966
602-618
Hamza, Yasmeen
d6e729c6-e95c-4ae6-88ba-d4019648d205
Farhadi, Afagh
129ca0ec-dbef-452a-9172-ef1a28a82fd7
Schwarz, Douglas M.
211bef23-f543-442b-a200-69228269d226
McDonough, Joyce M.
43c8234c-93c6-406f-971c-3a8e21a8a96a
Carney, Laurel H.
fc2532cb-06f7-45ad-8ea7-b73f4e141db8
Hamza, Yasmeen
d6e729c6-e95c-4ae6-88ba-d4019648d205
Farhadi, Afagh
129ca0ec-dbef-452a-9172-ef1a28a82fd7
Schwarz, Douglas M.
211bef23-f543-442b-a200-69228269d226
McDonough, Joyce M.
43c8234c-93c6-406f-971c-3a8e21a8a96a
Carney, Laurel H.
fc2532cb-06f7-45ad-8ea7-b73f4e141db8

Hamza, Yasmeen, Farhadi, Afagh, Schwarz, Douglas M., McDonough, Joyce M. and Carney, Laurel H. (2023) Representations of fricatives in subcortical model responses: comparisons with human consonant perceptiona. Journal of the Acoustical Society of America, 154 (2), 602-618. (doi:10.1121/10.0020536).

Record type: Article

Abstract

Fricatives are obstruent sound contrasts made by airflow constrictions in the vocal tract that produce turbulence across the constriction or at a site downstream from the constriction. Fricatives exhibit significant intra/intersubject and contextual variability. Yet, fricatives are perceived with high accuracy. The current study investigated modeled neural responses to fricatives in the auditory nerve (AN) and inferior colliculus (IC) with the hypothesis that response profiles across populations of neurons provide robust correlates to consonant perception. Stimuli were 270 intervocalic fricatives (10 speakers × 9 fricatives × 3 utterances). Computational model response profiles had characteristic frequencies that were log-spaced from 125 Hz to 8 or 20 kHz to explore the impact of high-frequency responses. Confusion matrices generated by k-nearest-neighbor subspace classifiers were based on the profiles of average rates across characteristic frequencies as feature vectors. Model confusion matrices were compared with published behavioral data. The modeled AN and IC neural responses provided better predictions of behavioral accuracy than the stimulus spectra, and IC showed better accuracy than AN. Behavioral fricative accuracy was explained by modeled neural response profiles, whereas confusions were only partially explained. Extended frequencies improved accuracy based on the model IC, corroborating the importance of extended high frequencies in speech perception.

This record has no associated files available for download.

More information

Accepted/In Press date: 13 July 2023
Published date: 3 August 2023
Additional Information: Publisher Copyright: © 2023 Acoustical Society of America.

Identifiers

Local EPrints ID: 501553
URI: http://eprints.soton.ac.uk/id/eprint/501553
ISSN: 0001-4966
PURE UUID: 862968ce-f8ef-4b79-9189-2c8945efb3b4
ORCID for Yasmeen Hamza: ORCID iD orcid.org/0000-0003-3294-6629

Catalogue record

Date deposited: 03 Jun 2025 17:03
Last modified: 04 Jun 2025 02:11

Export record

Altmetrics

Contributors

Author: Yasmeen Hamza ORCID iD
Author: Afagh Farhadi
Author: Douglas M. Schwarz
Author: Joyce M. McDonough
Author: Laurel H. Carney

Download statistics

Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.

View more statistics

Atom RSS 1.0 RSS 2.0

Contact ePrints Soton: eprints@soton.ac.uk

ePrints Soton supports OAI 2.0 with a base URL of http://eprints.soton.ac.uk/cgi/oai2

This repository has been built using EPrints software, developed at the University of Southampton, but available to everyone to use.

We use cookies to ensure that we give you the best experience on our website. If you continue without changing your settings, we will assume that you are happy to receive cookies on the University of Southampton website.

×