The University of Southampton
University of Southampton Institutional Repository

Pitch-scaled estimation of simultaneous voiced and turbulence-noise components in speech

Pitch-scaled estimation of simultaneous voiced and turbulence-noise components in speech
Pitch-scaled estimation of simultaneous voiced and turbulence-noise components in speech
Almost all speech contains simultaneous contributions from more than one acoustic source within the speaker's vocal tract. In this paper, we propose a method -- the pitch-scaled harmonic filter (PSHF) -- which aims to separate the voiced and turbulence-noise components of the speech signal during phonation, based on a maximum-likelihood approach. The PSHF outputs periodic and aperiodic components that are estimates of the respective contributions of the different types of acoustic source. It produces four reconstructed time series signals by decomposing the original speech signal, first, according to amplitude, and then according to power of the Fourier coefficients. Thus, one pair of periodic and aperiodic signals is optimized for subsequent time-series analysis, and another pair for spectral analysis. The performance of the PSHF algorithm was tested on synthetic signals, using three forms of disturbance (jitter, shimmer and additive noise), and the results were used to predict the performance on real speech. Processing recorded speech examples elicited latent features from the signals, demonstrating the PSHF's potential for analysis of mixed-source speech.
periodic-aperiodic decomposition, speech modification, speech preprocessing
713-726
Jackson, Philip J.B.
c658b148-ce3e-418d-ba12-9e970c9563dc
Shadle, Christine H.
d41e8c97-d8f9-41ca-a69a-4021e978f786
Jackson, Philip J.B.
c658b148-ce3e-418d-ba12-9e970c9563dc
Shadle, Christine H.
d41e8c97-d8f9-41ca-a69a-4021e978f786

Jackson, Philip J.B. and Shadle, Christine H. (2001) Pitch-scaled estimation of simultaneous voiced and turbulence-noise components in speech. IEEE Transactions on Speech and Audio Processing, 9 (7), 713-726.

Record type: Article

Abstract

Almost all speech contains simultaneous contributions from more than one acoustic source within the speaker's vocal tract. In this paper, we propose a method -- the pitch-scaled harmonic filter (PSHF) -- which aims to separate the voiced and turbulence-noise components of the speech signal during phonation, based on a maximum-likelihood approach. The PSHF outputs periodic and aperiodic components that are estimates of the respective contributions of the different types of acoustic source. It produces four reconstructed time series signals by decomposing the original speech signal, first, according to amplitude, and then according to power of the Fourier coefficients. Thus, one pair of periodic and aperiodic signals is optimized for subsequent time-series analysis, and another pair for spectral analysis. The performance of the PSHF algorithm was tested on synthetic signals, using three forms of disturbance (jitter, shimmer and additive noise), and the results were used to predict the performance on real speech. Processing recorded speech examples elicited latent features from the signals, demonstrating the PSHF's potential for analysis of mixed-source speech.

Text
ieeesap01.pdf - Other
Download (339kB)

More information

Published date: October 2001
Keywords: periodic-aperiodic decomposition, speech modification, speech preprocessing
Organisations: Electronics & Computer Science

Identifiers

Local EPrints ID: 259158
URI: http://eprints.soton.ac.uk/id/eprint/259158
PURE UUID: 4a0d72a4-4110-46f5-8c90-03bfb70e6608

Catalogue record

Date deposited: 14 Mar 2004
Last modified: 14 Mar 2024 06:20

Export record

Contributors

Author: Philip J.B. Jackson
Author: Christine H. Shadle

Download statistics

Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.

View more statistics

Atom RSS 1.0 RSS 2.0

Contact ePrints Soton: eprints@soton.ac.uk

ePrints Soton supports OAI 2.0 with a base URL of http://eprints.soton.ac.uk/cgi/oai2

This repository has been built using EPrints software, developed at the University of Southampton, but available to everyone to use.

We use cookies to ensure that we give you the best experience on our website. If you continue without changing your settings, we will assume that you are happy to receive cookies on the University of Southampton website.

×