The University of Southampton
University of Southampton Institutional Repository

Uses of the pitch-scaled harmonic filter in speech processing

Jackson, P.J.B. and Shadle, C.H. (2001) Uses of the pitch-scaled harmonic filter in speech processing. Proceedings of the Institute of Acoustics, Workshop on Innovation in Speech Processing 2001. pp. 309-321.

Record type: Conference or Workshop Item (Other)

Abstract

The pitch-scaled harmonic filter (PSHF) is a technique for decomposing speech signals into their periodic and aperiodic constituents, during periods of phonation. In this paper, the use of the PSHF for speech analysis and processing tasks is described. The periodic component can be used as an estimate of the part attributable to voicing, and the aperiodic component can act as an estimate of that attributable to turbulence noise, i.e., from fricative, aspiration and plosive sources. Here we present the algorithm for separating the periodic and aperiodic components from the pitch-scaled Fourier transform of a short section of speech, and show how to derive signals suitable for time-series analysis and for spectral analysis. These components can then be processed in a manner appropriate to their source type, for instance, extracting zeros as well as poles from the aperiodic spectral envelope. A summary of tests on synthetic speech-like signals demonstrates the robustness of the PSHF's performance to perturbations from additive noise, jitter and shimmer. Examples are given of speech analysed in various ways: power spectrum, short-time power and short-time harmonics-to-noise ratio, linear prediction and mel-frequency cepstral coefficients. Besides being valuable for speech production and perception studies, the latter two analyses show potential for incorporation into speech coding and speech recognition systems. Further uses of the PSHF are revealing normally-obscured acoustic features, exploring interactions of turbulence-noise sources with voicing, and pre-processing speech to enhance subsequent operations.
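
The core idea in the abstract, separating periodic and aperiodic parts from a pitch-scaled Fourier transform, can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes that the frame spans exactly a whole number of pitch periods (here 4, an illustrative choice), so that the harmonics of the fundamental fall on every 4th DFT bin; it uses a plain rectangular frame and omits any windowing or compensation steps the paper may apply. The function names `split_frame` and `hnr_db` are hypothetical.

```python
import cmath
import math


def dft(x):
    """Naive O(n^2) discrete Fourier transform; adequate for short frames."""
    n = len(x)
    return [sum(x[t] * cmath.exp(-2j * math.pi * k * t / n) for t in range(n))
            for k in range(n)]


def idft_real(spectrum):
    """Inverse DFT, returning only the real part of each sample."""
    n = len(spectrum)
    return [sum(spectrum[k] * cmath.exp(2j * math.pi * k * t / n)
                for k in range(n)).real / n
            for t in range(n)]


def split_frame(frame, periods=4):
    """Split a frame spanning exactly `periods` pitch periods.

    Bins whose index is a multiple of `periods` are treated as harmonic
    (periodic estimate); all remaining bins form the aperiodic estimate.
    This is a simplification made for illustration.
    """
    if len(frame) % periods != 0:
        raise ValueError("frame length must be a multiple of `periods`")
    spectrum = dft(frame)
    harmonic = [c if k % periods == 0 else 0j for k, c in enumerate(spectrum)]
    residual = [c - h for c, h in zip(spectrum, harmonic)]
    return idft_real(harmonic), idft_real(residual)


def hnr_db(periodic, aperiodic):
    """Harmonics-to-noise ratio of one frame, in decibels."""
    p = sum(v * v for v in periodic)
    a = sum(v * v for v in aperiodic)
    return math.inf if a == 0 else 10 * math.log10(p / a)
```

In this simplified form the two outputs sum back to the input frame, and the periodic estimate repeats with the assumed pitch period. Either output could then be passed to a further analysis of the kind the abstract lists, such as linear prediction.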

Text
wisp01JS.pdf - Other
Download (1MB)

More information

Published date: April 2001
Additional Information: Stratford-upon-Avon, UK, 2-3 April 2001. See http://web.bham.ac.uk/p.jackson/nephthys/ for further information. Organisation: Institute of Acoustics. Address: St Albans, UK
Venue - Dates: Proceedings of the Institute of Acoustics, Workshop on Innovation in Speech Processing 2001, 2001-04-01
Organisations: Electronics & Computer Science

Identifiers

Local EPrints ID: 255708
URI: http://eprints.soton.ac.uk/id/eprint/255708
ISBN: 1 901 656 35 7
PURE UUID: 5d28da53-bd31-4469-a256-2018c6381b34

Catalogue record

Date deposited: 04 Apr 2001
Last modified: 14 Mar 2024 05:33

Contributors

Author: P.J.B. Jackson
Author: C.H. Shadle
