Uses of the pitch-scaled harmonic filter in speech processing
Uses of the pitch-scaled harmonic filter in speech processing
The pitch-scaled harmonic filter (PSHF) is a technique for decomposing speech signals into their periodic and aperiodic constituents, during periods of phonation. In this paper, the use of the PSHF for speech analysis and processing tasks is described. The periodic component can be used as an estimate of the part attributable to voicing, and the aperiodic component can act as an estimate of that attributable to turbulence noise, i.e., from fricative, aspiration and plosive sources. Here we present the algorithm for separating the periodic and aperiodic components from the pitch-scaled Fourier transform of a short section of speech, and show how to derive signals suitable for time-series analysis and for spectral analysis. These components can then be processed in a manner appropriate to their source type, for instance, extracting zeros as well as poles from the aperiodic spectral envelope. A summary of tests on synthetic speech-like signals demonstrates the robustness of the PSHF's performance to perturbations from additive noise, jitter and shimmer. Examples are given of speech analysed in various ways: power spectrum, short-time power and short-time harmonics-to-noise ratio, linear prediction and mel-frequency cepstral coefficients. Besides being valuable for speech production and perception studies, the latter two analyses show potential for incorporation into speech coding and speech recognition systems. Further uses of the PSHF are revealing normally-obscured acoustic features, exploring interactions of turbulence-noise sources with voicing, and pre-processing speech to enhance subsequent operations.
1 901 656 35 7
309-321
Jackson, P.J.B.
81dc3458-f913-44b4-9829-ecb626df5278
Shadle, C.H.
dc56253d-9926-466f-a27c-b9a8252a5304
April 2001
Jackson, P.J.B.
81dc3458-f913-44b4-9829-ecb626df5278
Shadle, C.H.
dc56253d-9926-466f-a27c-b9a8252a5304
Jackson, P.J.B. and Shadle, C.H.
(2001)
Uses of the pitch-scaled harmonic filter in speech processing.
Proceedings of the Institute of Acoustics, Workshop on Innovation in Speech Processing 2001.
.
Record type:
Conference or Workshop Item
(Other)
Abstract
The pitch-scaled harmonic filter (PSHF) is a technique for decomposing speech signals into their periodic and aperiodic constituents, during periods of phonation. In this paper, the use of the PSHF for speech analysis and processing tasks is described. The periodic component can be used as an estimate of the part attributable to voicing, and the aperiodic component can act as an estimate of that attributable to turbulence noise, i.e., from fricative, aspiration and plosive sources. Here we present the algorithm for separating the periodic and aperiodic components from the pitch-scaled Fourier transform of a short section of speech, and show how to derive signals suitable for time-series analysis and for spectral analysis. These components can then be processed in a manner appropriate to their source type, for instance, extracting zeros as well as poles from the aperiodic spectral envelope. A summary of tests on synthetic speech-like signals demonstrates the robustness of the PSHF's performance to perturbations from additive noise, jitter and shimmer. Examples are given of speech analysed in various ways: power spectrum, short-time power and short-time harmonics-to-noise ratio, linear prediction and mel-frequency cepstral coefficients. Besides being valuable for speech production and perception studies, the latter two analyses show potential for incorporation into speech coding and speech recognition systems. Further uses of the PSHF are revealing normally-obscured acoustic features, exploring interactions of turbulence-noise sources with voicing, and pre-processing speech to enhance subsequent operations.
Text
wisp01JS.pdf
- Other
More information
Published date: April 2001
Additional Information:
Stratford-upon-Avon, UK, 2-3 April 2001. <br>See <a href="http://web.bham.ac.uk/p.jackson/nephthys/"> http://web.bham.ac.uk/p.jackson/nephthys/</a>for further information. Organisation: Institute of Acoustics Address: St Albans, UK
Venue - Dates:
Proceedings of the Institute of Acoustics, Workshop on Innovation in Speech Processing 2001, 2001-04-01
Organisations:
Electronics & Computer Science
Identifiers
Local EPrints ID: 255708
URI: http://eprints.soton.ac.uk/id/eprint/255708
ISBN: 1 901 656 35 7
PURE UUID: 5d28da53-bd31-4469-a256-2018c6381b34
Catalogue record
Date deposited: 04 Apr 2001
Last modified: 14 Mar 2024 05:33
Export record
Contributors
Author:
P.J.B. Jackson
Author:
C.H. Shadle
Download statistics
Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.
View more statistics