Speech spectral amplitude estimators using optimally shaped Gamma and Chi priors
Pages: 1-14
Andrianakis, L.
White, P.R.
Andrianakis, L. and White, P.R. (2009) Speech spectral amplitude estimators using optimally shaped Gamma and Chi priors. Speech Communication, 51 (1), 1-14. (doi:10.1016/j.specom.2008.05.018).
Abstract
In this paper, four STFT-based speech enhancement algorithms are proposed. The algorithms enhance speech by estimating its short-time spectral amplitude and are combinations of two estimators (MMSE and MAP) with two speech spectral amplitude priors (Gamma and Chi). The proposed priors have a shape parameter a, whose effect on the quality of the enhanced speech is a focal point of our investigation. Rather than using a priori estimated values of a, we seek those values that maximise the quality of the enhanced speech, in an a posteriori fashion. The performance of the algorithms is first evaluated as a function of the shape parameter a, and optimal values are then sought by means of a formal subjective listening test. Finally, the parallel examination of the four algorithms offers insight into the relative importance of the employed priors and estimators, as the proposed algorithms differ only in these two elements.
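To illustrate how an estimator (MMSE or MAP) can be paired with a shaped amplitude prior (Gamma or Chi), the following is a minimal numerical sketch and not the closed-form estimators derived in the paper. Several things here are assumptions made for illustration: the prior parameterisations (both scaled so that E[A^2] equals the speech variance `sigma_s2`), the complex Gaussian noise model with uniform phase, the grid-based evaluation of the posterior, and all function and parameter names. The MAP value is taken as the mode of the amplitude posterior on the grid, which may differ from the definition used by the authors.

```python
import math
import numpy as np


def log_prior(A, a, sigma_s2, prior="chi"):
    """Log-density of an amplitude prior with shape a and E[A^2] = sigma_s2.

    Both parameterisations are assumptions made for this sketch.
    """
    if prior == "chi":
        # p(A) = 2 (a/s)^a / Gamma(a) * A^(2a-1) * exp(-a A^2 / s); a = 1 is Rayleigh
        return (math.log(2.0) + a * math.log(a / sigma_s2) - math.lgamma(a)
                + (2.0 * a - 1.0) * np.log(A) - a * A**2 / sigma_s2)
    if prior == "gamma":
        # p(A) = A^(a-1) exp(-A/theta) / (Gamma(a) theta^a), E[A^2] = a (a+1) theta^2
        theta = math.sqrt(sigma_s2 / (a * (a + 1.0)))
        return ((a - 1.0) * np.log(A) - A / theta
                - math.lgamma(a) - a * math.log(theta))
    raise ValueError("prior must be 'chi' or 'gamma'")


def estimate_amplitude(R, sigma_s2, sigma_n2, a, prior="chi",
                       n_amp=4000, n_phase=512):
    """Return (MMSE, MAP) estimates of the clean amplitude A given noisy amplitude R."""
    # Amplitude grid; the likelihood is negligible beyond R + 8 noise standard deviations.
    A = np.linspace(1e-6, R + 8.0 * math.sqrt(sigma_n2), n_amp)
    # Marginalise the unknown phase difference numerically (uniform phase assumed).
    phi = np.linspace(0.0, 2.0 * np.pi, n_phase, endpoint=False)
    phase_avg = np.mean(
        np.exp(-2.0 * R * A[:, None] * (1.0 - np.cos(phi)[None, :]) / sigma_n2),
        axis=1,
    )
    log_like = -(R - A) ** 2 / sigma_n2 + np.log(phase_avg)
    log_post = log_prior(A, a, sigma_s2, prior) + log_like
    w = np.exp(log_post - log_post.max())    # unnormalised posterior on the grid
    mmse = float(np.sum(A * w) / np.sum(w))  # posterior mean (uniform grid)
    map_ = float(A[np.argmax(w)])            # posterior mode on the grid
    return mmse, map_


if __name__ == "__main__":
    for prior in ("chi", "gamma"):
        for a in (1.0, 2.0):
            mmse, map_ = estimate_amplitude(2.0, 1.0, 1.0, a, prior)
            print(f"{prior:5s} a={a:.1f}  MMSE={mmse:.3f}  MAP={map_:.3f}")
```

Under these assumptions, sweeping `a` for a fixed noisy amplitude shows how the shape parameter changes the amount of attenuation each estimator applies. With the Chi prior and `a = 1` the prior reduces to a Rayleigh density, so the MMSE value can be checked against the classical Gaussian-model amplitude estimator.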
More information
Published date: January 2009
Keywords:
speech enhancement, MMSE, MAP, gamma, chi
Identifiers
Local EPrints ID: 150037
URI: http://eprints.soton.ac.uk/id/eprint/150037
ISSN: 0167-6393
PURE UUID: 971c9ebf-a6b2-45ba-bf31-472454b8b45d
Catalogue record
Date deposited: 04 May 2010 09:36
Last modified: 11 Jul 2024 01:33
Contributors
Author: L. Andrianakis
Author: P.R. White