The University of Southampton
University of Southampton Institutional Repository

Speech spectral amplitude estimators using optimally shaped Gamma and Chi priors

Speech spectral amplitude estimators using optimally shaped Gamma and Chi priors
Speech spectral amplitude estimators using optimally shaped Gamma and Chi priors
In this paper, four STFT based speech enhancement algorithms are proposed. The algorithms enhance speech by estimating its short time spectral amplitude and are combinations of two estimators (MMSE and MAP) with two speech spectral amplitude priors (Gamma and Chi). The proposed priors have a shape parameter a, whose effect on the quality of speech is a focal point of our investigation. Rather than using a priori estimated values of a, we seek those values that maximise the quality of the enhanced speech, in an a posteriori fashion. The performance of the algorithms is first evaluated as a function of the shape parameter a and optimal values are then sought by means of a formal subjective listening test. Finally, the parallel examination of four speech enhancement algorithms offers an insight into the relative importance of the employed priors and estimators, as the proposed algorithms are only different with respect to these two elements.
speech enhancement, MMSE, MAP, gamma, chi
0167-6393
1-14
Andrianakis, L.
6e53c0cc-763c-4dd5-9d5f-d68544c85bb2
White, P.R.
2dd2477b-5aa9-42e2-9d19-0806d994eaba
Andrianakis, L.
6e53c0cc-763c-4dd5-9d5f-d68544c85bb2
White, P.R.
2dd2477b-5aa9-42e2-9d19-0806d994eaba

Andrianakis, L. and White, P.R. (2009) Speech spectral amplitude estimators using optimally shaped Gamma and Chi priors. Speech Communication, 51 (1), 1-14. (doi:10.1016/j.specom.2008.05.018).

Record type: Article

Abstract

In this paper, four STFT based speech enhancement algorithms are proposed. The algorithms enhance speech by estimating its short time spectral amplitude and are combinations of two estimators (MMSE and MAP) with two speech spectral amplitude priors (Gamma and Chi). The proposed priors have a shape parameter a, whose effect on the quality of speech is a focal point of our investigation. Rather than using a priori estimated values of a, we seek those values that maximise the quality of the enhanced speech, in an a posteriori fashion. The performance of the algorithms is first evaluated as a function of the shape parameter a and optimal values are then sought by means of a formal subjective listening test. Finally, the parallel examination of four speech enhancement algorithms offers an insight into the relative importance of the employed priors and estimators, as the proposed algorithms are only different with respect to these two elements.

This record has no associated files available for download.

More information

Published date: January 2009
Keywords: speech enhancement, MMSE, MAP, gamma, chi

Identifiers

Local EPrints ID: 150037
URI: http://eprints.soton.ac.uk/id/eprint/150037
ISSN: 0167-6393
PURE UUID: 971c9ebf-a6b2-45ba-bf31-472454b8b45d
ORCID for P.R. White: ORCID iD orcid.org/0000-0002-4787-8713

Catalogue record

Date deposited: 04 May 2010 09:36
Last modified: 14 Mar 2024 02:34

Export record

Altmetrics

Contributors

Author: L. Andrianakis
Author: P.R. White ORCID iD

Download statistics

Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.

View more statistics

Atom RSS 1.0 RSS 2.0

Contact ePrints Soton: eprints@soton.ac.uk

ePrints Soton supports OAI 2.0 with a base URL of http://eprints.soton.ac.uk/cgi/oai2

This repository has been built using EPrints software, developed at the University of Southampton, but available to everyone to use.

We use cookies to ensure that we give you the best experience on our website. If you continue without changing your settings, we will assume that you are happy to receive cookies on the University of Southampton website.

×