The University of Southampton
University of Southampton Institutional Repository

Very low bit rate voice compression for mobile communications

Very low bit rate voice compression for mobile communications
Very low bit rate voice compression for mobile communications

This thesis is concerned with very low bit rate voice compression for mobile communications, concentrating exclusively on speech encoders beneath 4kbps, namely Prototype Waveform Interpolation (PWI), Multiband Excitation (MBE) and Sinusoidal Transform Coding (STC).

Specifically, a 1.9kbps PWI speech encoder was developed, which employed Zinc Function Excitation (ZFE) to represent the voiced speech. Analysis-by-synthesis was adopted to select the best ZFE for each voiced segment. The error sensitivity of this speech coder was assessed together with harnessing several ZFEs for each excitation optimization sub-segment, in order to create a higher speech quality, higher bit rate speech encoder.

Furthermore, several MBE speech encoders were developed, where the first version adopted simple pulse excitation with five MBE bands to create a 2.3kbps speech coder. Another speech encoder incorporated adds three MBE in the previously developed PWI-ZFE speech coder, in order to encode speech at 2.35kbps. The performance of MBE at a higher bit rate, where more frequency bands have been added, is also discussed. It was found that the PWI-ZFE speech coder with three MBE bands was preferred by 64.10% of listeners.

The third technique, namely STC was developed into a STC-PWI speech coder operating at 2.4 and 3.8kpbs. Analysis-by-synthesis was harnessed to determine the best Fourier coefficients for each segment.

A number of wavelet techniques were also examined in this thesis, which were employed in the process of pitch detection and voiced-unvoiced determination. A pitch detector with an overall pitch estimation error rate of 3.9% was developed.

University of Southampton
Brooks, Fiona Clare Angharad
Brooks, Fiona Clare Angharad

Brooks, Fiona Clare Angharad (1998) Very low bit rate voice compression for mobile communications. University of Southampton, Doctoral Thesis.

Record type: Thesis (Doctoral)

Abstract

This thesis is concerned with very low bit rate voice compression for mobile communications, concentrating exclusively on speech encoders beneath 4kbps, namely Prototype Waveform Interpolation (PWI), Multiband Excitation (MBE) and Sinusoidal Transform Coding (STC).

Specifically, a 1.9kbps PWI speech encoder was developed, which employed Zinc Function Excitation (ZFE) to represent the voiced speech. Analysis-by-synthesis was adopted to select the best ZFE for each voiced segment. The error sensitivity of this speech coder was assessed together with harnessing several ZFEs for each excitation optimization sub-segment, in order to create a higher speech quality, higher bit rate speech encoder.

Furthermore, several MBE speech encoders were developed, where the first version adopted simple pulse excitation with five MBE bands to create a 2.3kbps speech coder. Another speech encoder incorporated adds three MBE in the previously developed PWI-ZFE speech coder, in order to encode speech at 2.35kbps. The performance of MBE at a higher bit rate, where more frequency bands have been added, is also discussed. It was found that the PWI-ZFE speech coder with three MBE bands was preferred by 64.10% of listeners.

The third technique, namely STC was developed into a STC-PWI speech coder operating at 2.4 and 3.8kpbs. Analysis-by-synthesis was harnessed to determine the best Fourier coefficients for each segment.

A number of wavelet techniques were also examined in this thesis, which were employed in the process of pitch detection and voiced-unvoiced determination. A pitch detector with an overall pitch estimation error rate of 3.9% was developed.

This record has no associated files available for download.

More information

Published date: 1998

Identifiers

Local EPrints ID: 463688
URI: http://eprints.soton.ac.uk/id/eprint/463688
PURE UUID: bfc5c812-a229-40b2-ace8-ce6fa2ae8aed

Catalogue record

Date deposited: 04 Jul 2022 20:55
Last modified: 04 Jul 2022 20:55

Export record

Contributors

Author: Fiona Clare Angharad Brooks

Download statistics

Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.

View more statistics

Atom RSS 1.0 RSS 2.0

Contact ePrints Soton: eprints@soton.ac.uk

ePrints Soton supports OAI 2.0 with a base URL of http://eprints.soton.ac.uk/cgi/oai2

This repository has been built using EPrints software, developed at the University of Southampton, but available to everyone to use.

We use cookies to ensure that we give you the best experience on our website. If you continue without changing your settings, we will assume that you are happy to receive cookies on the University of Southampton website.

×