The University of Southampton
University of Southampton Institutional Repository

Speaker identification using auditory modelling and vector quantization

Speaker identification using auditory modelling and vector quantization
Speaker identification using auditory modelling and vector quantization
This paper presents an experimental evaluation of different features for use in speaker identification (SID). The features are tested using speech data provided by the EUROM1 database, in a text-independent closed-set speaker identification task. The main objective of the paper is to present a novel parameterization of speech that is based on an auditory model called Auditory Image Model (AIM). This model provides features of the speech signal and their utility is assessed in the context of speaker identification. In order to explore the features that are more informative for predicting a speaker’s identity, the auditory image is used within the framework of cutting it into rectangles. Then, a novel strategy is incorporated for the enrolment of speakers, which is used for specifying the regions of the image that contain features that make a speaker discriminative. Afterwards, the new speaker-specific feature representation is assessed in noisy conditions that simulate a real-world environment. Their performance is compared with the results obtained adopting MFCC features in the context of a Vector Quantization (VQ) classification system. The results for the identification accuracy suggest that the new parameterization provides better results compared to conventional MFCCs especially for low SNRs.
1240-4543
283-303
Iliadi, Konstantina
ed728e5b-c03f-427e-bbd5-39ca7330acb9
Bleeck, Stefan
c888ccba-e64c-47bf-b8fa-a687e87ec16c
Iliadi, Konstantina
ed728e5b-c03f-427e-bbd5-39ca7330acb9
Bleeck, Stefan
c888ccba-e64c-47bf-b8fa-a687e87ec16c

Iliadi, Konstantina and Bleeck, Stefan (2017) Speaker identification using auditory modelling and vector quantization. Advances in Modelling and Analysis B, 60 (2), 283-303.

Record type: Article

Abstract

This paper presents an experimental evaluation of different features for use in speaker identification (SID). The features are tested using speech data provided by the EUROM1 database, in a text-independent closed-set speaker identification task. The main objective of the paper is to present a novel parameterization of speech that is based on an auditory model called Auditory Image Model (AIM). This model provides features of the speech signal and their utility is assessed in the context of speaker identification. In order to explore the features that are more informative for predicting a speaker’s identity, the auditory image is used within the framework of cutting it into rectangles. Then, a novel strategy is incorporated for the enrolment of speakers, which is used for specifying the regions of the image that contain features that make a speaker discriminative. Afterwards, the new speaker-specific feature representation is assessed in noisy conditions that simulate a real-world environment. Their performance is compared with the results obtained adopting MFCC features in the context of a Vector Quantization (VQ) classification system. The results for the identification accuracy suggest that the new parameterization provides better results compared to conventional MFCCs especially for low SNRs.

Text
Speaker Identification Iliada and Bleeck - final draft - Accepted Manuscript
Download (875kB)

More information

Accepted/In Press date: 25 September 2017
e-pub ahead of print date: 25 December 2017
Published date: 2017

Identifiers

Local EPrints ID: 418223
URI: http://eprints.soton.ac.uk/id/eprint/418223
ISSN: 1240-4543
PURE UUID: 29d52f59-12c5-428f-bcb0-1675db44f720
ORCID for Stefan Bleeck: ORCID iD orcid.org/0000-0003-4378-3394

Catalogue record

Date deposited: 23 Feb 2018 17:30
Last modified: 16 Mar 2024 03:49

Export record

Contributors

Author: Konstantina Iliadi
Author: Stefan Bleeck ORCID iD

Download statistics

Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.

View more statistics

Atom RSS 1.0 RSS 2.0

Contact ePrints Soton: eprints@soton.ac.uk

ePrints Soton supports OAI 2.0 with a base URL of http://eprints.soton.ac.uk/cgi/oai2

This repository has been built using EPrints software, developed at the University of Southampton, but available to everyone to use.

We use cookies to ensure that we give you the best experience on our website. If you continue without changing your settings, we will assume that you are happy to receive cookies on the University of Southampton website.

×