University of Southampton Institutional Repository

H-Semantics: a hybrid approach to singing voice separation

Sofianos, Stratis, Ariyaeeinia, Aladdin, Polfreman, Richard and Sotudeh, Reza (2012) H-Semantics: a hybrid approach to singing voice separation. Journal of the Audio Engineering Society, 60 (10), 831-841.

Record type: Article

Abstract

Separating the singing voice from accompanying instruments is important in music information retrieval systems because it enables applications such as melody extraction, lyrics recognition, and singer identification. The authors investigate an effective method for unsupervised separation of the singing voice, called H-Semantics (Hybrid Singing Extraction through Multiband Amplitude Enhanced Thresholding and Independent Component Subtraction). The proposed method adds time-domain separation to previous work that was based on frequency-domain cepstral methods. The results indicate an improvement of approximately 8.5 dB in signal-to-distortion ratio over the baseline.

Text
JAES_V60_10_PG831.pdf - Version of Record
Restricted to Registered users only

More information

Published date: October 2012
Organisations: Faculty of Humanities

Identifiers

Local EPrints ID: 353228
URI: http://eprints.soton.ac.uk/id/eprint/353228
ISSN: 1549-4950
PURE UUID: 63f63a23-34b9-498f-9a1b-626212790f9a

Catalogue record

Date deposited: 03 Jun 2013 08:15
Last modified: 14 Mar 2024 14:03

Contributors

Author: Stratis Sofianos
Author: Aladdin Ariyaeeinia
Author: Richard Polfreman
Author: Reza Sotudeh
