H-Semantics: a hybrid approach to singing voice separation
H-Semantics: a hybrid approach to singing voice separation
Separating the singing voice from accompanying instruments is important in music information-retrieval systems, since it allows for such applications as melody extraction, lyrics recognition, and singer identity. The authors investigate effective methods for unsupervised separation of the singing voice, called H-Semantics (Hybrid Singing Extraction through Multiband Amplitude Enhanced Thresholding and Independent Component Subtraction). The proposed method adds time-domain separation to the previous work that was based on frequency-domain cepstral methods. The results indicate separation of approximately 8.5 dB signal-to-distortion ratio over the baseline
831-841
Sofianos, Stratis
ad0612ce-cade-4ffe-be7b-9377ef9471cc
Ariyaeeinia, Aladdin
49029cf8-701f-4170-a564-0d0e5e7c5574
Polfreman, Richard
26424c3d-b750-4868-bf6e-2bbb3990df84
Sotudeh, Reza
6dd65ec4-67db-47d1-b57b-7ab174af8d21
October 2012
Sofianos, Stratis
ad0612ce-cade-4ffe-be7b-9377ef9471cc
Ariyaeeinia, Aladdin
49029cf8-701f-4170-a564-0d0e5e7c5574
Polfreman, Richard
26424c3d-b750-4868-bf6e-2bbb3990df84
Sotudeh, Reza
6dd65ec4-67db-47d1-b57b-7ab174af8d21
Sofianos, Stratis, Ariyaeeinia, Aladdin, Polfreman, Richard and Sotudeh, Reza
(2012)
H-Semantics: a hybrid approach to singing voice separation.
Journal of the Audio Engineering Society, 60 (10), .
Abstract
Separating the singing voice from accompanying instruments is important in music information-retrieval systems, since it allows for such applications as melody extraction, lyrics recognition, and singer identity. The authors investigate effective methods for unsupervised separation of the singing voice, called H-Semantics (Hybrid Singing Extraction through Multiband Amplitude Enhanced Thresholding and Independent Component Subtraction). The proposed method adds time-domain separation to the previous work that was based on frequency-domain cepstral methods. The results indicate separation of approximately 8.5 dB signal-to-distortion ratio over the baseline
Text
JAES_V60_10_PG831.pdf
- Version of Record
Restricted to Registered users only
Request a copy
More information
Published date: October 2012
Organisations:
Faculty of Humanities
Identifiers
Local EPrints ID: 353228
URI: http://eprints.soton.ac.uk/id/eprint/353228
ISSN: 1549-4950
PURE UUID: 63f63a23-34b9-498f-9a1b-626212790f9a
Catalogue record
Date deposited: 03 Jun 2013 08:15
Last modified: 14 Mar 2024 14:03
Export record
Contributors
Author:
Stratis Sofianos
Author:
Aladdin Ariyaeeinia
Author:
Reza Sotudeh
Download statistics
Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.
View more statistics