The University of Southampton
University of Southampton Institutional Repository

Bayesian modelling of music: algorithmic advances and experimental studies of shift-invariant sparse coding

Blumensath, Thomas
470d9055-0373-457e-bf80-4389f8ec4ead
Davies, Mike
3f1c4097-ef54-4f66-a4b9-dbca705775a4

Blumensath, Thomas (2006) Bayesian modelling of music: algorithmic advances and experimental studies of shift-invariant sparse coding. University College London, Department of Electronic Engineering, Doctoral Thesis, 211pp.

Record type: Thesis (Doctoral)

Abstract

In order to perform many signal processing tasks such as classification,
pattern recognition and coding, it is helpful to specify a signal model in
terms of meaningful signal structures. In general, designing such a model
is complicated and for many signals it is not feasible to specify the appropriate
structure. Adaptive models overcome this problem by learning
structures from a set of signals. Such adaptive models need to be general
enough to represent relevant structures. However, more
general models often require additional constraints to guide the learning
procedure.

In this thesis a sparse coding model is used to model time-series. Relevant
features can often occur at arbitrary locations and the model has to be
able to reflect this uncertainty, which is achieved using a shift-invariant
sparse coding formulation. In order to learn model parameters, we use
Bayesian statistical methods. However, analytic solutions to this learning
problem are not available and approximations have to be introduced. In
this thesis we study three approximations, one based on an analytical
integral approximation and two based on Monte Carlo approximations.
Even with these approximations, a solution to the learning problem
is computationally too expensive for the applications under investigation.
Therefore, we introduce further approximations by subset selection.

Music signals are highly structured time-series and offer an ideal testbed
for the studied model. We show the emergence of note- and score-like features
from a polyphonic piano recording and compare the results to those
obtained with a different model suggested in the literature. Furthermore,
we show that the model finds structures that can be assigned to an individual
source in a mixture. This is shown with an example of a mixture
containing guitar and vocal parts for which blind source separation can
be performed based on the shift-invariant sparse coding model.
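
The shift-invariant formulation mentioned in the abstract can be illustrated with a minimal sketch. It assumes the common generative form in which a signal is a sum of short features, each convolved with its own sparse coefficient sequence, plus noise. The function name `synthesize`, the array shapes, and the sparsity and noise levels are illustrative assumptions and are not taken from the thesis; the sketch covers only the forward model, not the Bayesian learning procedure or its approximations.

```python
import numpy as np


def synthesize(features, coeffs):
    """Forward model of a shift-invariant sparse code (illustrative only).

    features: array of shape (n_features, feature_len), one short waveform per row.
    coeffs:   array of shape (n_features, n_positions), mostly zeros; a non-zero
              entry at column t places a scaled copy of that feature at time t.
    Returns the sum over features of coeffs[k] convolved with features[k].
    """
    n_samples = coeffs.shape[1] + features.shape[1] - 1
    signal = np.zeros(n_samples)
    for feature, activation in zip(features, coeffs):
        # Full convolution: each active coefficient adds a shifted, scaled feature.
        signal += np.convolve(activation, feature)
    return signal


# Hypothetical toy data: three unit-norm random features, about 5% active coefficients.
rng = np.random.default_rng(0)
n_features, feature_len, n_positions = 3, 8, 64
features = rng.standard_normal((n_features, feature_len))
features /= np.linalg.norm(features, axis=1, keepdims=True)

coeffs = np.zeros((n_features, n_positions))
active = rng.random(coeffs.shape) < 0.05
coeffs[active] = rng.standard_normal(active.sum())

signal = synthesize(features, coeffs)
# Assumed additive Gaussian observation noise.
noisy = signal + 0.01 * rng.standard_normal(signal.shape)
```

In this sketch, moving a non-zero coefficient by one position moves the corresponding feature by one sample in the output, which is the sense in which the representation lets a feature occur at an arbitrary location.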

More information

Published date: 2006
Organisations: Signal Processing & Control Grp

Identifiers

Local EPrints ID: 142461
URI: http://eprints.soton.ac.uk/id/eprint/142461
PURE UUID: a23b7682-cf5b-4639-bf13-98f934caa542
ORCID for Thomas Blumensath: orcid.org/0000-0002-7489-265X

Catalogue record

Date deposited: 11 Jun 2010 14:29
Last modified: 14 Mar 2024 02:55

Contributors

Thesis advisor: Mike Davies
