The University of Southampton
University of Southampton Institutional Repository

Reducing the algorithmic variability in transcriptome-based inference

Reducing the algorithmic variability in transcriptome-based inference
Reducing the algorithmic variability in transcriptome-based inference
Motivation: High-throughput measurements of mRNA abundances from microarrays involve several stages of preprocessing. At each stage, a user has access to a large number of algorithms with no universally agreed guidance on which of these to use. We show that binary representations of gene expressions, retaining only information on whether a gene is expressed or not, reduces the variability in results caused by algorithmic choice, while also improving the quality of inference drawn from microarray studies. Results: Binary representation of transcriptome data has the desirable property of reducing the variability introduced at the preprocessing stages due to algorithmic choice. We compare the effect of the choice of algorithms on different problems and suggest that using binary representation of microarray data with Tanimoto kernel for support vector machine reduces the effect of the choice of algorithm and simultaneously improves the performance of classification of phenotypes. Contact: mn@ecs.soton.ac.uk
1367-4803
1185-1191
Tuna, Salih
10b3ffcd-3ed8-4bd5-987a-4d26946d685d
Niranjan, Mahesan
5cbaeea8-7288-4b55-a89c-c43d212ddd4f
Tuna, Salih
10b3ffcd-3ed8-4bd5-987a-4d26946d685d
Niranjan, Mahesan
5cbaeea8-7288-4b55-a89c-c43d212ddd4f

Tuna, Salih and Niranjan, Mahesan (2010) Reducing the algorithmic variability in transcriptome-based inference. Bioinformatics, 26 (9), 1185-1191.

Record type: Article

Abstract

Motivation: High-throughput measurements of mRNA abundances from microarrays involve several stages of preprocessing. At each stage, a user has access to a large number of algorithms with no universally agreed guidance on which of these to use. We show that binary representations of gene expressions, retaining only information on whether a gene is expressed or not, reduces the variability in results caused by algorithmic choice, while also improving the quality of inference drawn from microarray studies. Results: Binary representation of transcriptome data has the desirable property of reducing the variability introduced at the preprocessing stages due to algorithmic choice. We compare the effect of the choice of algorithms on different problems and suggest that using binary representation of microarray data with Tanimoto kernel for support vector machine reduces the effect of the choice of algorithm and simultaneously improves the performance of classification of phenotypes. Contact: mn@ecs.soton.ac.uk

Text
tuna_niranjan_bioinformatics.pdf - Accepted Manuscript
Restricted to Registered users only
Download (269kB)
Request a copy

More information

Published date: May 2010
Organisations: Southampton Wireless Group

Identifiers

Local EPrints ID: 271141
URI: http://eprints.soton.ac.uk/id/eprint/271141
ISSN: 1367-4803
PURE UUID: 87022313-5d36-454d-ad88-e0c0b907be6e
ORCID for Mahesan Niranjan: ORCID iD orcid.org/0000-0001-7021-140X

Catalogue record

Date deposited: 21 May 2010 11:48
Last modified: 15 Mar 2024 03:29

Export record

Contributors

Author: Salih Tuna
Author: Mahesan Niranjan ORCID iD

Download statistics

Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.

View more statistics

Atom RSS 1.0 RSS 2.0

Contact ePrints Soton: eprints@soton.ac.uk

ePrints Soton supports OAI 2.0 with a base URL of http://eprints.soton.ac.uk/cgi/oai2

This repository has been built using EPrints software, developed at the University of Southampton, but available to everyone to use.

We use cookies to ensure that we give you the best experience on our website. If you continue without changing your settings, we will assume that you are happy to receive cookies on the University of Southampton website.

×