Reducing the algorithmic variability in transcriptome-based inference
Reducing the algorithmic variability in transcriptome-based inference
Motivation: High-throughput measurements of mRNA abundances from microarrays involve several stages of preprocessing. At each stage, a user has access to a large number of algorithms with no universally agreed guidance on which of these to use. We show that binary representations of gene expressions, retaining only information on whether a gene is expressed or not, reduces the variability in results caused by algorithmic choice, while also improving the quality of inference drawn from microarray studies. Results: Binary representation of transcriptome data has the desirable property of reducing the variability introduced at the preprocessing stages due to algorithmic choice. We compare the effect of the choice of algorithms on different problems and suggest that using binary representation of microarray data with Tanimoto kernel for support vector machine reduces the effect of the choice of algorithm and simultaneously improves the performance of classification of phenotypes. Contact: mn@ecs.soton.ac.uk
1185-1191
Tuna, Salih
10b3ffcd-3ed8-4bd5-987a-4d26946d685d
Niranjan, Mahesan
5cbaeea8-7288-4b55-a89c-c43d212ddd4f
May 2010
Tuna, Salih
10b3ffcd-3ed8-4bd5-987a-4d26946d685d
Niranjan, Mahesan
5cbaeea8-7288-4b55-a89c-c43d212ddd4f
Tuna, Salih and Niranjan, Mahesan
(2010)
Reducing the algorithmic variability in transcriptome-based inference.
Bioinformatics, 26 (9), .
Abstract
Motivation: High-throughput measurements of mRNA abundances from microarrays involve several stages of preprocessing. At each stage, a user has access to a large number of algorithms with no universally agreed guidance on which of these to use. We show that binary representations of gene expressions, retaining only information on whether a gene is expressed or not, reduces the variability in results caused by algorithmic choice, while also improving the quality of inference drawn from microarray studies. Results: Binary representation of transcriptome data has the desirable property of reducing the variability introduced at the preprocessing stages due to algorithmic choice. We compare the effect of the choice of algorithms on different problems and suggest that using binary representation of microarray data with Tanimoto kernel for support vector machine reduces the effect of the choice of algorithm and simultaneously improves the performance of classification of phenotypes. Contact: mn@ecs.soton.ac.uk
Text
tuna_niranjan_bioinformatics.pdf
- Accepted Manuscript
Restricted to Registered users only
Request a copy
More information
Published date: May 2010
Organisations:
Southampton Wireless Group
Identifiers
Local EPrints ID: 271141
URI: http://eprints.soton.ac.uk/id/eprint/271141
ISSN: 1367-4803
PURE UUID: 87022313-5d36-454d-ad88-e0c0b907be6e
Catalogue record
Date deposited: 21 May 2010 11:48
Last modified: 15 Mar 2024 03:29
Export record
Contributors
Author:
Salih Tuna
Author:
Mahesan Niranjan
Download statistics
Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.
View more statistics