Characterization of data analysis methods for information recovery from metabolic 1 H NMR spectra using artificial complex mixtures
Characterization of data analysis methods for information recovery from metabolic 1 H NMR spectra using artificial complex mixtures
The assessment of data analysis methods in 1H NMR based metabolic profiling is hampered owing to a lack of knowledge of the exact sample composition. In this study, an artificial complex mixture design comprising two artificially defined groups designated normal and disease, each containing 30 samples, was implemented using 21 metabolites at concentrations typically found in human urine and having a realistic distribution of inter-metabolite correlations. These artificial mixtures were profiled by 1H NMR spectroscopy and used to assess data analytical methods in the task of differentiating the two conditions. When metabolites were individually quantified, volcano plots provided an excellent method to track the effect size and significance of the change between conditions. Interestingly, the Welch t test detected a similar set of metabolites changing between classes in both quantified and spectral data, suggesting that differential analysis of 1H NMR spectra using a false discovery rate correction, taking into account fold changes, is a reliable approach to detect differential metabolites in complex mixture studies. Various multivariate regression methods based on partial least squares (PLS) were applied in discriminant analysis mode. The most reliable methods in quantified and spectral 1H NMR data were PLS and RPLS linear and logistic regression respectively. A jackknife based strategy for variable selection was assessed on both quantified and spectral data and results indicate that it may be possible to improve on the conventional Orthogonal-PLS methodology in terms of accuracy and sensitivity. A key improvement of our approach consists of objective criteria to select significant signals associated with a condition that provides a confidence level on the discoveries made, which can be implemented in metabolic profiling studies.
Couto Alves, Alexessander
87b9179e-abde-4ca5-abfc-4b7c5ac8b03b
Li, Jia V.
ad266e35-53c1-45c3-9756-4fb6b0386e1c
Garcia-Perez, Isabel
1fc1a078-979c-4f3c-bc09-d09830e6a2be
Sands, Caroline
c0ec736e-0947-4784-8e50-373efdf388bd
Barbas, Coral
4adae11a-2120-46d9-a4a2-e2e4dcc84695
Holmes, Elaine
bbb8e82c-7391-475a-8930-b86259e85424
Ebbels, Timothy M.D.
89aabc3f-5990-4b2d-817a-ce44ddfe2928
27 April 2012
Couto Alves, Alexessander
87b9179e-abde-4ca5-abfc-4b7c5ac8b03b
Li, Jia V.
ad266e35-53c1-45c3-9756-4fb6b0386e1c
Garcia-Perez, Isabel
1fc1a078-979c-4f3c-bc09-d09830e6a2be
Sands, Caroline
c0ec736e-0947-4784-8e50-373efdf388bd
Barbas, Coral
4adae11a-2120-46d9-a4a2-e2e4dcc84695
Holmes, Elaine
bbb8e82c-7391-475a-8930-b86259e85424
Ebbels, Timothy M.D.
89aabc3f-5990-4b2d-817a-ce44ddfe2928
Couto Alves, Alexessander, Li, Jia V., Garcia-Perez, Isabel, Sands, Caroline, Barbas, Coral, Holmes, Elaine and Ebbels, Timothy M.D.
(2012)
Characterization of data analysis methods for information recovery from metabolic 1 H NMR spectra using artificial complex mixtures.
Metabolomics.
(doi:10.1007/s11306-012-0422-8).
Abstract
The assessment of data analysis methods in 1H NMR based metabolic profiling is hampered owing to a lack of knowledge of the exact sample composition. In this study, an artificial complex mixture design comprising two artificially defined groups designated normal and disease, each containing 30 samples, was implemented using 21 metabolites at concentrations typically found in human urine and having a realistic distribution of inter-metabolite correlations. These artificial mixtures were profiled by 1H NMR spectroscopy and used to assess data analytical methods in the task of differentiating the two conditions. When metabolites were individually quantified, volcano plots provided an excellent method to track the effect size and significance of the change between conditions. Interestingly, the Welch t test detected a similar set of metabolites changing between classes in both quantified and spectral data, suggesting that differential analysis of 1H NMR spectra using a false discovery rate correction, taking into account fold changes, is a reliable approach to detect differential metabolites in complex mixture studies. Various multivariate regression methods based on partial least squares (PLS) were applied in discriminant analysis mode. The most reliable methods in quantified and spectral 1H NMR data were PLS and RPLS linear and logistic regression respectively. A jackknife based strategy for variable selection was assessed on both quantified and spectral data and results indicate that it may be possible to improve on the conventional Orthogonal-PLS methodology in terms of accuracy and sensitivity. A key improvement of our approach consists of objective criteria to select significant signals associated with a condition that provides a confidence level on the discoveries made, which can be implemented in metabolic profiling studies.
Text
s11306-012-0422-8
- Version of Record
Restricted to Repository staff only
Request a copy
More information
Accepted/In Press date: 29 March 2012
Published date: 27 April 2012
Identifiers
Local EPrints ID: 494685
URI: http://eprints.soton.ac.uk/id/eprint/494685
ISSN: 1573-3882
PURE UUID: eb0e52cb-6104-4494-85b6-1d59afb1f980
Catalogue record
Date deposited: 14 Oct 2024 16:37
Last modified: 19 Oct 2024 02:14
Export record
Altmetrics
Contributors
Author:
Alexessander Couto Alves
Author:
Jia V. Li
Author:
Isabel Garcia-Perez
Author:
Caroline Sands
Author:
Coral Barbas
Author:
Elaine Holmes
Author:
Timothy M.D. Ebbels
Download statistics
Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.
View more statistics