Predicting cell types and genetic variations contributing to disease by combining GWAS and epigenetic data
Predicting cell types and genetic variations contributing to disease by combining GWAS and epigenetic data
Genome-wide association studies (GWASs) identify single nucleotide polymorphisms (SNPs) that are enriched in individuals suffering from a given disease. Most disease-associated SNPs fall into non-coding regions, so that it is not straightforward to infer phenotype or function; moreover, many SNPs are in tight genetic linkage, so that a SNP identified as associated with a particular disease may not itself be causal, but rather signify the presence of a linked SNP that is functionally relevant to disease pathogenesis. Here, we present an analysis method that takes advantage of the recent rapid accumulation of epigenomics data to address these problems for some SNPs. Using asthma as a prototypic example; we show that non-coding disease-associated SNPs are enriched in genomic regions that function as regulators of transcription, such as enhancers and promoters. Identifying enhancers based on the presence of the histone modification marks such as H3K4me1 in different cell types, we show that the location of enhancers is highly cell-type specific. We use these findings to predict which SNPs are likely to be directly contributing to disease based on their presence in regulatory regions, and in which cell types their effect is expected to be detectable. Moreover, we can also predict which cell types contribute to a disease based on overlap of the disease-associated SNPs with the locations of enhancers present in a given cell type. Finally, we suggest that it will be possible to re-analyze GWAS studies with much higher power by limiting the SNPs considered to those in coding or regulatory regions of cell types relevant to a given disease.
e54359
Gerasimova, A.
2e5e3517-b0b9-43fc-8572-b89c86fa2d8e
Chavez, L.
436a0bf2-347d-49cf-a937-cac0f6b30787
Li, B.
a8225836-5ea0-4274-8d4f-57a50a26ed59
Seumois, Gregory
0be7d3d6-5526-458c-aa5c-cce52410a2ed
Greenbaum, J.
255cadbf-878c-4b81-a618-8286edd90270
Rao, A.
48e0dfb7-cd0d-4055-a40f-b5cab3789bb4
Vijayanand, P.
79514f33-66cf-47cc-a8fa-46bbfc21b7d1
Peters, B.
044a12e3-64de-49b0-82f1-dfda5233a35e
30 January 2013
Gerasimova, A.
2e5e3517-b0b9-43fc-8572-b89c86fa2d8e
Chavez, L.
436a0bf2-347d-49cf-a937-cac0f6b30787
Li, B.
a8225836-5ea0-4274-8d4f-57a50a26ed59
Seumois, Gregory
0be7d3d6-5526-458c-aa5c-cce52410a2ed
Greenbaum, J.
255cadbf-878c-4b81-a618-8286edd90270
Rao, A.
48e0dfb7-cd0d-4055-a40f-b5cab3789bb4
Vijayanand, P.
79514f33-66cf-47cc-a8fa-46bbfc21b7d1
Peters, B.
044a12e3-64de-49b0-82f1-dfda5233a35e
Gerasimova, A., Chavez, L., Li, B., Seumois, Gregory, Greenbaum, J., Rao, A., Vijayanand, P. and Peters, B.
(2013)
Predicting cell types and genetic variations contributing to disease by combining GWAS and epigenetic data.
PLoS ONE, 8 (1), .
(doi:10.1371/journal.pone.0054359).
Abstract
Genome-wide association studies (GWASs) identify single nucleotide polymorphisms (SNPs) that are enriched in individuals suffering from a given disease. Most disease-associated SNPs fall into non-coding regions, so that it is not straightforward to infer phenotype or function; moreover, many SNPs are in tight genetic linkage, so that a SNP identified as associated with a particular disease may not itself be causal, but rather signify the presence of a linked SNP that is functionally relevant to disease pathogenesis. Here, we present an analysis method that takes advantage of the recent rapid accumulation of epigenomics data to address these problems for some SNPs. Using asthma as a prototypic example; we show that non-coding disease-associated SNPs are enriched in genomic regions that function as regulators of transcription, such as enhancers and promoters. Identifying enhancers based on the presence of the histone modification marks such as H3K4me1 in different cell types, we show that the location of enhancers is highly cell-type specific. We use these findings to predict which SNPs are likely to be directly contributing to disease based on their presence in regulatory regions, and in which cell types their effect is expected to be detectable. Moreover, we can also predict which cell types contribute to a disease based on overlap of the disease-associated SNPs with the locations of enhancers present in a given cell type. Finally, we suggest that it will be possible to re-analyze GWAS studies with much higher power by limiting the SNPs considered to those in coding or regulatory regions of cell types relevant to a given disease.
Other
fetchObject.action_uri=info_doi%2F10.1371%2Fjournal.pone.0054359&representation=PDF
- Version of Record
Available under License Other.
More information
Published date: 30 January 2013
Organisations:
Clinical & Experimental Sciences
Identifiers
Local EPrints ID: 349167
URI: http://eprints.soton.ac.uk/id/eprint/349167
ISSN: 1932-6203
PURE UUID: a530edfe-74d5-4de4-9fca-c86e3a848fab
Catalogue record
Date deposited: 26 Feb 2013 14:58
Last modified: 14 Mar 2024 13:09
Export record
Altmetrics
Contributors
Author:
A. Gerasimova
Author:
L. Chavez
Author:
B. Li
Author:
Gregory Seumois
Author:
J. Greenbaum
Author:
A. Rao
Author:
P. Vijayanand
Author:
B. Peters
Download statistics
Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.
View more statistics