The University of Southampton
University of Southampton Institutional Repository

Machine learning-based DNA methylation score for fetal exposure to maternal smoking: development and validation in samples collected from adolescents and adults

Machine learning-based DNA methylation score for fetal exposure to maternal smoking: development and validation in samples collected from adolescents and adults
Machine learning-based DNA methylation score for fetal exposure to maternal smoking: development and validation in samples collected from adolescents and adults
Background: fetal exposure to maternal smoking during pregnancy is associated with the development of non-communicable diseases in the offspring. It may induce such long-term effects through persistent changes in the DNA-methylome, which therefore holds the potential to be used as a biomarker of this early life exposure. With reducing costs for measuring DNA-methylation, we aimed to develop a DNA-methylation score that can be used on adolescent DNA-methylation data and thereby generate a score for in utero smoke exposure.

Methods: we used machine learning methods to create a score reflecting exposure to maternal smoking during pregnancy. This score is based on peripheral blood measurements of DNA methylation (Illumina’s Infinium HumanMethylation450 BeadChip).
The score was developed and tested in the Raine Study with data from 995 Caucasian 17y-old participants using 10-fold cross-validation. The score was further tested and validated in independent data from the Northern Finland Birth Cohort 1986 (NFBC1986) (16y-olds) and 1966 (NFBC1966) (31y-olds). Further, three previously proposed DNA methylation scores were applied for comparison. The final score was developed with 204 CpGs using elastic net regression.

Results: sensitivity and specificity values for the best performing previously developed classifier (‘Reese Score’) were 88% and 72% for Raine, 87% and 61% for NFBC1986 and 72% and 70% for NFBC1966, respectively; corresponding figures utilizing the elastic net regression approach were 91% and 76% (Raine), 87% and 75% (NFBC1986) as well as 72% and 78% for NFBC1966.

Conclusion: we have developed a DNA methylation score for exposure to maternal smoking during pregnancy, outperforming the three previously developed scores. One possible application of the current score could be for model adjustment purposes or to assess its association with distal health outcomes where part of the effect can be attributed to maternal smoking. Further, it may provide a biomarker for fetal exposure to maternal smoking. 
0091-6765
Rauschert, Sebastian
8fd6908a-271d-4361-88eb-091199dedbf8
Melton, Phillip E.
0d57b167-a10c-4b22-83da-b983f378966f
Heiskala, Anni
af1564e8-9b1e-4909-b364-0e10dcf35a02
Karhunen, Ville
c02eac41-ebc5-475b-b0bb-3022697b856a
Burdge, Graham
09d60a07-8ca1-4351-9bf1-de6ffcfb2159
Craig, Jeffrey M.
de0b7b89-c726-4874-96dd-f3815ca7befb
Godfrey, Keith
0931701e-fe2c-44b5-8f0d-ec5c7477a6fd
Lillycrop, Karen
eeaaa78d-0c4d-4033-a178-60ce7345a2cc
Mori, Trevor A.
182e02e5-0b4a-44c2-81c6-f910c7a4d0e6
Beilin, Lawrence J
a6eec1bf-0677-4f42-a5b3-3d8f2b53f090
Oddy, Wendy H
5c1699b4-48e2-4679-9575-f20184d06f25
Pennell, Craig
8a4cb7f7-d394-49db-b32e-22adfd734fa7
Järvelin, Marjo-Riitta
eb1fa1c9-add7-4f22-b314-1548d28a286b
Sebert, Sylvain
d3f6c0f7-92b9-4959-a86c-9f7ad3346ee3
Huang, Rae-Chi
d39aca4d-8017-48c3-8f40-0aa2e52dbf66
Rauschert, Sebastian
8fd6908a-271d-4361-88eb-091199dedbf8
Melton, Phillip E.
0d57b167-a10c-4b22-83da-b983f378966f
Heiskala, Anni
af1564e8-9b1e-4909-b364-0e10dcf35a02
Karhunen, Ville
c02eac41-ebc5-475b-b0bb-3022697b856a
Burdge, Graham
09d60a07-8ca1-4351-9bf1-de6ffcfb2159
Craig, Jeffrey M.
de0b7b89-c726-4874-96dd-f3815ca7befb
Godfrey, Keith
0931701e-fe2c-44b5-8f0d-ec5c7477a6fd
Lillycrop, Karen
eeaaa78d-0c4d-4033-a178-60ce7345a2cc
Mori, Trevor A.
182e02e5-0b4a-44c2-81c6-f910c7a4d0e6
Beilin, Lawrence J
a6eec1bf-0677-4f42-a5b3-3d8f2b53f090
Oddy, Wendy H
5c1699b4-48e2-4679-9575-f20184d06f25
Pennell, Craig
8a4cb7f7-d394-49db-b32e-22adfd734fa7
Järvelin, Marjo-Riitta
eb1fa1c9-add7-4f22-b314-1548d28a286b
Sebert, Sylvain
d3f6c0f7-92b9-4959-a86c-9f7ad3346ee3
Huang, Rae-Chi
d39aca4d-8017-48c3-8f40-0aa2e52dbf66

Rauschert, Sebastian, Melton, Phillip E., Heiskala, Anni, Karhunen, Ville, Burdge, Graham, Craig, Jeffrey M., Godfrey, Keith, Lillycrop, Karen, Mori, Trevor A., Beilin, Lawrence J, Oddy, Wendy H, Pennell, Craig, Järvelin, Marjo-Riitta, Sebert, Sylvain and Huang, Rae-Chi (2020) Machine learning-based DNA methylation score for fetal exposure to maternal smoking: development and validation in samples collected from adolescents and adults. Environmental Health Perspectives. (In Press)

Record type: Article

Abstract

Background: fetal exposure to maternal smoking during pregnancy is associated with the development of non-communicable diseases in the offspring. It may induce such long-term effects through persistent changes in the DNA-methylome, which therefore holds the potential to be used as a biomarker of this early life exposure. With reducing costs for measuring DNA-methylation, we aimed to develop a DNA-methylation score that can be used on adolescent DNA-methylation data and thereby generate a score for in utero smoke exposure.

Methods: we used machine learning methods to create a score reflecting exposure to maternal smoking during pregnancy. This score is based on peripheral blood measurements of DNA methylation (Illumina’s Infinium HumanMethylation450 BeadChip).
The score was developed and tested in the Raine Study with data from 995 Caucasian 17y-old participants using 10-fold cross-validation. The score was further tested and validated in independent data from the Northern Finland Birth Cohort 1986 (NFBC1986) (16y-olds) and 1966 (NFBC1966) (31y-olds). Further, three previously proposed DNA methylation scores were applied for comparison. The final score was developed with 204 CpGs using elastic net regression.

Results: sensitivity and specificity values for the best performing previously developed classifier (‘Reese Score’) were 88% and 72% for Raine, 87% and 61% for NFBC1986 and 72% and 70% for NFBC1966, respectively; corresponding figures utilizing the elastic net regression approach were 91% and 76% (Raine), 87% and 75% (NFBC1986) as well as 72% and 78% for NFBC1966.

Conclusion: we have developed a DNA methylation score for exposure to maternal smoking during pregnancy, outperforming the three previously developed scores. One possible application of the current score could be for model adjustment purposes or to assess its association with distal health outcomes where part of the effect can be attributed to maternal smoking. Further, it may provide a biomarker for fetal exposure to maternal smoking. 

Text
Smoking_Score_Manuscript_Rauschert_final - Accepted Manuscript
Download (214kB)
Text
Figure 1
Download (39kB)
Text
Figure 2
Download (29kB)
Text
Supplement_S1_revised_new_Rauschert
Download (170kB)
Spreadsheet
Copy of Supplement_S2_revised
Download (5MB)

Show all 5 downloads.

More information

Accepted/In Press date: 29 August 2020

Identifiers

Local EPrints ID: 443740
URI: http://eprints.soton.ac.uk/id/eprint/443740
ISSN: 0091-6765
PURE UUID: af6d8eb8-eda0-432b-9ec6-3d1597ecada0
ORCID for Graham Burdge: ORCID iD orcid.org/0000-0002-7665-2967
ORCID for Keith Godfrey: ORCID iD orcid.org/0000-0002-4643-0618
ORCID for Karen Lillycrop: ORCID iD orcid.org/0000-0001-7350-5489

Catalogue record

Date deposited: 10 Sep 2020 16:46
Last modified: 17 Mar 2024 02:42

Export record

Contributors

Author: Sebastian Rauschert
Author: Phillip E. Melton
Author: Anni Heiskala
Author: Ville Karhunen
Author: Graham Burdge ORCID iD
Author: Jeffrey M. Craig
Author: Keith Godfrey ORCID iD
Author: Karen Lillycrop ORCID iD
Author: Trevor A. Mori
Author: Lawrence J Beilin
Author: Wendy H Oddy
Author: Craig Pennell
Author: Marjo-Riitta Järvelin
Author: Sylvain Sebert
Author: Rae-Chi Huang

Download statistics

Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.

View more statistics

Atom RSS 1.0 RSS 2.0

Contact ePrints Soton: eprints@soton.ac.uk

ePrints Soton supports OAI 2.0 with a base URL of http://eprints.soton.ac.uk/cgi/oai2

This repository has been built using EPrints software, developed at the University of Southampton, but available to everyone to use.

We use cookies to ensure that we give you the best experience on our website. If you continue without changing your settings, we will assume that you are happy to receive cookies on the University of Southampton website.

×