The University of Southampton
University of Southampton Institutional Repository

Advancing pediatric and longitudinal DNA methylation studies with CellsPickMe, an integrated blood cell deconvolution method

Advancing pediatric and longitudinal DNA methylation studies with CellsPickMe, an integrated blood cell deconvolution method
Advancing pediatric and longitudinal DNA methylation studies with CellsPickMe, an integrated blood cell deconvolution method
Prospective birth cohorts offer the potential to interrogate the relation between early life environment and embedded biological processes such as DNA methylation (DNAme). These association studies are frequently conducted in the context of blood, a heterogeneous tissue composed of diverse cell types. Accounting for this cellular heterogeneity across samples is essential, as it is a main contributor to inter-individual DNAme variation. Integrated blood cell deconvolution of pediatric and longitudinal birth cohorts poses a major challenge, as existing methods fail to account for the distinct cell population shift between birth and adolescence. In this paper, we critically evaluated the reference-based deconvolution procedure and optimized its prediction accuracy for longitudinal birth cohorts using DNAme data from the Canadian Healthy Infant Longitudinal Development (CHILD) cohort. The optimized algorithm, CellsPickMe, integrates cord and adult references and picks DNAme features for each population of cells with machine learning algorithms. It demonstrated improved deconvolution accuracy in cord, pediatric, and adult blood samples compared to existing benchmark methods. CellsPickMe supports blood cell deconvolution across early developmental periods under a single framework, enabling cross-time-point integration of longitudinal DNAme studies. Given the increased resolution of cell populations predicted by CellsPickMe, this R package empowers researchers to explore immune system dynamics using DNAme data in population studies across the life course.
bioRxiv
Fu, Maggie P.
6ae6e7ac-1585-43d6-b6c9-1294f4f5203f
Edwards, Karlie
88ab2ce0-de81-4823-a982-4d63a5bb17f1
Navarro-Delgado, Erick I.
ed26b11a-0e90-4aaf-a79a-ede4ef88c7f4
Merrill, Sarah M.
08e80b8d-b695-42cf-ba87-c33e98251af7
Kitaba, Negusse T.
5e35ae4a-edaa-4b78-bcb6-00628c3b6e83
Konwar, Chaini
3d2bc48d-4f9f-4478-a424-96dbabd31ed2
Mandhane, Piush
dd6b6e6a-dfa3-4cca-a591-0b950f6df5c6
Simons, Elinor
c50c713a-c904-4d3d-9b97-042b7f602b08
Subbarao, Padmaja
c0a3d512-ee92-416e-9911-21c7854ced01
Moraes, Theo J.
5d5dfdff-3ec0-4f79-861f-89de13784ac8
Holloway, John W.
4bbd77e6-c095-445d-a36b-a50a72f6fe1a
Turvey, Stuart E.
8df089eb-c556-4431-872d-c80316570e26
Kobor, Michael S.
e387ab6f-d060-4d39-95c6-acf0c3b9687b
Fu, Maggie P.
6ae6e7ac-1585-43d6-b6c9-1294f4f5203f
Edwards, Karlie
88ab2ce0-de81-4823-a982-4d63a5bb17f1
Navarro-Delgado, Erick I.
ed26b11a-0e90-4aaf-a79a-ede4ef88c7f4
Merrill, Sarah M.
08e80b8d-b695-42cf-ba87-c33e98251af7
Kitaba, Negusse T.
5e35ae4a-edaa-4b78-bcb6-00628c3b6e83
Konwar, Chaini
3d2bc48d-4f9f-4478-a424-96dbabd31ed2
Mandhane, Piush
dd6b6e6a-dfa3-4cca-a591-0b950f6df5c6
Simons, Elinor
c50c713a-c904-4d3d-9b97-042b7f602b08
Subbarao, Padmaja
c0a3d512-ee92-416e-9911-21c7854ced01
Moraes, Theo J.
5d5dfdff-3ec0-4f79-861f-89de13784ac8
Holloway, John W.
4bbd77e6-c095-445d-a36b-a50a72f6fe1a
Turvey, Stuart E.
8df089eb-c556-4431-872d-c80316570e26
Kobor, Michael S.
e387ab6f-d060-4d39-95c6-acf0c3b9687b

[Unknown type: UNSPECIFIED]

Record type: UNSPECIFIED

Abstract

Prospective birth cohorts offer the potential to interrogate the relation between early life environment and embedded biological processes such as DNA methylation (DNAme). These association studies are frequently conducted in the context of blood, a heterogeneous tissue composed of diverse cell types. Accounting for this cellular heterogeneity across samples is essential, as it is a main contributor to inter-individual DNAme variation. Integrated blood cell deconvolution of pediatric and longitudinal birth cohorts poses a major challenge, as existing methods fail to account for the distinct cell population shift between birth and adolescence. In this paper, we critically evaluated the reference-based deconvolution procedure and optimized its prediction accuracy for longitudinal birth cohorts using DNAme data from the Canadian Healthy Infant Longitudinal Development (CHILD) cohort. The optimized algorithm, CellsPickMe, integrates cord and adult references and picks DNAme features for each population of cells with machine learning algorithms. It demonstrated improved deconvolution accuracy in cord, pediatric, and adult blood samples compared to existing benchmark methods. CellsPickMe supports blood cell deconvolution across early developmental periods under a single framework, enabling cross-time-point integration of longitudinal DNAme studies. Given the increased resolution of cell populations predicted by CellsPickMe, this R package empowers researchers to explore immune system dynamics using DNAme data in population studies across the life course.

Text
2025.04.22.649907v1.full - Author's Original
Download (2MB)

More information

Published date: 24 April 2025

Identifiers

Local EPrints ID: 502872
URI: http://eprints.soton.ac.uk/id/eprint/502872
PURE UUID: e8358144-bc10-47b4-887b-f652e2c08811
ORCID for Negusse T. Kitaba: ORCID iD orcid.org/0000-0001-7518-9096
ORCID for John W. Holloway: ORCID iD orcid.org/0000-0001-9998-0464

Catalogue record

Date deposited: 10 Jul 2025 17:19
Last modified: 22 Aug 2025 02:19

Export record

Altmetrics

Contributors

Author: Maggie P. Fu
Author: Karlie Edwards
Author: Erick I. Navarro-Delgado
Author: Sarah M. Merrill
Author: Chaini Konwar
Author: Piush Mandhane
Author: Elinor Simons
Author: Padmaja Subbarao
Author: Theo J. Moraes
Author: Stuart E. Turvey
Author: Michael S. Kobor

Download statistics

Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.

View more statistics

Atom RSS 1.0 RSS 2.0

Contact ePrints Soton: eprints@soton.ac.uk

ePrints Soton supports OAI 2.0 with a base URL of http://eprints.soton.ac.uk/cgi/oai2

This repository has been built using EPrints software, developed at the University of Southampton, but available to everyone to use.

We use cookies to ensure that we give you the best experience on our website. If you continue without changing your settings, we will assume that you are happy to receive cookies on the University of Southampton website.

×