Geostatistical and machine learning approaches for high-resolution mapping of vaccination coverage
Geostatistical and machine learning approaches for high-resolution mapping of vaccination coverage
Recently, there has been a growing interest in the production of high-resolution maps of vaccination coverage. These maps have been useful for uncovering geographic inequities in coverage and improving targeting of interventions to reach marginalized populations. Different methodological approaches have been developed for producing these maps using mostly geolocated household survey data and geospatial covariate information. However, it remains unclear how much the predicted coverage maps produced by the various methods differ, and which methods yield more reliable estimates. Here, we explore the predictive performance of these methods and resulting implications for spatial prioritization to fill this gap. Using Nigeria Demographic and Health Survey as a case study, we generate 1x1 km and district level maps of indicators of vaccination coverage using geostatistical, machine learning (ML) and hybrid methods and evaluate predictive performance via cross-validation. Our results show similar predictive performance for five of the seven methods investigated, although two geostatistical approaches are the best performing methods. The worst-performing methods are two ML approaches. We find marked differences in spatial prioritization using these methods, which could potentially result in missing important underserved populations, although broad similarities exist. Our study can help guide map production for other health and development metrics.
Bayesian geostatistics, Demographic and Health Surveys, Health and development indicators, INLA-SPDE, Machine learning, Vaccination coverage
Utazi, C. Edson
e69ca81e-fb23-4bc1-99a5-25c9e0f4d6f9
Yankey, Ortis
9965d053-8afb-462f-b7fe-b270e21f2ec1
Chaudhuri, Somnath
ae0507e0-f920-4438-bc9f-ecdd5ac8967a
Olowe, Iyanuloluwa D.
3993579e-505f-49c6-a35d-0e83d882c3fa
Danovaro-Holliday, M. Carolina
a5112753-8972-4473-91d9-0677b04fbeae
Lazar, Attila N.
d7f835e7-1e3d-4742-b366-af19cf5fc881
Tatem, Andrew J.
6c6de104-a5f9-46e0-bb93-a1a7c980513e
23 August 2025
Utazi, C. Edson
e69ca81e-fb23-4bc1-99a5-25c9e0f4d6f9
Yankey, Ortis
9965d053-8afb-462f-b7fe-b270e21f2ec1
Chaudhuri, Somnath
ae0507e0-f920-4438-bc9f-ecdd5ac8967a
Olowe, Iyanuloluwa D.
3993579e-505f-49c6-a35d-0e83d882c3fa
Danovaro-Holliday, M. Carolina
a5112753-8972-4473-91d9-0677b04fbeae
Lazar, Attila N.
d7f835e7-1e3d-4742-b366-af19cf5fc881
Tatem, Andrew J.
6c6de104-a5f9-46e0-bb93-a1a7c980513e
Utazi, C. Edson, Yankey, Ortis, Chaudhuri, Somnath, Olowe, Iyanuloluwa D., Danovaro-Holliday, M. Carolina, Lazar, Attila N. and Tatem, Andrew J.
(2025)
Geostatistical and machine learning approaches for high-resolution mapping of vaccination coverage.
Spatial and Spatio-temporal Epidemiology, 54, [100744].
(doi:10.1016/j.sste.2025.100744).
Abstract
Recently, there has been a growing interest in the production of high-resolution maps of vaccination coverage. These maps have been useful for uncovering geographic inequities in coverage and improving targeting of interventions to reach marginalized populations. Different methodological approaches have been developed for producing these maps using mostly geolocated household survey data and geospatial covariate information. However, it remains unclear how much the predicted coverage maps produced by the various methods differ, and which methods yield more reliable estimates. Here, we explore the predictive performance of these methods and resulting implications for spatial prioritization to fill this gap. Using Nigeria Demographic and Health Survey as a case study, we generate 1x1 km and district level maps of indicators of vaccination coverage using geostatistical, machine learning (ML) and hybrid methods and evaluate predictive performance via cross-validation. Our results show similar predictive performance for five of the seven methods investigated, although two geostatistical approaches are the best performing methods. The worst-performing methods are two ML approaches. We find marked differences in spatial prioritization using these methods, which could potentially result in missing important underserved populations, although broad similarities exist. Our study can help guide map production for other health and development metrics.
Text
Accepted manuscript
- Accepted Manuscript
Text
1-s2.0-S1877584525000358-main
- Version of Record
More information
Accepted/In Press date: 18 August 2025
e-pub ahead of print date: 19 August 2025
Published date: 23 August 2025
Keywords:
Bayesian geostatistics, Demographic and Health Surveys, Health and development indicators, INLA-SPDE, Machine learning, Vaccination coverage
Identifiers
Local EPrints ID: 505092
URI: http://eprints.soton.ac.uk/id/eprint/505092
ISSN: 1877-5845
PURE UUID: 15a29539-1bf5-4068-b881-542bc6611b7e
Catalogue record
Date deposited: 25 Sep 2025 17:15
Last modified: 10 Oct 2025 02:11
Export record
Altmetrics
Download statistics
Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.
View more statistics