How accurate are WorldPop-Global-Unconstrained gridded population data at the cell-level?: A simulation analysis in urban Namibia
How accurate are WorldPop-Global-Unconstrained gridded population data at the cell-level?: A simulation analysis in urban Namibia
Disaggregated population counts are needed to calculate health, economic, and development indicators in Low- and Middle-Income Countries (LMICs), especially in settings of rapid urbanisation. Censuses are often outdated and inaccurate in LMIC settings, and rarely disaggregated at fine geographic scale. Modelled gridded population datasets derived from census data have become widely used by development researchers and practitioners; however, accuracy in these datasets are evaluated at the spatial scale of model input data which is generally courser than the neighbourhood or cell-level scale of many applications. We simulate a realistic synthetic 2016 population in Khomas, Namibia, a majority urban region, and introduce several realistic levels of outdatedness (over 15 years) and inaccuracy in slum, non-slum, and rural areas. We aggregate the synthetic populations by census and administrative boundaries (to mimic census data), resulting in 32 gridded population datasets that are typical of LMIC settings using the WorldPop-Global-Unconstrained gridded population approach. We evaluate the cell-level accuracy of these gridded population datasets using the original synthetic population as a reference. In our simulation, we found large cell-level errors, particularly in slum cells. These were driven by the averaging of population densities in large areal units before model training. Age, accuracy, and aggregation of the input data also played a role in these errors. We suggest incorporating finer-scale training data into gridded population models generally, and WorldPop-Global-Unconstrained in particular (e.g., from routine household surveys or slum community population counts), and use of new building footprint datasets as a covariate to improve cell-level accuracy (as done in some new WorldPop-Global-Constrained datasets). It is important to measure accuracy of gridded population datasets at spatial scales more consistent with how the data are being applied, especially if they are to be used for monitoring key development indicators at neighbourhood scales within cities.
Censuses, Computer Simulation, Humans, Namibia, Population Density, Residence Characteristics, Urban Population
e0271504
Thomson, Dana R.
1ad13f81-f22e-4d89-a288-b05fb08b6c39
Leasure, Douglas R.
c025de11-3c61-45b0-9b19-68d1d37959cd
Bird, Tomas
b491394a-2b91-42d5-8262-d1c0e9ff17cd
Tzavidis, Nikos
431ec55d-c147-466d-9c65-0f377b0c1f6a
Tatem, Andrew J.
6c6de104-a5f9-46e0-bb93-a1a7c980513e
21 July 2022
Thomson, Dana R.
1ad13f81-f22e-4d89-a288-b05fb08b6c39
Leasure, Douglas R.
c025de11-3c61-45b0-9b19-68d1d37959cd
Bird, Tomas
b491394a-2b91-42d5-8262-d1c0e9ff17cd
Tzavidis, Nikos
431ec55d-c147-466d-9c65-0f377b0c1f6a
Tatem, Andrew J.
6c6de104-a5f9-46e0-bb93-a1a7c980513e
Thomson, Dana R., Leasure, Douglas R., Bird, Tomas, Tzavidis, Nikos and Tatem, Andrew J.
(2022)
How accurate are WorldPop-Global-Unconstrained gridded population data at the cell-level?: A simulation analysis in urban Namibia.
PLoS ONE, 17 (7 July), , [e0271504].
(doi:10.1371/journal.pone.0271504).
Abstract
Disaggregated population counts are needed to calculate health, economic, and development indicators in Low- and Middle-Income Countries (LMICs), especially in settings of rapid urbanisation. Censuses are often outdated and inaccurate in LMIC settings, and rarely disaggregated at fine geographic scale. Modelled gridded population datasets derived from census data have become widely used by development researchers and practitioners; however, accuracy in these datasets are evaluated at the spatial scale of model input data which is generally courser than the neighbourhood or cell-level scale of many applications. We simulate a realistic synthetic 2016 population in Khomas, Namibia, a majority urban region, and introduce several realistic levels of outdatedness (over 15 years) and inaccuracy in slum, non-slum, and rural areas. We aggregate the synthetic populations by census and administrative boundaries (to mimic census data), resulting in 32 gridded population datasets that are typical of LMIC settings using the WorldPop-Global-Unconstrained gridded population approach. We evaluate the cell-level accuracy of these gridded population datasets using the original synthetic population as a reference. In our simulation, we found large cell-level errors, particularly in slum cells. These were driven by the averaging of population densities in large areal units before model training. Age, accuracy, and aggregation of the input data also played a role in these errors. We suggest incorporating finer-scale training data into gridded population models generally, and WorldPop-Global-Unconstrained in particular (e.g., from routine household surveys or slum community population counts), and use of new building footprint datasets as a covariate to improve cell-level accuracy (as done in some new WorldPop-Global-Constrained datasets). It is important to measure accuracy of gridded population datasets at spatial scales more consistent with how the data are being applied, especially if they are to be used for monitoring key development indicators at neighbourhood scales within cities.
Text
journal.pone.0271504
- Version of Record
More information
Accepted/In Press date: 4 July 2022
Published date: 21 July 2022
Additional Information:
Funding Information:
Dana R. Thomson was funded by the Economic and Social Research Council (ESRC) grant number ES/5500161/1 (more information at https://esrc.ukri.org/). ESRC played no role in the design, analysis, decision to publish, or preparation of this manuscript. We would like to thank Drs. Angela Luna Hernandez and Ryan Engstrom for their feedback on an earlier version of this work.
Publisher Copyright:
© 2022 Thomson et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Keywords:
Censuses, Computer Simulation, Humans, Namibia, Population Density, Residence Characteristics, Urban Population
Identifiers
Local EPrints ID: 468941
URI: http://eprints.soton.ac.uk/id/eprint/468941
ISSN: 1932-6203
PURE UUID: 635c123c-cb0d-4095-9500-b652daf4f154
Catalogue record
Date deposited: 01 Sep 2022 17:01
Last modified: 17 Mar 2024 03:53
Export record
Altmetrics
Contributors
Author:
Dana R. Thomson
Author:
Douglas R. Leasure
Author:
Tomas Bird
Download statistics
Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.
View more statistics