The University of Southampton
University of Southampton Institutional Repository

How accurate are WorldPop-Global-Unconstrained gridded population data at the cell-level?: A simulation analysis in urban Namibia

How accurate are WorldPop-Global-Unconstrained gridded population data at the cell-level?: A simulation analysis in urban Namibia
How accurate are WorldPop-Global-Unconstrained gridded population data at the cell-level?: A simulation analysis in urban Namibia

Disaggregated population counts are needed to calculate health, economic, and development indicators in Low- and Middle-Income Countries (LMICs), especially in settings of rapid urbanisation. Censuses are often outdated and inaccurate in LMIC settings, and rarely disaggregated at fine geographic scale. Modelled gridded population datasets derived from census data have become widely used by development researchers and practitioners; however, accuracy in these datasets are evaluated at the spatial scale of model input data which is generally courser than the neighbourhood or cell-level scale of many applications. We simulate a realistic synthetic 2016 population in Khomas, Namibia, a majority urban region, and introduce several realistic levels of outdatedness (over 15 years) and inaccuracy in slum, non-slum, and rural areas. We aggregate the synthetic populations by census and administrative boundaries (to mimic census data), resulting in 32 gridded population datasets that are typical of LMIC settings using the WorldPop-Global-Unconstrained gridded population approach. We evaluate the cell-level accuracy of these gridded population datasets using the original synthetic population as a reference. In our simulation, we found large cell-level errors, particularly in slum cells. These were driven by the averaging of population densities in large areal units before model training. Age, accuracy, and aggregation of the input data also played a role in these errors. We suggest incorporating finer-scale training data into gridded population models generally, and WorldPop-Global-Unconstrained in particular (e.g., from routine household surveys or slum community population counts), and use of new building footprint datasets as a covariate to improve cell-level accuracy (as done in some new WorldPop-Global-Constrained datasets). It is important to measure accuracy of gridded population datasets at spatial scales more consistent with how the data are being applied, especially if they are to be used for monitoring key development indicators at neighbourhood scales within cities.

Censuses, Computer Simulation, Humans, Namibia, Population Density, Residence Characteristics, Urban Population
1932-6203
e0271504
Thomson, Dana R.
1ad13f81-f22e-4d89-a288-b05fb08b6c39
Leasure, Douglas R.
c025de11-3c61-45b0-9b19-68d1d37959cd
Bird, Tomas
b491394a-2b91-42d5-8262-d1c0e9ff17cd
Tzavidis, Nikos
431ec55d-c147-466d-9c65-0f377b0c1f6a
Tatem, Andrew J.
6c6de104-a5f9-46e0-bb93-a1a7c980513e
Thomson, Dana R.
1ad13f81-f22e-4d89-a288-b05fb08b6c39
Leasure, Douglas R.
c025de11-3c61-45b0-9b19-68d1d37959cd
Bird, Tomas
b491394a-2b91-42d5-8262-d1c0e9ff17cd
Tzavidis, Nikos
431ec55d-c147-466d-9c65-0f377b0c1f6a
Tatem, Andrew J.
6c6de104-a5f9-46e0-bb93-a1a7c980513e

Thomson, Dana R., Leasure, Douglas R., Bird, Tomas, Tzavidis, Nikos and Tatem, Andrew J. (2022) How accurate are WorldPop-Global-Unconstrained gridded population data at the cell-level?: A simulation analysis in urban Namibia. PLoS ONE, 17 (7 July), e0271504, [e0271504]. (doi:10.1371/journal.pone.0271504).

Record type: Article

Abstract

Disaggregated population counts are needed to calculate health, economic, and development indicators in Low- and Middle-Income Countries (LMICs), especially in settings of rapid urbanisation. Censuses are often outdated and inaccurate in LMIC settings, and rarely disaggregated at fine geographic scale. Modelled gridded population datasets derived from census data have become widely used by development researchers and practitioners; however, accuracy in these datasets are evaluated at the spatial scale of model input data which is generally courser than the neighbourhood or cell-level scale of many applications. We simulate a realistic synthetic 2016 population in Khomas, Namibia, a majority urban region, and introduce several realistic levels of outdatedness (over 15 years) and inaccuracy in slum, non-slum, and rural areas. We aggregate the synthetic populations by census and administrative boundaries (to mimic census data), resulting in 32 gridded population datasets that are typical of LMIC settings using the WorldPop-Global-Unconstrained gridded population approach. We evaluate the cell-level accuracy of these gridded population datasets using the original synthetic population as a reference. In our simulation, we found large cell-level errors, particularly in slum cells. These were driven by the averaging of population densities in large areal units before model training. Age, accuracy, and aggregation of the input data also played a role in these errors. We suggest incorporating finer-scale training data into gridded population models generally, and WorldPop-Global-Unconstrained in particular (e.g., from routine household surveys or slum community population counts), and use of new building footprint datasets as a covariate to improve cell-level accuracy (as done in some new WorldPop-Global-Constrained datasets). It is important to measure accuracy of gridded population datasets at spatial scales more consistent with how the data are being applied, especially if they are to be used for monitoring key development indicators at neighbourhood scales within cities.

Text
journal.pone.0271504 - Version of Record
Available under License Creative Commons Attribution.
Download (3MB)

More information

Accepted/In Press date: 4 July 2022
Published date: 21 July 2022
Additional Information: Funding Information: Dana R. Thomson was funded by the Economic and Social Research Council (ESRC) grant number ES/5500161/1 (more information at https://esrc.ukri.org/). ESRC played no role in the design, analysis, decision to publish, or preparation of this manuscript. We would like to thank Drs. Angela Luna Hernandez and Ryan Engstrom for their feedback on an earlier version of this work. Publisher Copyright: © 2022 Thomson et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Keywords: Censuses, Computer Simulation, Humans, Namibia, Population Density, Residence Characteristics, Urban Population

Identifiers

Local EPrints ID: 468941
URI: http://eprints.soton.ac.uk/id/eprint/468941
ISSN: 1932-6203
PURE UUID: 635c123c-cb0d-4095-9500-b652daf4f154
ORCID for Douglas R. Leasure: ORCID iD orcid.org/0000-0002-8768-2811
ORCID for Nikos Tzavidis: ORCID iD orcid.org/0000-0002-8413-8095
ORCID for Andrew J. Tatem: ORCID iD orcid.org/0000-0002-7270-941X

Catalogue record

Date deposited: 01 Sep 2022 17:01
Last modified: 17 Mar 2024 03:53

Export record

Altmetrics

Contributors

Author: Dana R. Thomson
Author: Douglas R. Leasure ORCID iD
Author: Tomas Bird
Author: Nikos Tzavidis ORCID iD
Author: Andrew J. Tatem ORCID iD

Download statistics

Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.

View more statistics

Atom RSS 1.0 RSS 2.0

Contact ePrints Soton: eprints@soton.ac.uk

ePrints Soton supports OAI 2.0 with a base URL of http://eprints.soton.ac.uk/cgi/oai2

This repository has been built using EPrints software, developed at the University of Southampton, but available to everyone to use.

We use cookies to ensure that we give you the best experience on our website. If you continue without changing your settings, we will assume that you are happy to receive cookies on the University of Southampton website.

×