The University of Southampton
University of Southampton Institutional Repository

Evaluating the accuracy and feasibility of gridded population sampling to overcome bias due to missing populations in household surveys

Evaluating the accuracy and feasibility of gridded population sampling to overcome bias due to missing populations in household surveys
Evaluating the accuracy and feasibility of gridded population sampling to overcome bias due to missing populations in household surveys
In low- and middle-income countries (LMICs), household survey data are a main source of information for planning, evaluation, and decision-making. LMIC survey methods have not changed substantially in forty years, while population trends and technologies have. This mismatch is likely excluding an increasing number of vulnerable and mobile populations and leading to erosion in LMIC household survey data accuracy. When populations are under-represented in surveys, they are less likely to benefit from funds allocated by governments and international agencies to promote health and social equity. This thesis evaluates the accuracy of using gridded population sample frames as an alternative to census sample frames, and the feasibility of implementing gridded population surveys in a real-world setting.
To compare accuracy of census and gridded population sample frames, a realistic “true” population was simulated in Khomas, Namibia for 2016. The simulated population was derived from Namibia’s 2011 census and 2013 Demographic and Health Survey comprising several household outcome indicators and latitude-longitude coordinates digitised from actual building locations in satellite imagery. Sixteen versions of realistic outdated-inaccurate censuses were simulated by (a) removing households in buildings not present in satellite imagery in 2011, 2006, and 2001, and (b) randomly removing a percentage of rural, urban slum, and urban non-slum households based on a literature review of LMIC census undercounts. For each realistic census, a gridded population sample frame was modelled using WorldPop methodologies. Accuracy was then assessed by drawing repeated samples from the simulated censuses and gridded population datasets, and comparing them to the “true” population. To evaluate feasibility, a gridded population survey was conducted in Kathmandu, Nepal and focus group discussions and observation were made with survey implementers.
In the accuracy assessment, I found that the main source of error in gridded population datasets was not outdated, inaccurate census inputs, but rather, the WorldPop practice of using average population density from aggregated census units to predict population density in finer grid cells. Despite inaccuracies, I found that WorldPop methods disaggregated population in or around settlements, and that the distribution of population estimates in grid cells within urban areas were +/-20% the “true” population distribution. Gridded population sample frames derived from outdated or inaccurate census data produced more accurate survey estimates than the original outdated, inaccurate census EAs frames. In the Kathmandu feasibility assessment, gridded population sampling proved to be cost-effective and feasible, and use of an area-microcensus design identified more vulnerable and mobile households than a comparable two-stage sample.
University of Southampton
Thomson, Dana
c6aa22a0-9ee2-4d86-9bd4-b3a8487eb15b
Thomson, Dana
c6aa22a0-9ee2-4d86-9bd4-b3a8487eb15b
Tzavidis, Nikolaos
431ec55d-c147-466d-9c65-0f377b0c1f6a

Thomson, Dana (2020) Evaluating the accuracy and feasibility of gridded population sampling to overcome bias due to missing populations in household surveys. University of Southampton, Doctoral Thesis, 276pp.

Record type: Thesis (Doctoral)

Abstract

In low- and middle-income countries (LMICs), household survey data are a main source of information for planning, evaluation, and decision-making. LMIC survey methods have not changed substantially in forty years, while population trends and technologies have. This mismatch is likely excluding an increasing number of vulnerable and mobile populations and leading to erosion in LMIC household survey data accuracy. When populations are under-represented in surveys, they are less likely to benefit from funds allocated by governments and international agencies to promote health and social equity. This thesis evaluates the accuracy of using gridded population sample frames as an alternative to census sample frames, and the feasibility of implementing gridded population surveys in a real-world setting.
To compare accuracy of census and gridded population sample frames, a realistic “true” population was simulated in Khomas, Namibia for 2016. The simulated population was derived from Namibia’s 2011 census and 2013 Demographic and Health Survey comprising several household outcome indicators and latitude-longitude coordinates digitised from actual building locations in satellite imagery. Sixteen versions of realistic outdated-inaccurate censuses were simulated by (a) removing households in buildings not present in satellite imagery in 2011, 2006, and 2001, and (b) randomly removing a percentage of rural, urban slum, and urban non-slum households based on a literature review of LMIC census undercounts. For each realistic census, a gridded population sample frame was modelled using WorldPop methodologies. Accuracy was then assessed by drawing repeated samples from the simulated censuses and gridded population datasets, and comparing them to the “true” population. To evaluate feasibility, a gridded population survey was conducted in Kathmandu, Nepal and focus group discussions and observation were made with survey implementers.
In the accuracy assessment, I found that the main source of error in gridded population datasets was not outdated, inaccurate census inputs, but rather, the WorldPop practice of using average population density from aggregated census units to predict population density in finer grid cells. Despite inaccuracies, I found that WorldPop methods disaggregated population in or around settlements, and that the distribution of population estimates in grid cells within urban areas were +/-20% the “true” population distribution. Gridded population sample frames derived from outdated or inaccurate census data produced more accurate survey estimates than the original outdated, inaccurate census EAs frames. In the Kathmandu feasibility assessment, gridded population sampling proved to be cost-effective and feasible, and use of an area-microcensus design identified more vulnerable and mobile households than a comparable two-stage sample.

Text
thomson_28073568_thesis - Version of Record
Available under License University of Southampton Thesis Licence.
Download (12MB)
Text
TPermission to deposit thesis - form - signed
Restricted to Repository staff only

More information

Published date: March 2020

Identifiers

Local EPrints ID: 450772
URI: http://eprints.soton.ac.uk/id/eprint/450772
PURE UUID: f0b33dad-ad45-4ed9-aeea-e229239fb98c
ORCID for Dana Thomson: ORCID iD orcid.org/0000-0002-9507-9123
ORCID for Nikolaos Tzavidis: ORCID iD orcid.org/0000-0002-8413-8095

Catalogue record

Date deposited: 11 Aug 2021 16:30
Last modified: 17 Mar 2024 02:54

Export record

Contributors

Author: Dana Thomson ORCID iD
Thesis advisor: Nikolaos Tzavidis ORCID iD

Download statistics

Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.

View more statistics

Atom RSS 1.0 RSS 2.0

Contact ePrints Soton: eprints@soton.ac.uk

ePrints Soton supports OAI 2.0 with a base URL of http://eprints.soton.ac.uk/cgi/oai2

This repository has been built using EPrints software, developed at the University of Southampton, but available to everyone to use.

We use cookies to ensure that we give you the best experience on our website. If you continue without changing your settings, we will assume that you are happy to receive cookies on the University of Southampton website.

×