A grid-based sample design framework for household surveys
A grid-based sample design framework for household surveys
Traditional sample designs for household surveys are contingent upon the availability of a representative primary sampling frame. This is defined using enumeration units and population counts retrieved from decennial national censuses that can become rapidly inaccurate in highly dynamic demographic settings. To tackle the need for representative sampling frames, we propose an original grid-based sample design framework introducing essential concepts of spatial sampling in household surveys. In this framework, the sampling frame is defined based on gridded population estimates and formalized as a bi-dimensional random field, characterized by spatial trends, spatial autocorrelation, and stratification. The sampling design reflects the characteristics of the random field by combining contextual stratification and proportional to population size sampling. A nonparametric estimator is applied to evaluate the sampling design and inform sample size estimation. We demonstrate an application of the proposed framework through a case study developed in two provinces located in the western part of the Democratic Republic of the Congo. We define a sampling frame consisting of settled cells with associated population estimates. We then perform a contextual stratification by applying a principal component analysis (PCA) and k-means clustering to a set of gridded geospatial covariates, and sample settled cells proportionally to population size. Lastly, we evaluate the sampling design by contrasting the empirical cumulative distribution function for the entire population of interest and its weighted counterpart across different sample sizes and identify an adequate sample size using the Kolmogorov-Smirnov distance between the two functions. The results of the case study underscore the strengths and limitations of the proposed grid-based sample design framework and foster further research into the application of spatial sampling concepts in household surveys.
Demography, Household Surveys, Sample Design, Spatial Sampling, Gridded Population, Democratic Republic of the Congo
1-18
Boo, Gianluca
d49f7aaa-6d95-4e36-b9be-e469911c4a3d
Darin, Edith
868fa688-2567-4dbd-aa12-3dcc91f2aa8d
Thomson-Browne, Dana, Renee
c6aa22a0-9ee2-4d86-9bd4-b3a8487eb15b
Tatem, Andrew
6c6de104-a5f9-46e0-bb93-a1a7c980513e
January 2020
Boo, Gianluca
d49f7aaa-6d95-4e36-b9be-e469911c4a3d
Darin, Edith
868fa688-2567-4dbd-aa12-3dcc91f2aa8d
Thomson-Browne, Dana, Renee
c6aa22a0-9ee2-4d86-9bd4-b3a8487eb15b
Tatem, Andrew
6c6de104-a5f9-46e0-bb93-a1a7c980513e
Boo, Gianluca, Darin, Edith, Thomson-Browne, Dana, Renee and Tatem, Andrew
(2020)
A grid-based sample design framework for household surveys.
Gates Open Research, 4 (13), .
(doi:10.12688/gatesopenres.13107.1).
Abstract
Traditional sample designs for household surveys are contingent upon the availability of a representative primary sampling frame. This is defined using enumeration units and population counts retrieved from decennial national censuses that can become rapidly inaccurate in highly dynamic demographic settings. To tackle the need for representative sampling frames, we propose an original grid-based sample design framework introducing essential concepts of spatial sampling in household surveys. In this framework, the sampling frame is defined based on gridded population estimates and formalized as a bi-dimensional random field, characterized by spatial trends, spatial autocorrelation, and stratification. The sampling design reflects the characteristics of the random field by combining contextual stratification and proportional to population size sampling. A nonparametric estimator is applied to evaluate the sampling design and inform sample size estimation. We demonstrate an application of the proposed framework through a case study developed in two provinces located in the western part of the Democratic Republic of the Congo. We define a sampling frame consisting of settled cells with associated population estimates. We then perform a contextual stratification by applying a principal component analysis (PCA) and k-means clustering to a set of gridded geospatial covariates, and sample settled cells proportionally to population size. Lastly, we evaluate the sampling design by contrasting the empirical cumulative distribution function for the entire population of interest and its weighted counterpart across different sample sizes and identify an adequate sample size using the Kolmogorov-Smirnov distance between the two functions. The results of the case study underscore the strengths and limitations of the proposed grid-based sample design framework and foster further research into the application of spatial sampling concepts in household surveys.
Text
8d6e4a3e-eed7-4e1f-bfc6-bcbde196c8d4_13107_-_gianluca_boo
- Author's Original
Text
292ddc9f-cf88-450d-8565-94176348a3dd 13107 gianluca boo
- Version of Record
More information
Submitted date: 27 January 2020
e-pub ahead of print date: 27 January 2020
Published date: January 2020
Keywords:
Demography, Household Surveys, Sample Design, Spatial Sampling, Gridded Population, Democratic Republic of the Congo
Identifiers
Local EPrints ID: 438457
URI: http://eprints.soton.ac.uk/id/eprint/438457
ISSN: 2572-4754
PURE UUID: 1062a2fb-1448-4946-81bd-312eab0337da
Catalogue record
Date deposited: 10 Mar 2020 17:32
Last modified: 01 Aug 2024 01:56
Export record
Altmetrics
Contributors
Author:
Gianluca Boo
Author:
Dana, Renee Thomson-Browne
Download statistics
Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.
View more statistics