Multiple system estimation using covariates having missing values and measurement error: Estimating the size of the Maori population in New Zealand
Multiple system estimation using covariates having missing values and measurement error: Estimating the size of the Maori population in New Zealand
We investigate the use of two or more linked lists, for both population size estimation and the relationship between variables appearing on all or only some lists. This relationship is usually not fully known because some individuals appear in only some lists, and some are not in any list. These two problems have been solved simultaneously using the EM algorithm. We extend this approach to estimate the size of the indigenous Māori population in New Zealand, leading to several innovations: (1) the approach is extended to four lists (including the population census), where the reporting of Māori status differs between registers; (2) some individuals in one or more lists have missing ethnicity, and we adapt the approach to handle this additional missingness; (3) some lists cover subsets of the population by design. We discuss under which assumptions such structural undercoverage can be ignored and provide a general result; (4) we treat the Māori indicator in each list as a variable measured with error, and embed a latent class model in the multiple system estimation to estimate the population size of a latent variable, interpreted as the true Māori status. Finally, we discuss estimating the Māori population size from administrative data only. Supplementary materials for our article are available online.
capture-recapture, latent class model, population size estimation, register coverage
156 - 177
Van Der Heijden, Peter G.M.
85157917-3b33-4683-81be-713f987fd612
Cruyff, Maarten
68bcfa19-3d85-4b0f-a6a4-6e148b265f19
Smith, Paul A.
a2548525-4f99-4baf-a4d0-2b216cce059c
Bycroft, Christine
79d26985-4bb7-4e14-9e19-d5ff9069dc13
Graham, Patrick
6aee6c98-c429-4c03-84e7-25447a8d9e1a
Matheson-Dunning, Nathaniel
f008db4b-5fcd-48d0-8f88-e66fa8de4e14
21 January 2022
Van Der Heijden, Peter G.M.
85157917-3b33-4683-81be-713f987fd612
Cruyff, Maarten
68bcfa19-3d85-4b0f-a6a4-6e148b265f19
Smith, Paul A.
a2548525-4f99-4baf-a4d0-2b216cce059c
Bycroft, Christine
79d26985-4bb7-4e14-9e19-d5ff9069dc13
Graham, Patrick
6aee6c98-c429-4c03-84e7-25447a8d9e1a
Matheson-Dunning, Nathaniel
f008db4b-5fcd-48d0-8f88-e66fa8de4e14
Van Der Heijden, Peter G.M., Cruyff, Maarten, Smith, Paul A., Bycroft, Christine, Graham, Patrick and Matheson-Dunning, Nathaniel
(2022)
Multiple system estimation using covariates having missing values and measurement error: Estimating the size of the Maori population in New Zealand.
Journal of the Royal Statistical Society. Series A (General), 185 (1), .
(doi:10.1111/rssa.12731).
Abstract
We investigate the use of two or more linked lists, for both population size estimation and the relationship between variables appearing on all or only some lists. This relationship is usually not fully known because some individuals appear in only some lists, and some are not in any list. These two problems have been solved simultaneously using the EM algorithm. We extend this approach to estimate the size of the indigenous Māori population in New Zealand, leading to several innovations: (1) the approach is extended to four lists (including the population census), where the reporting of Māori status differs between registers; (2) some individuals in one or more lists have missing ethnicity, and we adapt the approach to handle this additional missingness; (3) some lists cover subsets of the population by design. We discuss under which assumptions such structural undercoverage can be ignored and provide a general result; (4) we treat the Māori indicator in each list as a variable measured with error, and embed a latent class model in the multiple system estimation to estimate the population size of a latent variable, interpreted as the true Māori status. Finally, we discuss estimating the Māori population size from administrative data only. Supplementary materials for our article are available online.
Text
2007.00929
- Accepted Manuscript
More information
Accepted/In Press date: 31 May 2021
e-pub ahead of print date: 12 July 2021
Published date: 21 January 2022
Additional Information:
Publisher Copyright:
© 2021 The Authors. Journal of the Royal Statistical Society: Series A (Statistics in Society) published by John Wiley & Sons Ltd on behalf of Royal Statistical Society
Copyright:
Copyright 2021 Elsevier B.V., All rights reserved.
Keywords:
capture-recapture, latent class model, population size estimation, register coverage
Identifiers
Local EPrints ID: 449735
URI: http://eprints.soton.ac.uk/id/eprint/449735
ISSN: 0035-9238
PURE UUID: bd7c4d42-6dae-427a-9bbf-88fe5ef07656
Catalogue record
Date deposited: 15 Jun 2021 16:31
Last modified: 16 Apr 2024 04:02
Export record
Altmetrics
Contributors
Author:
Maarten Cruyff
Author:
Christine Bycroft
Author:
Patrick Graham
Author:
Nathaniel Matheson-Dunning
Download statistics
Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.
View more statistics