The University of Southampton
University of Southampton Institutional Repository

Multiple system estimation using covariates having missing values and measurement error: estimating the size of the Māori population in New Zealand

Multiple system estimation using covariates having missing values and measurement error: estimating the size of the Māori population in New Zealand
Multiple system estimation using covariates having missing values and measurement error: estimating the size of the Māori population in New Zealand
We investigate use of two or more linked registers, or lists, for both population size estimation and to investigate the relationship between variables appearing on all or only some registers. This relationship is usually not fully known because some individuals appear in only some registers, and some are not in any register. These two problems have been solved simultaneously using the EM algorithm. We extend this approach to estimate the size of the indigenous Māori population in New Zealand, leading to several innovations: (1) the approach is extended to four registers (including the population census), where the reporting of Māori status differs between registers; (2) some individuals in one or more registers have missing ethnicity, and we adapt the approach to handle this additional missingness; (3) some registers cover subsets of the population by design. We discuss under which assumptions such structural undercoverage can be ignored and provide a general result; (4) we treat the Māori indicator in each register as a variable measured with error, and embed a latent class model in the multiple system estimation to estimate the population size of a latent variable, interpreted as the true Māori status. Finally, we discuss estimating the Māori population size from administrative data only. Supplementary materials for our article are available online.
capture-recapture, population size estimation, latent class model, register coverage
arXiv
Van Der Heijden, Peter G.M.
85157917-3b33-4683-81be-713f987fd612
Cruyff, Maarten
68bcfa19-3d85-4b0f-a6a4-6e148b265f19
Smith, Paul A.
a2548525-4f99-4baf-a4d0-2b216cce059c
Bycroft, Christine
79d26985-4bb7-4e14-9e19-d5ff9069dc13
Graham, Patrick
6aee6c98-c429-4c03-84e7-25447a8d9e1a
Matheson-Dunning, Nathaniel
f008db4b-5fcd-48d0-8f88-e66fa8de4e14
Van Der Heijden, Peter G.M.
85157917-3b33-4683-81be-713f987fd612
Cruyff, Maarten
68bcfa19-3d85-4b0f-a6a4-6e148b265f19
Smith, Paul A.
a2548525-4f99-4baf-a4d0-2b216cce059c
Bycroft, Christine
79d26985-4bb7-4e14-9e19-d5ff9069dc13
Graham, Patrick
6aee6c98-c429-4c03-84e7-25447a8d9e1a
Matheson-Dunning, Nathaniel
f008db4b-5fcd-48d0-8f88-e66fa8de4e14

Van Der Heijden, Peter G.M., Cruyff, Maarten, Smith, Paul A., Bycroft, Christine, Graham, Patrick and Matheson-Dunning, Nathaniel (2021) Multiple system estimation using covariates having missing values and measurement error: estimating the size of the Māori population in New Zealand arXiv 34pp. (In Press)

Record type: Monograph (Working Paper)

Abstract

We investigate use of two or more linked registers, or lists, for both population size estimation and to investigate the relationship between variables appearing on all or only some registers. This relationship is usually not fully known because some individuals appear in only some registers, and some are not in any register. These two problems have been solved simultaneously using the EM algorithm. We extend this approach to estimate the size of the indigenous Māori population in New Zealand, leading to several innovations: (1) the approach is extended to four registers (including the population census), where the reporting of Māori status differs between registers; (2) some individuals in one or more registers have missing ethnicity, and we adapt the approach to handle this additional missingness; (3) some registers cover subsets of the population by design. We discuss under which assumptions such structural undercoverage can be ignored and provide a general result; (4) we treat the Māori indicator in each register as a variable measured with error, and embed a latent class model in the multiple system estimation to estimate the population size of a latent variable, interpreted as the true Māori status. Finally, we discuss estimating the Māori population size from administrative data only. Supplementary materials for our article are available online.

Text
2007.00929 - Version of Record
Restricted to Repository staff only
Request a copy
Other
acceptance
Restricted to Repository staff only
Request a copy

More information

Submitted date: 2 July 2020
Accepted/In Press date: 31 May 2021
Keywords: capture-recapture, population size estimation, latent class model, register coverage

Identifiers

Local EPrints ID: 442246
URI: http://eprints.soton.ac.uk/id/eprint/442246
PURE UUID: 5e4dffac-719f-44e5-9539-a00303b29f6e
ORCID for Peter G.M. Van Der Heijden: ORCID iD orcid.org/0000-0002-3345-096X
ORCID for Paul A. Smith: ORCID iD orcid.org/0000-0001-5337-2746

Catalogue record

Date deposited: 10 Jul 2020 16:30
Last modified: 16 Apr 2024 01:46

Export record

Contributors

Author: Maarten Cruyff
Author: Paul A. Smith ORCID iD
Author: Christine Bycroft
Author: Patrick Graham
Author: Nathaniel Matheson-Dunning

Download statistics

Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.

View more statistics

Atom RSS 1.0 RSS 2.0

Contact ePrints Soton: eprints@soton.ac.uk

ePrints Soton supports OAI 2.0 with a base URL of http://eprints.soton.ac.uk/cgi/oai2

This repository has been built using EPrints software, developed at the University of Southampton, but available to everyone to use.

We use cookies to ensure that we give you the best experience on our website. If you continue without changing your settings, we will assume that you are happy to receive cookies on the University of Southampton website.

×