Dual system estimation using mixed effects loglinear models
Dual system estimation using mixed effects loglinear models
In official statistics, dual system estimation (DSE) is a well-known tool to estimate the size of a population. Two sources are linked, and the number of units that are missed by both sources is estimated. Often dual system estimation is carried out in each of the levels of a stratifying variable, such as region. DSE can be considered a loglinear independence model, and, with a stratifying variable, a loglinear conditional independence model. The standard approach is to estimate parameters for each level of the stratifying variable. Thus, when the number of levels of the stratifying variable is large, the number of parameters estimated is large as well. Mixed effects loglinear models, where sets of parameters involving the stratifying variable are replaced by a distribution parameterised by its mean and a variance, have also been proposed, and we investigate their properties through simulation. In our simulation studies the mixed effects loglinear model outperforms the fixed effects loglinear model although only to a small extent in terms of mean squared error. We show how mixed effects dual system estimation can be extended to multiple system estimation.
Hammond, Ceejay
3bc47224-b391-4b15-b42c-730e2c21871c
Smith, Paul A.
a2548525-4f99-4baf-a4d0-2b216cce059c
van der Heijden, Peter G.M.
85157917-3b33-4683-81be-713f987fd612
2 May 2025
Hammond, Ceejay
3bc47224-b391-4b15-b42c-730e2c21871c
Smith, Paul A.
a2548525-4f99-4baf-a4d0-2b216cce059c
van der Heijden, Peter G.M.
85157917-3b33-4683-81be-713f987fd612
[Unknown type: UNSPECIFIED]
Abstract
In official statistics, dual system estimation (DSE) is a well-known tool to estimate the size of a population. Two sources are linked, and the number of units that are missed by both sources is estimated. Often dual system estimation is carried out in each of the levels of a stratifying variable, such as region. DSE can be considered a loglinear independence model, and, with a stratifying variable, a loglinear conditional independence model. The standard approach is to estimate parameters for each level of the stratifying variable. Thus, when the number of levels of the stratifying variable is large, the number of parameters estimated is large as well. Mixed effects loglinear models, where sets of parameters involving the stratifying variable are replaced by a distribution parameterised by its mean and a variance, have also been proposed, and we investigate their properties through simulation. In our simulation studies the mixed effects loglinear model outperforms the fixed effects loglinear model although only to a small extent in terms of mean squared error. We show how mixed effects dual system estimation can be extended to multiple system estimation.
Text
2505.01359v1
- Author's Original
More information
Published date: 2 May 2025
Identifiers
Local EPrints ID: 502970
URI: http://eprints.soton.ac.uk/id/eprint/502970
PURE UUID: ef7a7889-bc6c-472e-90b8-aefb04ee4784
Catalogue record
Date deposited: 15 Jul 2025 16:49
Last modified: 22 Aug 2025 02:11
Export record
Altmetrics
Contributors
Author:
Ceejay Hammond
Download statistics
Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.
View more statistics