Generating contingency tables with fixed marginal probabilities and dependence structures described by loglinear models
Generating contingency tables with fixed marginal probabilities and dependence structures described by loglinear models
We present a method to generate contingency tables that follow loglinear models with prescribed marginal probabilities and dependence structures. We make use of (loglinear) Poisson regression, where the dependence structures, described using odds ratios, are implemented using an offset term. We apply this methodology to carry out simulation studies in the context of population size estimation using dual system and triple system estimators, popular in official statistics. These estimators use contingency tables that summarise the counts of elements enumerated or captured within lists that are linked. The simulation is used to investigate these estimators in the situation that the model assumptions are fulfilled, and the situation that the model assumptions are violated.
contingency tables, loglinear model, odds ratio, offset, simulation, dual-system estimator, triple-system estimator
2797-2812
Hammond, Ceejay
3bc47224-b391-4b15-b42c-730e2c21871c
van der Heijden, Peter G.M.
85157917-3b33-4683-81be-713f987fd612
Smith, Paul A.
a2548525-4f99-4baf-a4d0-2b216cce059c
2024
Hammond, Ceejay
3bc47224-b391-4b15-b42c-730e2c21871c
van der Heijden, Peter G.M.
85157917-3b33-4683-81be-713f987fd612
Smith, Paul A.
a2548525-4f99-4baf-a4d0-2b216cce059c
Hammond, Ceejay, van der Heijden, Peter G.M. and Smith, Paul A.
(2024)
Generating contingency tables with fixed marginal probabilities and dependence structures described by loglinear models.
Journal of Statistical Computation and Simulation, 94 (12), .
(doi:10.48550/arXiv.2303.08568).
Abstract
We present a method to generate contingency tables that follow loglinear models with prescribed marginal probabilities and dependence structures. We make use of (loglinear) Poisson regression, where the dependence structures, described using odds ratios, are implemented using an offset term. We apply this methodology to carry out simulation studies in the context of population size estimation using dual system and triple system estimators, popular in official statistics. These estimators use contingency tables that summarise the counts of elements enumerated or captured within lists that are linked. The simulation is used to investigate these estimators in the situation that the model assumptions are fulfilled, and the situation that the model assumptions are violated.
Text
2303.08568
- Accepted Manuscript
More information
Accepted/In Press date: 5 May 2024
e-pub ahead of print date: 15 June 2024
Published date: 2024
Keywords:
contingency tables, loglinear model, odds ratio, offset, simulation, dual-system estimator, triple-system estimator
Identifiers
Local EPrints ID: 476208
URI: http://eprints.soton.ac.uk/id/eprint/476208
ISSN: 0094-9655
PURE UUID: b184747b-b2bc-48fa-ae10-e2ac1057c324
Catalogue record
Date deposited: 14 Apr 2023 16:32
Last modified: 05 Nov 2024 02:46
Export record
Altmetrics
Contributors
Author:
Ceejay Hammond
Download statistics
Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.
View more statistics