The University of Southampton
University of Southampton Institutional Repository

Extending Zelterman’s approach for robust estimation of population size to zero-truncated clustered data

Extending Zelterman’s approach for robust estimation of population size to zero-truncated clustered data
Extending Zelterman’s approach for robust estimation of population size to zero-truncated clustered data
Estimation of population size with missing zero-class is an important problem that is encountered in epidemiological assessment studies. Fitting a Poisson model to the observed data by the method of maximum likelihood and estimation of the population size based on this fit is an approach that has been widely used for this purpose. In practice, however, the Poisson assumption is seldom satisfied. Zelterman (1988) has proposed a robust estimator for unclustered data that works well in a wide class of distributions applicable for count data. In the work presented here, we extend this estimator to clustered data. The estimator requires fitting a zero-truncated homogeneous Poisson model by maximum likelihood and thereby using a Horvitz–Thompson estimator of population size. This was found to work well, when the data follow the hypothesized homogeneous Poisson model. However, when the true distribution deviates from the hypothesized model, the population size was found to be underestimated. In the search of a more robust estimator, we focused on three models that use all clusters with exactly one case, those clusters with exactly two cases and those with exactly three cases to estimate the probability of the zero-class and thereby use data collected on all the clusters in the Horvitz–Thompson estimator of population size. Loss in efficiency associated with gain in robustness was examined based on a simulation study. As a trade-off between gain in robustness and loss in efficiency, the model that uses data collected on clusters with at most three cases to estimate the probability of the zero-class was found to be preferred in general. In applications, we recommend obtaining estimates from all three models and making a choice considering the estimates from the three models, robustness and the loss in efficiency
0323-3847
584-596
Navaratna, Chandanie
1575844b-b9f2-4120-bcad-3400e4414e3b
Del Rio Vilas, Victor Javier
c439650f-6c9d-42c1-80a2-2fa570de525f
Böhning, Dankmar
1df635d4-e3dc-44d0-b61d-5fd11f6434e1
Navaratna, Chandanie
1575844b-b9f2-4120-bcad-3400e4414e3b
Del Rio Vilas, Victor Javier
c439650f-6c9d-42c1-80a2-2fa570de525f
Böhning, Dankmar
1df635d4-e3dc-44d0-b61d-5fd11f6434e1

Navaratna, Chandanie, Del Rio Vilas, Victor Javier and Böhning, Dankmar (2008) Extending Zelterman’s approach for robust estimation of population size to zero-truncated clustered data. Biometrical Journal, 50 (4), 584-596. (doi:10.1002/bimj.200710441). (PMID:18663764)

Record type: Article

Abstract

Estimation of population size with missing zero-class is an important problem that is encountered in epidemiological assessment studies. Fitting a Poisson model to the observed data by the method of maximum likelihood and estimation of the population size based on this fit is an approach that has been widely used for this purpose. In practice, however, the Poisson assumption is seldom satisfied. Zelterman (1988) has proposed a robust estimator for unclustered data that works well in a wide class of distributions applicable for count data. In the work presented here, we extend this estimator to clustered data. The estimator requires fitting a zero-truncated homogeneous Poisson model by maximum likelihood and thereby using a Horvitz–Thompson estimator of population size. This was found to work well, when the data follow the hypothesized homogeneous Poisson model. However, when the true distribution deviates from the hypothesized model, the population size was found to be underestimated. In the search of a more robust estimator, we focused on three models that use all clusters with exactly one case, those clusters with exactly two cases and those with exactly three cases to estimate the probability of the zero-class and thereby use data collected on all the clusters in the Horvitz–Thompson estimator of population size. Loss in efficiency associated with gain in robustness was examined based on a simulation study. As a trade-off between gain in robustness and loss in efficiency, the model that uses data collected on clusters with at most three cases to estimate the probability of the zero-class was found to be preferred in general. In applications, we recommend obtaining estimates from all three models and making a choice considering the estimates from the three models, robustness and the loss in efficiency

This record has no associated files available for download.

More information

e-pub ahead of print date: 29 July 2008
Published date: August 2008
Organisations: Statistics, Statistical Sciences Research Institute

Identifiers

Local EPrints ID: 210481
URI: http://eprints.soton.ac.uk/id/eprint/210481
ISSN: 0323-3847
PURE UUID: 9094f4f8-5f30-4d82-a195-30445b6aa73e
ORCID for Dankmar Böhning: ORCID iD orcid.org/0000-0003-0638-7106

Catalogue record

Date deposited: 09 Feb 2012 14:11
Last modified: 15 Mar 2024 03:39

Export record

Altmetrics

Contributors

Author: Chandanie Navaratna
Author: Victor Javier Del Rio Vilas

Download statistics

Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.

View more statistics

Atom RSS 1.0 RSS 2.0

Contact ePrints Soton: eprints@soton.ac.uk

ePrints Soton supports OAI 2.0 with a base URL of http://eprints.soton.ac.uk/cgi/oai2

This repository has been built using EPrints software, developed at the University of Southampton, but available to everyone to use.

We use cookies to ensure that we give you the best experience on our website. If you continue without changing your settings, we will assume that you are happy to receive cookies on the University of Southampton website.

×