Methods for combining administrative data to estimate population counts
Methods for combining administrative data to estimate population counts
Governments require information about population counts and characteristics in order to make plans, develop policies and provide public services. The main source of this information is the traditional population censuses. However, they are costly, and the information collected by the decennial censuses goes out-of-date easily. For this reason, this thesis has two main aims: to develop methodologies to combine administrative data sources to estimate population counts in the absence of both a traditional census, and to produce uncertainty estimates for the estimated population counts.
Although, the methodologies are illustrated using administrative data sources from England and Wales, they can easily be applied to other countries' administrative data sources. The most comprehensive administrative sources in England and Wales are the NHS Patient Register and the Customer Information System. However, it is known that both of these sources exceed the census estimates. Therefore, it is crucial to use another source to adjust the bias to estimate population counts using these administrative sources.
Three substantial chapters assessing methodologies to combine administrative sources with the auxiliary information are presented. The first of these chapters presents a basis methodology, log-linear models with offsets, which is extended in the following chapters. The second chapter extends these models by using individually linked administrative sources. The third chapter improves on the basis models to produce measures of uncertainty.
This thesis evaluates different log-linear models in terms of their capacity for producing accurate population counts for age group, sex and local authority groups both within the classical and the Bayesian framework. On the other hand, it also presents a detailed perspective to understand which population groups tend to be missed by the administrative data in England and Wales, and how much they can be improved just by combining them with the specific association structures obtained from auxiliary data sources.
Yildiz, Dilek
71798192-b121-4cd0-9025-7ad5131ac6d5
April 2016
Yildiz, Dilek
71798192-b121-4cd0-9025-7ad5131ac6d5
Smith, Peter
961a01a3-bf4c-43ca-9599-5be4fd5d3940
Van Der Heijden, Peter
85157917-3b33-4683-81be-713f987fd612
Yildiz, Dilek
(2016)
Methods for combining administrative data to estimate population counts.
University of Southampton, Faculty of Social, Human and Mathematical Sciences, Doctoral Thesis, 156pp.
Record type:
Thesis
(Doctoral)
Abstract
Governments require information about population counts and characteristics in order to make plans, develop policies and provide public services. The main source of this information is the traditional population censuses. However, they are costly, and the information collected by the decennial censuses goes out-of-date easily. For this reason, this thesis has two main aims: to develop methodologies to combine administrative data sources to estimate population counts in the absence of both a traditional census, and to produce uncertainty estimates for the estimated population counts.
Although, the methodologies are illustrated using administrative data sources from England and Wales, they can easily be applied to other countries' administrative data sources. The most comprehensive administrative sources in England and Wales are the NHS Patient Register and the Customer Information System. However, it is known that both of these sources exceed the census estimates. Therefore, it is crucial to use another source to adjust the bias to estimate population counts using these administrative sources.
Three substantial chapters assessing methodologies to combine administrative sources with the auxiliary information are presented. The first of these chapters presents a basis methodology, log-linear models with offsets, which is extended in the following chapters. The second chapter extends these models by using individually linked administrative sources. The third chapter improves on the basis models to produce measures of uncertainty.
This thesis evaluates different log-linear models in terms of their capacity for producing accurate population counts for age group, sex and local authority groups both within the classical and the Bayesian framework. On the other hand, it also presents a detailed perspective to understand which population groups tend to be missed by the administrative data in England and Wales, and how much they can be improved just by combining them with the specific association structures obtained from auxiliary data sources.
Text
D.Yildiz_Thesis_not_signed.pdf
- Other
Restricted to Repository staff only
More information
Published date: April 2016
Organisations:
University of Southampton, Social Statistics & Demography
Identifiers
Local EPrints ID: 397608
URI: http://eprints.soton.ac.uk/id/eprint/397608
PURE UUID: 4a6df3a4-f848-431e-afd8-689073d5bfda
Catalogue record
Date deposited: 15 Jul 2016 09:28
Last modified: 15 Mar 2024 03:46
Export record
Contributors
Author:
Dilek Yildiz
Download statistics
Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.
View more statistics