Assessing the protection provided by misclassification-based disclosure limitation methods for survey microdata.
Assessing the protection provided by misclassification-based disclosure limitation methods for survey microdata.
Government statistical agencies often apply statistical disclosure limitation techniques to survey microdata to protect the confidentiality of respondents.
There is a need for valid and practical ways to assess
the protection provided. This paper develops some simple methods for disclosure limitation techniques which perturb the values of categorical identifying variables. The methods are applied in numerical experiments based upon census data from the United Kingdom which are subject to
two perturbation techniques: data swapping (random and targeted) and the post randomisation method. Some simplifying approximations to the measure of risk are found to work well in capturing the impacts of these techniques.
These approximations provide simple extensions of existing risk assessment methods based upon Poisson log-linear models. A numerical experiment is also undertaken to assess the impact of multivariate misclassification with an increasing number of identifying variables.
It is found that the misclassification dominates the usual
monotone increasing relationship between this number and risk so that the risk eventually declines, implying less sensitivity of risk to choice of identifying variables. The methods developed in this paper may also be used to obtain more realistic assessments of risk which take account
of the kinds of measurement and other non-sampling errors commonly arising in surveys.
disclosure risk, identification risk, log linear
model, measurement error, post randomization method, data swapping.
Shlomo, Natalie
e749febc-b7b9-4017-be48-96d59dd03215
Skinner, Chris
dec5ef40-49ef-492a-8a1d-eb8c6315b8ce
2010
Shlomo, Natalie
e749febc-b7b9-4017-be48-96d59dd03215
Skinner, Chris
dec5ef40-49ef-492a-8a1d-eb8c6315b8ce
Shlomo, Natalie and Skinner, Chris
(2010)
Assessing the protection provided by misclassification-based disclosure limitation methods for survey microdata.
The Annals of Applied Statistics, Vol. 4 (No. 3).
Abstract
Government statistical agencies often apply statistical disclosure limitation techniques to survey microdata to protect the confidentiality of respondents.
There is a need for valid and practical ways to assess
the protection provided. This paper develops some simple methods for disclosure limitation techniques which perturb the values of categorical identifying variables. The methods are applied in numerical experiments based upon census data from the United Kingdom which are subject to
two perturbation techniques: data swapping (random and targeted) and the post randomisation method. Some simplifying approximations to the measure of risk are found to work well in capturing the impacts of these techniques.
These approximations provide simple extensions of existing risk assessment methods based upon Poisson log-linear models. A numerical experiment is also undertaken to assess the impact of multivariate misclassification with an increasing number of identifying variables.
It is found that the misclassification dominates the usual
monotone increasing relationship between this number and risk so that the risk eventually declines, implying less sensitivity of risk to choice of identifying variables. The methods developed in this paper may also be used to obtain more realistic assessments of risk which take account
of the kinds of measurement and other non-sampling errors commonly arising in surveys.
This record has no associated files available for download.
More information
Published date: 2010
Keywords:
disclosure risk, identification risk, log linear
model, measurement error, post randomization method, data swapping.
Identifiers
Local EPrints ID: 150071
URI: http://eprints.soton.ac.uk/id/eprint/150071
PURE UUID: 5f3ac2ad-90c0-4201-bb4b-5e1ec383a3ef
Catalogue record
Date deposited: 04 May 2010 10:37
Last modified: 08 Jan 2022 17:26
Export record
Contributors
Author:
Natalie Shlomo
Author:
Chris Skinner
Download statistics
Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.
View more statistics