Statistical disclosure control for survey data
Statistical disclosure control for survey data
Statistical disclosure control refers to the methodology used in the design of the statistical outputs from a survey for protecting the confidentiality of respondents’ answers. The threat to confidentiality is assumed to come from a hypothetical intruder who has access to these outputs and seeks to use them to disclose information about a survey respondent. One key concern relates to identity disclosure, which would occur if the intruder were able to link a known individual (or other unit) to an element of the output. Another main concern relates to attribute disclosure, which would occur if the intruder could determine the value of some survey variable for an identified individual (or other unit) using the statistical output. Measures of the probability of disclosure are called disclosure risk. If this level of risk is deemed unacceptable then it may be necessary to apply a method of statistical disclosure control to the output. The choice of which method and how much protection to apply depends not just on the impact on disclosure risk but also on the impact on the utility of the output to users. This paper provides a review of statistical disclosure control methodology for two main types of survey output: (i) tables of estimates of population parameters and (ii) microdata, often released as a rectangular file of variables by analysis units. For each of these types of output, the definition and estimation of disclosure risk is discussed as well as methods for statistical disclosure control.
Southampton Statistical Sciences Research Institute, University of Southampton
Skinner, Chris
dec5ef40-49ef-492a-8a1d-eb8c6315b8ce
16 February 2009
Skinner, Chris
dec5ef40-49ef-492a-8a1d-eb8c6315b8ce
Skinner, Chris
(2009)
Statistical disclosure control for survey data
(S3RI Methodology Working Papers, M09/03)
Southampton, UK.
Southampton Statistical Sciences Research Institute, University of Southampton
21pp.
Record type:
Monograph
(Working Paper)
Abstract
Statistical disclosure control refers to the methodology used in the design of the statistical outputs from a survey for protecting the confidentiality of respondents’ answers. The threat to confidentiality is assumed to come from a hypothetical intruder who has access to these outputs and seeks to use them to disclose information about a survey respondent. One key concern relates to identity disclosure, which would occur if the intruder were able to link a known individual (or other unit) to an element of the output. Another main concern relates to attribute disclosure, which would occur if the intruder could determine the value of some survey variable for an identified individual (or other unit) using the statistical output. Measures of the probability of disclosure are called disclosure risk. If this level of risk is deemed unacceptable then it may be necessary to apply a method of statistical disclosure control to the output. The choice of which method and how much protection to apply depends not just on the impact on disclosure risk but also on the impact on the utility of the output to users. This paper provides a review of statistical disclosure control methodology for two main types of survey output: (i) tables of estimates of population parameters and (ii) microdata, often released as a rectangular file of variables by analysis units. For each of these types of output, the definition and estimation of disclosure risk is discussed as well as methods for statistical disclosure control.
Text
65447-01.pdf
- Author's Original
More information
Published date: 16 February 2009
Identifiers
Local EPrints ID: 65447
URI: http://eprints.soton.ac.uk/id/eprint/65447
PURE UUID: 4e2919f6-d0cf-49db-b635-171adcd19a07
Catalogue record
Date deposited: 17 Feb 2009
Last modified: 13 Mar 2024 17:40
Export record
Contributors
Author:
Chris Skinner
Download statistics
Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.
View more statistics