The University of Southampton
University of Southampton Institutional Repository

Estimating the re-identification risk per record in microdata

Estimating the re-identification risk per record in microdata
Estimating the re-identification risk per record in microdata
A measure of re-identification risk at the record level has a variety of potential uses in statistical disclosure control for microdata. The conceptual basis of such a measure is considered. The risk is conceived of broadly as the evidence in support of a link between the record and the unit in the population from which it is derived. For discrete key variables subject to no measurement error, a measure is derived which reflects the probability that the record is unique in the population. Under certain assumptions, two approaches are described for estimating this measure from the microdata. These approaches are applied to a 10% sample of microdata from the 1991 Census in Great Britain. It is found that the resulting risk measures can indeed be used successfully to establish whether sample unique records are unique in the population. The implications of these findings are discussed.
key variable, log-linear model, lognormal distribution, population uniqueness, statistical disclosure control
0282-423X
361-372
Skinner, C.J.
48081d82-c596-436e-8846-c9d0a1bf158d
Holmes, D.J.
acb9dc00-6021-4eee-8219-2c5032d62ce7
Skinner, C.J.
48081d82-c596-436e-8846-c9d0a1bf158d
Holmes, D.J.
acb9dc00-6021-4eee-8219-2c5032d62ce7

Skinner, C.J. and Holmes, D.J. (1998) Estimating the re-identification risk per record in microdata. Journal of Official Statistics, 14 (4), 361-372.

Record type: Article

Abstract

A measure of re-identification risk at the record level has a variety of potential uses in statistical disclosure control for microdata. The conceptual basis of such a measure is considered. The risk is conceived of broadly as the evidence in support of a link between the record and the unit in the population from which it is derived. For discrete key variables subject to no measurement error, a measure is derived which reflects the probability that the record is unique in the population. Under certain assumptions, two approaches are described for estimating this measure from the microdata. These approaches are applied to a 10% sample of microdata from the 1991 Census in Great Britain. It is found that the resulting risk measures can indeed be used successfully to establish whether sample unique records are unique in the population. The implications of these findings are discussed.

This record has no associated files available for download.

More information

Published date: 1998
Keywords: key variable, log-linear model, lognormal distribution, population uniqueness, statistical disclosure control

Identifiers

Local EPrints ID: 34237
URI: http://eprints.soton.ac.uk/id/eprint/34237
ISSN: 0282-423X
PURE UUID: d279b640-dcf4-48b6-8b5f-72e408854ecf

Catalogue record

Date deposited: 20 Dec 2007
Last modified: 11 Dec 2021 15:23

Export record

Contributors

Author: C.J. Skinner
Author: D.J. Holmes

Download statistics

Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.

View more statistics

Atom RSS 1.0 RSS 2.0

Contact ePrints Soton: eprints@soton.ac.uk

ePrints Soton supports OAI 2.0 with a base URL of http://eprints.soton.ac.uk/cgi/oai2

This repository has been built using EPrints software, developed at the University of Southampton, but available to everyone to use.

We use cookies to ensure that we give you the best experience on our website. If you continue without changing your settings, we will assume that you are happy to receive cookies on the University of Southampton website.

×