Statistical disclosure control methods for census frequency tables
Statistical disclosure control methods for census frequency tables
 
  This paper provides a review of common statistical disclosure control (SDC) methods implemented at Statistical Agencies for standard tabular outputs containing whole population counts from a Census (either enumerated or based on a register). These methods include record swapping on the microdata prior to its tabulation and rounding of entries in the tables after they are produced. The approach for assessing SDC methods is based on a disclosure risk–data utility framework and the need to find the balance between managing disclosure risk while maximizing the amount of information that can be released to users and ensuring high quality outputs. To carry out the analysis, quantitative measures of disclosure risk and data utility are defined and methods compared. Conclusions from the analysis show that record swapping as a sole SDC method leaves high probabilities of disclosure risk. Targeted record swapping lowers the disclosure risk, but there is more distortion to distributions. Small cell adjustments (rounding) give protection to Census tables by eliminating small cells but only one set of variables and geographies can be disseminated in order to avoid disclosure by differencing nested tables. Full random rounding offers more protection against disclosure by differencing, but margins are typically rounded separately from the internal cells and tables are not additive. Rounding procedures protect against the perception of disclosure risk compared to record swapping since no small cells appear in the tables. Combining rounding with record swapping raises the level of protection but increases the loss of utility to Census tabular outputs. For some statistical analysis, the combination of record swapping and rounding balances to some degree opposing effects that the methods have on the utility of the tables.
  
  
    University of Southampton
   
  
    
      Shlomo, Natalie
      
        e749febc-b7b9-4017-be48-96d59dd03215
      
     
  
  
   
  
  
    
      28 February 2007
    
    
  
  
    
      Shlomo, Natalie
      
        e749febc-b7b9-4017-be48-96d59dd03215
      
     
  
       
    
 
  
    
      
  
  
  
  
  
  
    Shlomo, Natalie
  
  
  
  
   
    (2007)
  
  
    
    Statistical disclosure control methods for census frequency tables
  
  
  
    (S3RI Methodology Working Papers, M07/04)
  
  
  
  
    
      
        
   
  
    Southampton, UK.
   
        
      
    
  
  University of Southampton 
  40pp.
  
  
  
  
  
   
  
    
      Record type:
      Monograph
      
      (Working Paper)
      
    
   
    
    
      
        
          Abstract
          This paper provides a review of common statistical disclosure control (SDC) methods implemented at Statistical Agencies for standard tabular outputs containing whole population counts from a Census (either enumerated or based on a register). These methods include record swapping on the microdata prior to its tabulation and rounding of entries in the tables after they are produced. The approach for assessing SDC methods is based on a disclosure risk–data utility framework and the need to find the balance between managing disclosure risk while maximizing the amount of information that can be released to users and ensuring high quality outputs. To carry out the analysis, quantitative measures of disclosure risk and data utility are defined and methods compared. Conclusions from the analysis show that record swapping as a sole SDC method leaves high probabilities of disclosure risk. Targeted record swapping lowers the disclosure risk, but there is more distortion to distributions. Small cell adjustments (rounding) give protection to Census tables by eliminating small cells but only one set of variables and geographies can be disseminated in order to avoid disclosure by differencing nested tables. Full random rounding offers more protection against disclosure by differencing, but margins are typically rounded separately from the internal cells and tables are not additive. Rounding procedures protect against the perception of disclosure risk compared to record swapping since no small cells appear in the tables. Combining rounding with record swapping raises the level of protection but increases the loss of utility to Census tabular outputs. For some statistical analysis, the combination of record swapping and rounding balances to some degree opposing effects that the methods have on the utility of the tables.
         
      
      
        
          
            
  
    Text
 44610-01.pdf
     - Other
   
  
  
 
          
            
          
            
           
            
           
        
        
       
    
   
  
  
  More information
  
    
      Published date: 28 February 2007
 
    
  
  
    
  
    
     
        Additional Information:
        This paper supersedes working paper M06/03.
      
    
  
    
  
    
  
    
  
    
  
    
  
    
  
  
  
    
  
  
        Identifiers
        Local EPrints ID: 44610
        URI: http://eprints.soton.ac.uk/id/eprint/44610
        
        
        
        
          PURE UUID: 9c34d829-5481-4c56-baa5-e824e30476df
        
  
    
        
          
        
    
  
  Catalogue record
  Date deposited: 28 Feb 2007
  Last modified: 11 Dec 2021 16:23
  Export record
  
  
 
 
  
    
    
      Contributors
      
          
          Author:
          
            
            
              Natalie Shlomo
            
          
        
      
      
      
    
  
   
  
    Download statistics
    
      Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.
      
      View more statistics