A generic tool to assess impact of changing edit rules in a business survey - an application to the UK Annual Business Inquiry part 2
Business surveys often use complex sets of edit rules (edits, for short) to check returned questionnaires (records), locate suspicious or unacceptable responses, and support data cleaning operations before the survey responses are used to estimate the required target parameters. These sets of edits are complex because they may involve large numbers of survey questionnaires and variables, they may contain a large number of edits, and the edits may depend on a large number of tolerance parameters. When such sets of edits are used, they may cause large numbers of record failures and generate substantial revision costs, especially if edit failures are dealt with by clerical operations, such as reviewing original paper questionnaires or digital images of them, and re-contacting businesses to clarify and/or correct the responses provided. Costs can be high both in terms of the resources required and in terms of the timeliness of survey processing, because editing delays the availability of the survey data for estimation and publication.
In this paper we describe a generic tool developed through a collaboration between the University of Southampton and the Office for National Statistics (ONS). The tool helps to assess the potential impact of changing the edits in a specified business survey. It is a SAS macro, written using the IML language, that calculates a number of edit performance and data quality indicators. Changes that aim to ‘relax’ the existing edits, so that failure rates decrease and efficiency savings are achieved, are assessed by means of several edit-related performance indicators, such as failure rates, hit rates and false hit rates. Data quality indicators include the proportion of errors missed and estimates of the bias resulting from missed errors for a specified revision of the set of edits. Edit designers and managers can then fine-tune their edits so that failure rates, false hit rates and editing costs are reduced while data quality is preserved. The tool is illustrated by its application to revising the edits used for the UK Annual Business Inquiry Part 2 for the reference year 2007.
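The indicators named above can be illustrated with a small sketch. It is written in Python for illustration only and is not the SAS/IML macro described in the paper. The indicator definitions used here (failure rate as the share of records flagged, hit rate as the share of flagged records that were truly in error, and so on) are assumptions, as are the names `Record`, `range_edit` and `edit_indicators` and the toy data. The sketch also assumes that fully edited values are available and can be treated as the truth.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List


@dataclass
class Record:
    reported: float      # value as returned by the business
    edited: float        # value after full clerical review (treated as the truth)
    weight: float = 1.0  # survey weight (hypothetical; 1.0 means unweighted)


def range_edit(low: float, high: float) -> Callable[[Record], bool]:
    """Hypothetical edit: a record fails if its reported value lies outside [low, high]."""
    return lambda r: not (low <= r.reported <= high)


def edit_indicators(records: List[Record], fails: Callable[[Record], bool]) -> Dict[str, float]:
    """Compute assumed versions of the edit performance and data quality indicators."""
    flagged = [r for r in records if fails(r)]
    in_error = [r for r in records if r.reported != r.edited]
    hits = [r for r in flagged if r.reported != r.edited]
    missed = [r for r in in_error if not fails(r)]

    # Weighted total if every error were corrected.
    true_total = sum(r.weight * r.edited for r in records)
    # Weighted total if only records failing the edit are corrected.
    est_total = sum(r.weight * (r.edited if fails(r) else r.reported) for r in records)

    return {
        "failure_rate": len(flagged) / len(records),
        "hit_rate": len(hits) / len(flagged) if flagged else 0.0,
        "false_hit_rate": (len(flagged) - len(hits)) / len(flagged) if flagged else 0.0,
        "errors_missed": len(missed) / len(in_error) if in_error else 0.0,
        "relative_bias": (est_total - true_total) / true_total,
    }


# Toy data, invented for illustration only.
data = [
    Record(100, 100), Record(250, 250), Record(900, 90),
    Record(40, 400), Record(120, 110), Record(300, 300),
]

strict = edit_indicators(data, range_edit(50, 200))   # current, tighter edit
relaxed = edit_indicators(data, range_edit(30, 500))  # candidate, relaxed edit

for name, result in (("strict", strict), ("relaxed", relaxed)):
    print(name, {k: round(v, 3) for k, v in result.items()})
```

On this toy data, relaxing the edit lowers the failure rate from 4 of 6 records to 1 of 6 and removes the false hits, but the proportion of errors missed rises from 1/3 to 2/3 and the relative bias of the total moves from about 0.8% to -28%. This is the kind of trade-off the indicators are meant to expose when edits are revised.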
Keywords: editing, survey quality, survey process, generic software, quality assessment
Southampton Statistical Sciences Research Institute, University of Southampton
Do Nascimento Silva, Pedro Luis
Bucknall, Robert
Zong, Ping
Al-Hamad, Alaa
Do Nascimento Silva, Pedro Luis, Bucknall, Robert, Zong, Ping and Al-Hamad, Alaa
(2008)
A generic tool to assess impact of changing edit rules in a business survey - an application to the UK Annual Business Inquiry part 2
(S3RI Methodology Working Papers, M08/10)
Southampton, UK.
Southampton Statistical Sciences Research Institute, University of Southampton
17pp.
(Submitted)
Record type:
Monograph
(Working Paper)
Text
63907-01.pdf
- Author's Original
More information
Submitted date: 17 November 2008
Identifiers
Local EPrints ID: 63907
URI: http://eprints.soton.ac.uk/id/eprint/63907
PURE UUID: a6448236-5182-4fbb-aeb6-c3f4fe665518
Catalogue record
Date deposited: 18 Nov 2008
Last modified: 15 Mar 2024 11:45
Contributors
Author:
Pedro Luis Do Nascimento Silva
Author:
Robert Bucknall
Author:
Ping Zong
Author:
Alaa Al-Hamad