Estimating the prevalence of sensitive behaviour and cheating with a dual design for direct questioning and randomized response
Estimating the prevalence of sensitive behaviour and cheating with a dual design for direct questioning and randomized response
Randomized response is a misclassification design to estimate the prevalence of sensitive behaviour. Respondents who do not follow the instructions of the design are considered to be cheating. A mixture model is proposed to estimate the prevalence of sensitive behaviour and cheating in the case of a dual sampling scheme with direct questioning and randomized response. The mixing weight is the probability of cheating, where cheating is modelled separately for direct questioning and randomized response. For Bayesian inference, Markov chain Monte Carlo sampling is applied to sample parameter values from the posterior. The model makes it possible to analyse dual sample scheme data in a unified way and to assess cheating for direct questions as well as for randomized response questions. The research is illustrated with randomized response data concerning violations of regulations for social benefit
bayesian inference, cheating, misclassification, sensitive items, social benefit fraud
723-736
van den Hout, Ardo
8df0fd5b-6578-4a1f-ba11-790b56e8af55
Böckenholt, Ulf
580c976b-4040-4cb2-877c-35e042a224df
van der Heijden, Peter G.M.
85157917-3b33-4683-81be-713f987fd612
August 2010
van den Hout, Ardo
8df0fd5b-6578-4a1f-ba11-790b56e8af55
Böckenholt, Ulf
580c976b-4040-4cb2-877c-35e042a224df
van der Heijden, Peter G.M.
85157917-3b33-4683-81be-713f987fd612
van den Hout, Ardo, Böckenholt, Ulf and van der Heijden, Peter G.M.
(2010)
Estimating the prevalence of sensitive behaviour and cheating with a dual design for direct questioning and randomized response.
Journal of the Royal Statistical Society. Series C: Applied Statistics, 59 (4), .
(doi:10.1111/j.1467-9876.2010.00720.x).
(PMID:21461334)
Abstract
Randomized response is a misclassification design to estimate the prevalence of sensitive behaviour. Respondents who do not follow the instructions of the design are considered to be cheating. A mixture model is proposed to estimate the prevalence of sensitive behaviour and cheating in the case of a dual sampling scheme with direct questioning and randomized response. The mixing weight is the probability of cheating, where cheating is modelled separately for direct questioning and randomized response. For Bayesian inference, Markov chain Monte Carlo sampling is applied to sample parameter values from the posterior. The model makes it possible to analyse dual sample scheme data in a unified way and to assess cheating for direct questions as well as for randomized response questions. The research is illustrated with randomized response data concerning violations of regulations for social benefit
This record has no associated files available for download.
More information
e-pub ahead of print date: 21 June 2010
Published date: August 2010
Keywords:
bayesian inference, cheating, misclassification, sensitive items, social benefit fraud
Organisations:
Statistical Sciences Research Institute
Identifiers
Local EPrints ID: 344660
URI: http://eprints.soton.ac.uk/id/eprint/344660
ISSN: 0035-9254
PURE UUID: b6ba305b-eeac-4bfc-9dac-5c551062cfe3
Catalogue record
Date deposited: 07 Nov 2012 15:22
Last modified: 15 Mar 2024 03:46
Export record
Altmetrics
Contributors
Author:
Ardo van den Hout
Author:
Ulf Böckenholt
Download statistics
Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.
View more statistics