A property of the CHAID partitioning method for
dichotomous randomized response data and categorical predictors
A property of the CHAID partitioning method for
dichotomous randomized response data and categorical predictors
In this paper, we present empirical and theoretical results on classification trees for randomized response data. We considered a dichotomous sensitive response variable with the true status intentionally misclassified by the respondents using rules prescribed by a randomized response method. We assumed that classification trees are grown using the Pearson chi-square test as a splitting criterion, and that the randomized response data are analyzed using classification trees as if they were not perturbed. We proved that classification trees analyzing observed randomized response data and estimated true data have a one-to-one correspondence in terms of ranking the splitting variables. This is illustrated using two real data sets
76-90
Perri, Pier Francesco
47d34ceb-e064-4704-a4c4-f16c1862e939
van der Heijden, Peter G.M.
85157917-3b33-4683-81be-713f987fd612
2012
Perri, Pier Francesco
47d34ceb-e064-4704-a4c4-f16c1862e939
van der Heijden, Peter G.M.
85157917-3b33-4683-81be-713f987fd612
Perri, Pier Francesco and van der Heijden, Peter G.M.
(2012)
A property of the CHAID partitioning method for
dichotomous randomized response data and categorical predictors.
Journal of Classification, 29 (1), .
(doi:10.1007/s00357-011-9094-8).
Abstract
In this paper, we present empirical and theoretical results on classification trees for randomized response data. We considered a dichotomous sensitive response variable with the true status intentionally misclassified by the respondents using rules prescribed by a randomized response method. We assumed that classification trees are grown using the Pearson chi-square test as a splitting criterion, and that the randomized response data are analyzed using classification trees as if they were not perturbed. We proved that classification trees analyzing observed randomized response data and estimated true data have a one-to-one correspondence in terms of ranking the splitting variables. This is illustrated using two real data sets
This record has no associated files available for download.
More information
Published date: 2012
Organisations:
Statistical Sciences Research Institute
Identifiers
Local EPrints ID: 344645
URI: http://eprints.soton.ac.uk/id/eprint/344645
ISSN: 0176-4268
PURE UUID: de674c54-e1d7-4b53-8621-186deb941ca1
Catalogue record
Date deposited: 07 Nov 2012 15:06
Last modified: 15 Mar 2024 03:46
Export record
Altmetrics
Contributors
Author:
Pier Francesco Perri
Download statistics
Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.
View more statistics