Logit tree models for discrete choice data with application to advice-seeking preferences among Chinese Christians
Logit tree models for discrete choice data with application to advice-seeking preferences among Chinese Christians
Logit models are popular tools for analyzing discrete choice and ranking data. The models assume that judges rate each item with a measurable utility, and the ordering of a judge’s utilities determines the outcome. Logit models have been proven to be powerful tools, but they become difficult to interpret if the models contain nonlinear and interaction terms. We extended the logit models by adding a decision tree structure to overcome this difficulty. We introduced a new method of tree splitting variable selection that distinguishes the nonlinear and linear effects, and the variable with the strongest nonlinear effect will be selected in the view that linear effect is best modeled using the logit model. Decision trees built in this fashion were shown to have smaller sizes than those using loglikelihood-based splitting criteria. In addition, the proposed splitting methods could save computational time and avoid bias in choosing the optimal splitting variable. Issues on variable selection in logit models are also investigated, and forward selection criterion was shown to work well with logit tree models. Focused on ranking data, simulations are carried out and the results showed that our proposed splitting methods are unbiased. Finally, to demonstrate the feasibility of the logit tree models, they were applied to analyze two datasets, one with binary outcome and the other with ranking outcome.
Binary data, Decision tree, Multinomial data, Ranking data, Variable selection
799-827
Yu, Philip L.H.
67db467c-4f19-4c55-8ad9-0c13faeb15d6
Lee, Paul H.
02620eab-ae7f-4a1c-bad1-8a50e7e48951
Cheung, S. F.
80494bac-30c7-490e-9c4c-4ab80e06374a
Lau, Esther Y.Y.
3d813a04-4ac3-4373-bad7-154f3ea31f5f
Mok, Doris S.Y.
fecc819c-157a-4883-b539-2238017e72e6
Hui, Harry C.
ee44cc99-5d99-45fe-8565-12d1532b2b84
1 June 2016
Yu, Philip L.H.
67db467c-4f19-4c55-8ad9-0c13faeb15d6
Lee, Paul H.
02620eab-ae7f-4a1c-bad1-8a50e7e48951
Cheung, S. F.
80494bac-30c7-490e-9c4c-4ab80e06374a
Lau, Esther Y.Y.
3d813a04-4ac3-4373-bad7-154f3ea31f5f
Mok, Doris S.Y.
fecc819c-157a-4883-b539-2238017e72e6
Hui, Harry C.
ee44cc99-5d99-45fe-8565-12d1532b2b84
Yu, Philip L.H., Lee, Paul H. and Cheung, S. F.
,
et al.
(2016)
Logit tree models for discrete choice data with application to advice-seeking preferences among Chinese Christians.
Computational Statistics, 31 (2), .
(doi:10.1007/s00180-015-0588-4).
Abstract
Logit models are popular tools for analyzing discrete choice and ranking data. The models assume that judges rate each item with a measurable utility, and the ordering of a judge’s utilities determines the outcome. Logit models have been proven to be powerful tools, but they become difficult to interpret if the models contain nonlinear and interaction terms. We extended the logit models by adding a decision tree structure to overcome this difficulty. We introduced a new method of tree splitting variable selection that distinguishes the nonlinear and linear effects, and the variable with the strongest nonlinear effect will be selected in the view that linear effect is best modeled using the logit model. Decision trees built in this fashion were shown to have smaller sizes than those using loglikelihood-based splitting criteria. In addition, the proposed splitting methods could save computational time and avoid bias in choosing the optimal splitting variable. Issues on variable selection in logit models are also investigated, and forward selection criterion was shown to work well with logit tree models. Focused on ranking data, simulations are carried out and the results showed that our proposed splitting methods are unbiased. Finally, to demonstrate the feasibility of the logit tree models, they were applied to analyze two datasets, one with binary outcome and the other with ranking outcome.
This record has no associated files available for download.
More information
Published date: 1 June 2016
Additional Information:
Funding Information:
The research of Philip L. H. Yu was supported by a grant from the Research Grants Council of the Hong Kong Special Administrative Region, China (Project No. HKU 7473/05H). We thank the associate editor and two anonymous referees for their helpful suggestions for improving this article.
Publisher Copyright:
© 2015, Springer-Verlag Berlin Heidelberg.
Keywords:
Binary data, Decision tree, Multinomial data, Ranking data, Variable selection
Identifiers
Local EPrints ID: 475170
URI: http://eprints.soton.ac.uk/id/eprint/475170
ISSN: 0943-4062
PURE UUID: 33feb94e-0570-4f4d-98bd-2c3e1a0e6312
Catalogue record
Date deposited: 10 Mar 2023 17:59
Last modified: 18 Mar 2024 04:09
Export record
Altmetrics
Contributors
Author:
Philip L.H. Yu
Author:
Paul H. Lee
Author:
S. F. Cheung
Author:
Esther Y.Y. Lau
Author:
Doris S.Y. Mok
Author:
Harry C. Hui
Corporate Author: et al.
Download statistics
Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.
View more statistics