The University of Southampton
University of Southampton Institutional Repository

Logit tree models for discrete choice data with application to advice-seeking preferences among Chinese Christians

Logit tree models for discrete choice data with application to advice-seeking preferences among Chinese Christians
Logit tree models for discrete choice data with application to advice-seeking preferences among Chinese Christians
Logit models are popular tools for analyzing discrete choice and ranking data. The models assume that judges rate each item with a measurable utility, and the ordering of a judge’s utilities determines the outcome. Logit models have been proven to be powerful tools, but they become difficult to interpret if the models contain nonlinear and interaction terms. We extended the logit models by adding a decision tree structure to overcome this difficulty. We introduced a new method of tree splitting variable selection that distinguishes the nonlinear and linear effects, and the variable with the strongest nonlinear effect will be selected in the view that linear effect is best modeled using the logit model. Decision trees built in this fashion were shown to have smaller sizes than those using loglikelihood-based splitting criteria. In addition, the proposed splitting methods could save computational time and avoid bias in choosing the optimal splitting variable. Issues on variable selection in logit models are also investigated, and forward selection criterion was shown to work well with logit tree models. Focused on ranking data, simulations are carried out and the results showed that our proposed splitting methods are unbiased. Finally, to demonstrate the feasibility of the logit tree models, they were applied to analyze two datasets, one with binary outcome and the other with ranking outcome.
Binary data, Decision tree, Multinomial data, Ranking data, Variable selection
0943-4062
799-827
Yu, Philip L.H.
67db467c-4f19-4c55-8ad9-0c13faeb15d6
Lee, Paul H.
02620eab-ae7f-4a1c-bad1-8a50e7e48951
Cheung, S. F.
80494bac-30c7-490e-9c4c-4ab80e06374a
Lau, Esther Y.Y.
3d813a04-4ac3-4373-bad7-154f3ea31f5f
Mok, Doris S.Y.
fecc819c-157a-4883-b539-2238017e72e6
Hui, Harry C.
ee44cc99-5d99-45fe-8565-12d1532b2b84
et al.
Yu, Philip L.H.
67db467c-4f19-4c55-8ad9-0c13faeb15d6
Lee, Paul H.
02620eab-ae7f-4a1c-bad1-8a50e7e48951
Cheung, S. F.
80494bac-30c7-490e-9c4c-4ab80e06374a
Lau, Esther Y.Y.
3d813a04-4ac3-4373-bad7-154f3ea31f5f
Mok, Doris S.Y.
fecc819c-157a-4883-b539-2238017e72e6
Hui, Harry C.
ee44cc99-5d99-45fe-8565-12d1532b2b84

Yu, Philip L.H., Lee, Paul H. and Cheung, S. F. , et al. (2016) Logit tree models for discrete choice data with application to advice-seeking preferences among Chinese Christians. Computational Statistics, 31 (2), 799-827. (doi:10.1007/s00180-015-0588-4).

Record type: Article

Abstract

Logit models are popular tools for analyzing discrete choice and ranking data. The models assume that judges rate each item with a measurable utility, and the ordering of a judge’s utilities determines the outcome. Logit models have been proven to be powerful tools, but they become difficult to interpret if the models contain nonlinear and interaction terms. We extended the logit models by adding a decision tree structure to overcome this difficulty. We introduced a new method of tree splitting variable selection that distinguishes the nonlinear and linear effects, and the variable with the strongest nonlinear effect will be selected in the view that linear effect is best modeled using the logit model. Decision trees built in this fashion were shown to have smaller sizes than those using loglikelihood-based splitting criteria. In addition, the proposed splitting methods could save computational time and avoid bias in choosing the optimal splitting variable. Issues on variable selection in logit models are also investigated, and forward selection criterion was shown to work well with logit tree models. Focused on ranking data, simulations are carried out and the results showed that our proposed splitting methods are unbiased. Finally, to demonstrate the feasibility of the logit tree models, they were applied to analyze two datasets, one with binary outcome and the other with ranking outcome.

This record has no associated files available for download.

More information

Published date: 1 June 2016
Additional Information: Funding Information: The research of Philip L. H. Yu was supported by a grant from the Research Grants Council of the Hong Kong Special Administrative Region, China (Project No. HKU 7473/05H). We thank the associate editor and two anonymous referees for their helpful suggestions for improving this article. Publisher Copyright: © 2015, Springer-Verlag Berlin Heidelberg.
Keywords: Binary data, Decision tree, Multinomial data, Ranking data, Variable selection

Identifiers

Local EPrints ID: 475170
URI: http://eprints.soton.ac.uk/id/eprint/475170
ISSN: 0943-4062
PURE UUID: 33feb94e-0570-4f4d-98bd-2c3e1a0e6312
ORCID for Paul H. Lee: ORCID iD orcid.org/0000-0002-5729-6450

Catalogue record

Date deposited: 10 Mar 2023 17:59
Last modified: 18 Mar 2024 04:09

Export record

Altmetrics

Contributors

Author: Philip L.H. Yu
Author: Paul H. Lee ORCID iD
Author: S. F. Cheung
Author: Esther Y.Y. Lau
Author: Doris S.Y. Mok
Author: Harry C. Hui
Corporate Author: et al.

Download statistics

Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.

View more statistics

Atom RSS 1.0 RSS 2.0

Contact ePrints Soton: eprints@soton.ac.uk

ePrints Soton supports OAI 2.0 with a base URL of http://eprints.soton.ac.uk/cgi/oai2

This repository has been built using EPrints software, developed at the University of Southampton, but available to everyone to use.

We use cookies to ensure that we give you the best experience on our website. If you continue without changing your settings, we will assume that you are happy to receive cookies on the University of Southampton website.

×