Variable selection in general multinomial logit models

Publication: Article, Research
Date: 01 Feb 2015
Country: Germany
Language: English
Publisher: Elsevier BV
Journal: Computational Statistics & Data Analysis, volume 82, pages 207-222 (ISSN: 0167-9473)

Authors: Gerhard Tutz; Wolfgang Pößnecker; Lorenz Uhlmann

doi: 10.1016/j.csda.2014.09.009 , 10.5282/ubm/epub.13114 , 10.5282/ubm/epub.14063

Abstract

The use of the multinomial logit model is typically restricted to applications with few predictors, because in high-dimensional settings maximum likelihood estimates tend to deteriorate. In this paper we propose a sparsity-inducing penalty that accounts for the special structure of multinomial models. In contrast to existing methods, it penalizes the parameters that are linked to one variable in a grouped way and thus yields variable selection instead of parameter selection. We develop a proximal gradient method that efficiently computes stable estimates. In addition, the penalization is extended to the important case of predictors that vary across response categories. We apply our estimator to modeling the party choice of voters in Germany, including voter-specific variables such as age and gender as well as party-specific features such as stance on nuclear energy and immigration.
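To make the idea of grouped penalization concrete, the following is a minimal sketch rather than the authors' implementation. It assumes a plain group-lasso penalty on each predictor's coefficient vector across all response categories, a symmetric parametrization with one coefficient column per category, an unpenalized intercept, and a fixed conservative step size. The function names (`group_soft_threshold`, `fit_grouped_multinomial`) and the simulated data are hypothetical and do not come from the paper.

```python
import numpy as np

def softmax(Z):
    # Row-wise softmax, shifted for numerical stability.
    Z = Z - Z.max(axis=1, keepdims=True)
    E = np.exp(Z)
    return E / E.sum(axis=1, keepdims=True)

def group_soft_threshold(B, thresh):
    # Proximal operator of thresh * sum_j ||B[j, :]||_2.
    # Each row (all coefficients of one predictor) is shrunk jointly,
    # so a predictor is either kept or removed for all categories.
    norms = np.linalg.norm(B, axis=1, keepdims=True)
    scale = np.maximum(0.0, 1.0 - thresh / np.maximum(norms, 1e-12))
    return scale * B

def fit_grouped_multinomial(X, y, n_classes, lam, n_iter=500):
    # Proximal gradient descent on the average multinomial negative
    # log-likelihood plus lam * sum_j ||B[j, :]||_2 (assumed penalty).
    n, p = X.shape
    Xa = np.hstack([np.ones((n, 1)), X])   # first column: intercept
    Y = np.eye(n_classes)[y]               # one-hot responses
    W = np.zeros((p + 1, n_classes))
    # Conservative fixed step: n / sigma_max(Xa)^2.
    step = n / np.linalg.norm(Xa, 2) ** 2
    for _ in range(n_iter):
        grad = Xa.T @ (softmax(Xa @ W) - Y) / n
        W = W - step * grad
        # Penalize predictor rows only; the intercept row stays free.
        W[1:] = group_soft_threshold(W[1:], step * lam)
    return W[0], W[1:]

# Simulated data (hypothetical): only the first two of six predictors matter.
rng = np.random.default_rng(0)
X = rng.normal(size=(300, 6))
B_true = np.zeros((6, 3))
B_true[0] = [2.0, -2.0, 0.0]
B_true[1] = [0.0, 2.0, -2.0]
y = np.array([rng.choice(3, p=prob) for prob in softmax(X @ B_true)])

intercept, B_hat = fit_grouped_multinomial(X, y, n_classes=3, lam=0.1)
print("selected predictors:", np.flatnonzero(np.linalg.norm(B_hat, axis=1) > 0))
```

Because the threshold acts on the norm of a whole row, a predictor drops out of every category at once, which is the difference between variable selection and parameter selection that the abstract describes. An ordinary lasso would instead threshold each entry of the coefficient matrix separately.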

Country

Germany

Related Organizations

Ludwig-Maximilians-Universität München
Germany
Heidelberg University
Germany

Keywords

Generalized linear models (logistic models), Ridge regression; shrinkage estimators (Lasso), Nonparametric regression and quantile regression, Computational methods for problems pertaining to statistics, Multinomial logit model, Logistic regression, Variable selection, Lasso, Group Lasso, CATS Lasso, 510

Impact by BIP!

- Selected citations: 38. These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
- Popularity: Top 10%. This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.
- Influence: Top 10%. This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
- Impulse: Top 10%. This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.