
The use of the multinomial logit model is typically restricted to applications with few predictors, because in high-dimensional settings maximum likelihood estimates tend to deteriorate. In this paper we are proposing a sparsity-inducing penalty that accounts for the special structure of multinomial models. In contrast to existing methods, it penalizes the parameters that are linked to one variable in a grouped way and thus yields variable selection instead of parameter selection. We develop a proximal gradient method that is able to efficiently compute stable estimates. In addition, the penalization is extended to the important case of predictors that vary across response categories. We apply our estimator to the modeling of party choice of voters in Germany including voter-specific variables like age and gender but also party-specific features like stance on nuclear energy and immigration.
Generalized linear models (logistic models), Ridge regression; shrinkage estimators (Lasso), logistic regression, group Lasso, 510, CATS lasso, Logistic regression, Multinomial logit model, Variable selection, Lasso, Group Lasso, CATS Lasso., Nonparametric regression and quantile regression, Lasso, Computational methods for problems pertaining to statistics, multinomial logit model, variable selection
Generalized linear models (logistic models), Ridge regression; shrinkage estimators (Lasso), logistic regression, group Lasso, 510, CATS lasso, Logistic regression, Multinomial logit model, Variable selection, Lasso, Group Lasso, CATS Lasso., Nonparametric regression and quantile regression, Lasso, Computational methods for problems pertaining to statistics, multinomial logit model, variable selection
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 38 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Top 10% | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Top 10% | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Top 10% |
