
arXiv: 1311.4472
AbstractWe propose a new sparse regression method called thecomponent lasso, based on a simple idea. The method uses the connected‐components structure of the sample covariance matrix to split the problem into smaller ones. It then applies the lasso to each subproblem separately, obtaining a coefficient vector for each one. Finally, it uses non‐negative least squares to recombine the different vectors into a single solution. This step is useful in selecting and reweighting components that are correlated with the response. We prove that the component lasso is strongly sign consistent in a block‐diagonal setting. Simulated and real data examples show that the component lasso can outperform standard regression methods such as the lasso and elastic net, achieving a lower mean squared error as well as better support recovery. The modular structure of the algorithm also lends itself naturally to parallel computation.The Canadian Journal of Statistics43: 624–646; 2015 © 2015 Statistical Society of Canada
FOS: Computer and information sciences, Computer Science - Machine Learning, Ridge regression; shrinkage estimators (Lasso), Linear regression; mixed models, strong irrepresentable condition, sparsity, graphical Lasso, Machine Learning (stat.ML), elastic net, Machine Learning (cs.LG), 62J07, connected components, Statistics - Machine Learning, non-negative least squares, Lasso
FOS: Computer and information sciences, Computer Science - Machine Learning, Ridge regression; shrinkage estimators (Lasso), Linear regression; mixed models, strong irrepresentable condition, sparsity, graphical Lasso, Machine Learning (stat.ML), elastic net, Machine Learning (cs.LG), 62J07, connected components, Statistics - Machine Learning, non-negative least squares, Lasso
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 3 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Average | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Average | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Average |
