A component lasso

descriptionPublicationkeyboard_double_arrow_right Article , Preprint 25 Nov 2015Embargo end date: 01 Jan 2013 English Publisher:WileyJournal:Canadian Journal of Statistics, volume 43, pages 624-646 (issn: 0319-5724, eissn: 1708-945X,

Copyright policy )Funded by:NSF | Flexible and Adaptive Sta..., NIH | NHLBI PROTEOMICS INITIATI...

Authors: Nadine Hussami; Robert Tibshirani;

doi: 10.1002/cjs.11267 , 10.48550/arxiv.1311.4472

arXiv: 1311.4472

A component lasso

- Summary
- Subjects
- Metrics

Abstract

AbstractWe propose a new sparse regression method called thecomponent lasso, based on a simple idea. The method uses the connected‐components structure of the sample covariance matrix to split the problem into smaller ones. It then applies the lasso to each subproblem separately, obtaining a coefficient vector for each one. Finally, it uses non‐negative least squares to recombine the different vectors into a single solution. This step is useful in selecting and reweighting components that are correlated with the response. We prove that the component lasso is strongly sign consistent in a block‐diagonal setting. Simulated and real data examples show that the component lasso can outperform standard regression methods such as the lasso and elastic net, achieving a lower mean squared error as well as better support recovery. The modular structure of the algorithm also lends itself naturally to parallel computation.The Canadian Journal of Statistics43: 624–646; 2015 © 2015 Statistical Society of Canada

Related Organizations

Stanford University
United States

Keywords

FOS: Computer and information sciences, Computer Science - Machine Learning, Ridge regression; shrinkage estimators (Lasso), Linear regression; mixed models, strong irrepresentable condition, sparsity, graphical Lasso, Machine Learning (stat.ML), elastic net, Machine Learning (cs.LG), 62J07, connected components, Statistics - Machine Learning, non-negative least squares, Lasso

Impact byBIP!

	selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	3
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Average
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Average
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Average

Found an issue? Give us feedback

3

Average

Green

Fields of Science

Fields of Science

Funded by

NSF| Flexible and Adaptive Statistical Modeling, NIH| NHLBI PROTEOMICS INITIATIVE-268028183