
arXiv: 1810.04651
AbstractWe propose a new method for supervised learning, the “principal components lasso” (“pcLasso”). It combines the lasso (ℓ1) penalty with a quadratic penalty that shrinks the coefficient vector toward the feature matrix's leading principal components (PCs). pcLasso can be especially powerful if the features are preassigned to groups. In that case, pcLasso shrinks each group‐wise component of the solution toward the leading PCs of that group. The pcLasso method also carries out selection of feature groups. We provide some theory and illustrate the method on some simulated and real data examples.
FOS: Computer and information sciences, Ridge regression; shrinkage estimators (Lasso), sparsity, Machine Learning (stat.ML), Factor analysis and principal components; correspondence analysis, supervised learning, Methodology (stat.ME), feature group selection, Statistics - Machine Learning, Lasso, Statistics - Methodology, principal components
FOS: Computer and information sciences, Ridge regression; shrinkage estimators (Lasso), sparsity, Machine Learning (stat.ML), Factor analysis and principal components; correspondence analysis, supervised learning, Methodology (stat.ME), feature group selection, Statistics - Machine Learning, Lasso, Statistics - Methodology, principal components
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 9 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Top 10% | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Average | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Top 10% |
