
arXiv: 0811.4167
We propose a penalized orthogonal-components regression (POCRE) for large p small n data. Orthogonal components are sequentially constructed to maximize, upon standardization, their correlation to the response residuals. A new penalization framework, implemented via empirical Bayes thresholding, is presented to effectively identify sparse predictors of each component. POCRE is computationally efficient owing to its sequential construction of leading sparse principal components. In addition, such construction offers other properties such as grouping highly correlated predictors and allowing for collinear or nearly collinear predictors. With multivariate responses, POCRE can construct common components and thus build up latent-variable models for large p small n data.
12 pages
FOS: Computer and information sciences, latent-variable model, Ridge regression; shrinkage estimators (Lasso), Latent-variable model, Measures of association (correlation, canonical correlation, etc.), Empirical Bayes thresholding, empirical Bayes thresholding, Machine Learning (stat.ML), supervised dimension reduction, \(p\gg n\) data, Methodology (stat.ME), 62J07, Sparse predictors, 62J05, POCRE, Statistics - Machine Learning, p≫n data, General nonlinear regression, sparse predictors, Supervised dimension reduction, Statistics - Methodology, 62H20
FOS: Computer and information sciences, latent-variable model, Ridge regression; shrinkage estimators (Lasso), Latent-variable model, Measures of association (correlation, canonical correlation, etc.), Empirical Bayes thresholding, empirical Bayes thresholding, Machine Learning (stat.ML), supervised dimension reduction, \(p\gg n\) data, Methodology (stat.ME), 62J07, Sparse predictors, 62J05, POCRE, Statistics - Machine Learning, p≫n data, General nonlinear regression, sparse predictors, Supervised dimension reduction, Statistics - Methodology, 62H20
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 18 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Top 10% | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Top 10% | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Top 10% |
