
arXiv: 1806.05826
AbstractSolving linear systems is often the computational bottleneck in real‐life problems. Iterative solvers are the only option due to the complexity of direct algorithms or because the system matrix is not explicitly known. Here, we develop a two‐level preconditioner for regularized least squares linear systems involving a feature or data matrix. Variants of this linear system may appear in machine learning applications, such as ridge regression, logistic regression, support vector machines and Bayesian regression. We use clustering algorithms to create a coarser level that preserves the principal components of the covariance or Gram matrix. This coarser level approximates the dominant eigenvectors and is used to build a subspace preconditioner accelerating the Conjugate Gradient method. We observed speed‐ups for artificial and real‐life data.
Iterative numerical methods for linear systems, Ridge regression; shrinkage estimators (Lasso), Tikhonov regularization, Pattern recognition, speech recognition, Learning and adaptive systems in artificial intelligence, Numerical Analysis (math.NA), 65F08, 65F22, 68T05, 68T10, machine learning, Ill-posedness and regularization problems in numerical linear algebra, preconditioning, large-scale, Krylov subspace methods, ridge regression, FOS: Mathematics, Preconditioners for iterative methods, Mathematics - Numerical Analysis, clustering
Iterative numerical methods for linear systems, Ridge regression; shrinkage estimators (Lasso), Tikhonov regularization, Pattern recognition, speech recognition, Learning and adaptive systems in artificial intelligence, Numerical Analysis (math.NA), 65F08, 65F22, 68T05, 68T10, machine learning, Ill-posedness and regularization problems in numerical linear algebra, preconditioning, large-scale, Krylov subspace methods, ridge regression, FOS: Mathematics, Preconditioners for iterative methods, Mathematics - Numerical Analysis, clustering
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 1 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Average | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Average | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Average |
