
arXiv: 2106.02755
Low-rank approximation of kernels is a fundamental mathematical problem with widespread algorithmic applications. Often the kernel is restricted to an algebraic variety, e.g., in problems involving sparse or low-rank data. We show that significantly better approximations are obtainable in this setting: the rank required to achieve a given error depends on the variety's dimension rather than the ambient dimension, which is typically much larger. This is true in both high-precision and high-dimensional regimes. Our results are presented for smooth isotropic kernels, the predominant class of kernels used in applications. Our main technical insight is to approximate smooth kernels by polynomial kernels, and leverage two key properties of polynomial kernels that hold when they are restricted to a variety. First, their ranks decrease exponentially in the variety's co-dimension. Second, their maximum values are governed by their values over a small set of points. Together, our results provide a general approach for exploiting (approximate) "algebraic structure" in datasets in order to efficiently solve large-scale data science problems.
FOS: Computer and information sciences, Relations of manifolds and cell complexes with computer and data science, Computer Science - Machine Learning, algebraic variety, kernel approximation, Computational aspects in algebraic geometry, Machine Learning (cs.LG), Mathematics - Algebraic Geometry, Approximation by polynomials, Statistics on algebraic and topological structures, effective dimension, norming set, Computer Science - Data Structures and Algorithms, polynomial approximation, FOS: Mathematics, Data Structures and Algorithms (cs.DS), data science, kernel rank, Algebraic Geometry (math.AG)
FOS: Computer and information sciences, Relations of manifolds and cell complexes with computer and data science, Computer Science - Machine Learning, algebraic variety, kernel approximation, Computational aspects in algebraic geometry, Machine Learning (cs.LG), Mathematics - Algebraic Geometry, Approximation by polynomials, Statistics on algebraic and topological structures, effective dimension, norming set, Computer Science - Data Structures and Algorithms, polynomial approximation, FOS: Mathematics, Data Structures and Algorithms (cs.DS), data science, kernel rank, Algebraic Geometry (math.AG)
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 0 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Average | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Average | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Average |
