
arXiv: 0812.4547
Constrained least-squares regression problems, such as the Nonnegative Least Squares (NNLS) problem, where the variables are restricted to take only nonnegative values, often arise in applications. Motivated by the recent development of the fast Johnson-Lindestrauss transform, we present a fast random projection type approximation algorithm for the NNLS problem. Our algorithm employs a randomized Hadamard transform to construct a much smaller NNLS problem and solves this smaller problem using a standard NNLS solver. We prove that our approach finds a nonnegative solution vector that, with high probability, is close to the optimum nonnegative solution in a relative error approximation sense. We experimentally evaluate our approach on a large collection of term-document data and verify that it does offer considerable speedups without a significant loss in accuracy. Our analysis is based on a novel random projection type result that might be of independent interest. In particular, given a tall and thin matrix $��\in \mathbb{R}^{n \times d}$ ($n \gg d$) and a vector $y \in \mathbb{R}^d$, we prove that the Euclidean length of $��y$ can be estimated very accurately by the Euclidean length of $\tilde��y$, where $\tilde��$ consists of a small subset of (appropriately rescaled) rows of $��$.
to appear in Linear Algebra and its Applications
FOS: Computer and information sciences, Convex programming, sampling, convex optimization, Non negative least-squares, Quadratic programming, randomized algorithm, least-squares regression problems, quadratic optimization, Numerical mathematical programming methods, fast Johnson-Lindestrauss transform, non negative least-squares, Computer Science - Data Structures and Algorithms, Discrete Mathematics and Combinatorics, Data Structures and Algorithms (cs.DS), Sampling, Hadamard transform, Randomized algorithm, numerical examples, Numerical Analysis, Algebra and Number Theory, random projections, Random projections, optimum nonnegative solution, Geometry and Topology
FOS: Computer and information sciences, Convex programming, sampling, convex optimization, Non negative least-squares, Quadratic programming, randomized algorithm, least-squares regression problems, quadratic optimization, Numerical mathematical programming methods, fast Johnson-Lindestrauss transform, non negative least-squares, Computer Science - Data Structures and Algorithms, Discrete Mathematics and Combinatorics, Data Structures and Algorithms (cs.DS), Sampling, Hadamard transform, Randomized algorithm, numerical examples, Numerical Analysis, Algebra and Number Theory, random projections, Random projections, optimum nonnegative solution, Geometry and Topology
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 41 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Top 10% | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Top 10% | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Top 10% |
