Impossibility Results for Grammar-Compressed Linear Algebra

Publication: Article, Preprint, Conference object, 01 Jan 2020

Embargo end date: 01 Jan 2020

Publisher: arXiv

Journal: CoRR, volume abs/2010.14181

Funded by: EC | TIPEA, NSF | Collaborative Research: A...

Authors: Abboud, A.; Backurs, A.; Bringmann, K. (https://orcid.org/0000-0003-1356-5177); Künnemann, M.

doi: 10.48550/arxiv.2010.14181

arXiv: 2010.14181

handle: 21.11116/0000-0007-90DF-B

Abstract

To handle vast amounts of data, it is natural and popular to compress vectors and matrices. When we compress a vector from size $N$ down to size $n \ll N$, it certainly makes it easier to store and transmit efficiently, but does it also make it easier to process? In this paper we consider lossless compression schemes, and ask if we can run our computations on the compressed data as efficiently as if the original data was that small. That is, if an operation has time complexity $T(\text{input size})$, can we perform it on the compressed representation in time $T(n)$ rather than $T(N)$? We consider the most basic linear algebra operations: inner product, matrix-vector multiplication, and matrix multiplication. In particular, given two compressed vectors, can we compute their inner product in time $O(n)$? Or perhaps we must decompress first and then multiply, spending $\Omega(N)$ time? The answer depends on the compression scheme. While for simple ones such as Run-Length-Encoding (RLE) the inner product can be done in $O(n)$ time, we prove that this is impossible for compressions from a richer class: essentially $n^2$ or even larger runtimes are needed in the worst case (under complexity assumptions). This is the class of grammar-compressions containing most popular methods such as the Lempel-Ziv family. These schemes are more compressing than the simple RLE, but alas, we prove that performing computations on them is much harder.
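To make the RLE claim concrete, here is a minimal Python sketch of an inner product computed directly on run-length-encoded vectors. It is an illustration only, not code from the paper: the representation (a list of `(value, run_length)` pairs with positive run lengths) and the helper names `rle_encode` and `rle_inner_product` are assumptions chosen for this example. The loop walks both run lists in lockstep and finishes at least one run per iteration, so it performs at most $n_a + n_b$ iterations for inputs with $n_a$ and $n_b$ runs, independent of the uncompressed length $N$.

```python
from typing import List, Sequence, Tuple

# Assumed representation: a vector is a list of (value, run_length) pairs,
# with every run_length a positive integer.
Run = Tuple[int, int]


def rle_encode(vec: Sequence[int]) -> List[Run]:
    """Run-length encode a plain vector (takes time linear in len(vec))."""
    runs: List[Run] = []
    for x in vec:
        if runs and runs[-1][0] == x:
            runs[-1] = (x, runs[-1][1] + 1)
        else:
            runs.append((x, 1))
    return runs


def rle_inner_product(a: List[Run], b: List[Run]) -> int:
    """Inner product of two RLE vectors without decompressing them."""
    i = j = 0
    rem_a = a[0][1] if a else 0  # entries left in the current run of a
    rem_b = b[0][1] if b else 0  # entries left in the current run of b
    total = 0
    while i < len(a) and j < len(b):
        # Both vectors are constant over the next `step` coordinates.
        step = min(rem_a, rem_b)
        total += a[i][0] * b[j][0] * step
        rem_a -= step
        rem_b -= step
        if rem_a == 0:
            i += 1
            if i < len(a):
                rem_a = a[i][1]
        if rem_b == 0:
            j += 1
            if j < len(b):
                rem_b = b[j][1]
    if i < len(a) or j < len(b):
        raise ValueError("vectors have different uncompressed lengths")
    return total


u = [0, 0, 0, 2, 2, 5]
v = [1, 1, 3, 3, 3, 3]
print(rle_inner_product(rle_encode(u), rle_encode(v)))  # 27
```

The same function handles vectors far too long to decompress, for example `rle_inner_product([(2, 10**9)], [(3, 10**9)])`, because the work depends only on the number of runs. The abstract's negative result says that no comparably fast procedure exists for grammar-compressed vectors under the stated complexity assumptions.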

NeurIPS'20, 20 pages

Related Organizations

Saarland University
Germany
TOYOTA TECHNOLOGICAL INSTITUTE / CHICAGO
Max Planck Society
Germany
IBM Almaden Research Center
United States

Keywords

FOS: Computer and information sciences, Computer Science - Computational Complexity, Computer Science - Machine Learning, Computational Complexity (cs.CC), Machine Learning (cs.LG)

Related research (5 similar documents)

Constant-Time Tree Traversal and Subtree Equality Check for Grammar-Compressed Trees (2017)
Random Access to Grammar-Compressed Strings and Trees (2015)
Access, Rank, and Select in Grammar-compressed Strings (2015)
Fast, Small, and Simple Document Listing on Repetitive Text Collections (2019)
Rank, select and access in grammar-compressed strings (2014)

3sum

Impact by BIP!

selected citations: 0
popularity: Average
influence: Average
impulse: Average