
arXiv: 2010.10375
AbstractIn order to handle large datasets omnipresent in modern science, efficient compression algorithms are necessary. Here, a Bayesian data compression (BDC) algorithm that adapts to the specific measurement situation is derived in the context of signal reconstruction. BDC compresses a dataset under conservation of its posterior structure with minimal information loss given the prior knowledge on the signal, the quantity of interest. Its basic form is valid for Gaussian priors and likelihoods. For constant noise standard deviation, basic BDC becomes equivalent to a Bayesian analog of principal component analysis. Using metric Gaussian variational inference, BDC generalizes to non‐linear settings. In its current form, BDC requires the storage of effective instrument response functions for the compressed data and corresponding noise encoding the posterior covariance structure. Their memory demand counteract the compression gain. In order to improve this, sparsity of the compressed responses can be obtained by separating the data into patches and compressing them separately. The applicability of BDC is demonstrated by applying it to synthetic data and radio astronomical data. Still the algorithm needs further improvement as the computation time of the compression and subsequent inference exceeds the time of the inference with the original data.
Signal theory (characterization, reconstruction, filtering, etc.), Gaussian likelihood, Bayesian inference, FOS: Physical sciences, Informational aspects of data analysis and big data, Bayesian statistics, lossy compression, Physics - Data Analysis, Statistics and Probability, signal reconstruction, Coding and information theory (compaction, compression, models of communication, encoding schemes, etc.) (aspects in computer science), Astrophysics - Instrumentation and Methods for Astrophysics, Instrumentation and Methods for Astrophysics (astro-ph.IM), data compression, Data Analysis, Statistics and Probability (physics.data-an), information theory, ddc: ddc:
Signal theory (characterization, reconstruction, filtering, etc.), Gaussian likelihood, Bayesian inference, FOS: Physical sciences, Informational aspects of data analysis and big data, Bayesian statistics, lossy compression, Physics - Data Analysis, Statistics and Probability, signal reconstruction, Coding and information theory (compaction, compression, models of communication, encoding schemes, etc.) (aspects in computer science), Astrophysics - Instrumentation and Methods for Astrophysics, Instrumentation and Methods for Astrophysics (astro-ph.IM), data compression, Data Analysis, Statistics and Probability (physics.data-an), information theory, ddc: ddc:
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 3 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Average | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Average | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Average |
