Noisy Population Recovery in Polynomial Time

Publication: Article, Preprint, Conference object
Date: 01 Oct 2016
Embargo end date: 01 Jan 2016
Publisher: IEEE
Journal: 2016 IEEE 57th Annual Symposium on Foundations of Computer Science (FOCS)
Funded by: NSF | AF: Small: Efficient Appr...

Authors: Anindya De; Michael E. Saks; Sijian Tang

doi: 10.1109/focs.2016.77 , 10.48550/arxiv.1602.07616

arXiv: 1602.07616

Abstract

In the noisy population recovery problem of Dvir et al., the goal is to learn an unknown distribution $f$ on binary strings of length $n$ from noisy samples. For some parameter $\mu \in [0,1]$, a noisy sample is generated by flipping each coordinate of a sample from $f$ independently with probability $(1-\mu)/2$. We assume an upper bound $k$ on the size of the support of the distribution, and the goal is to estimate the probability of any string to within some given error $\varepsilon$. It is known that the algorithmic complexity and sample complexity of this problem are polynomially related to each other. We show that for $\mu > 0$, the sample complexity (and hence the algorithmic complexity) is bounded by a polynomial in $k$, $n$ and $1/\varepsilon$, improving upon the previous best result of $\mathsf{poly}(k^{\log\log k},n,1/\varepsilon)$ due to Lovett and Zhang. Our proof combines ideas from Lovett and Zhang with a \emph{noise attenuated} version of Möbius inversion. In turn, the latter crucially uses the construction of a \emph{robust local inverse} due to Moitra and Saks.
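The noise model in the abstract can be illustrated with a minimal Python sketch. It covers only the sampling process, not the recovery algorithm of the paper. The function name `noisy_sample`, the toy support, and its probabilities are hypothetical choices made for illustration.

```python
import random


def noisy_sample(support, probs, mu, rng):
    """Draw one string x from f, then flip each bit independently
    with probability (1 - mu) / 2, as in the abstract's noise model.

    support: list of bit tuples (the at most k strings with nonzero mass)
    probs:   matching list of probabilities under f
    mu:      noise parameter in [0, 1]; mu = 1 means no noise
    rng:     a random.Random instance, passed in for reproducibility
    """
    x = rng.choices(support, weights=probs, k=1)[0]
    flip_prob = (1.0 - mu) / 2.0
    return tuple(bit ^ int(rng.random() < flip_prob) for bit in x)


# Toy instance (assumed for illustration): n = 4, support size k = 2.
support = [(0, 0, 0, 0), (1, 1, 0, 1)]
probs = [0.7, 0.3]

rng = random.Random(0)
for _ in range(5):
    print(noisy_sample(support, probs, 0.5, rng))
```

At $\mu = 1$ the flip probability is $0$ and samples arrive unchanged. At $\mu = 0$ every bit is flipped with probability $1/2$, so the samples carry no information about $f$, which is why the result is stated for $\mu > 0$.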

Related Organizations

Northwestern University
United States
Rutgers, The State University of New Jersey
United States
Northeastern University
United States

Keywords

FOS: Computer and information sciences, Computer Science - Computational Complexity, Computer Science - Machine Learning, Computer Science - Data Structures and Algorithms, Data Structures and Algorithms (cs.DS), Computational Complexity (cs.CC), Machine Learning (cs.LG)

Impact by BIP!

- Selected citations: 3. These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
- Popularity: Average. This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.
- Influence: Average. This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
- Impulse: Average. This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.

Funded by

NSF | AF: Small: Efficient Approximations for Dynamic Programs and Other Topics in Algorithms