descriptionPublicationkeyboard_double_arrow_right Article , Preprint 01 Jul 2013Embargo end date: 01 Jan 2012Publisher:Institute of Electrical and Electronics Engineers (IEEE)Journal:IEEE Transactions on Information Theory, volume 59, pages 4,374-4,388 (issn: 0018-9448, eissn: 1557-9654,

Authors: Alfred O. Hero; Dennis Wei; Kumar Sricharan;

doi: 10.1109/tit.2013.2251456 , 10.48550/arxiv.1203.5829

pmid: 25897177

pmc: PMC4401872

arXiv: http://arxiv.org/abs/1203.5829

Ensemble Estimators for Multivariate Entropy Estimation

- Summary
- Subjects
- Metrics

Abstract

The problem of estimation of density functionals like entropy and mutual information has received much attention in the statistics and information theory communities. A large class of estimators of functionals of the probability density suffer from the curse of dimensionality, wherein the mean squared error (MSE) decays increasingly slowly as a function of the sample size $T$ as the dimension $d$ of the samples increases. In particular, the rate is often glacially slow of order $O(T^{-��/{d}})$, where $��>0$ is a rate parameter. Examples of such estimators include kernel density estimators, $k$-nearest neighbor ($k$-NN) density estimators, $k$-NN entropy estimators, intrinsic dimension estimators and other examples. In this paper, we propose a weighted affine combination of an ensemble of such estimators, where optimal weights can be chosen such that the weighted estimator converges at a much faster dimension invariant rate of $O(T^{-1})$. Furthermore, we show that these optimal weights can be determined by solving a convex optimization problem which can be performed offline and does not require training data. We illustrate the superior performance of our weighted estimator for two important applications: (i) estimating the Panter-Dite distortion-rate factor and (ii) estimating the Shannon entropy for testing the probability distribution of a random sample.

version 3: correction of minor typos from version 2

Related Organizations

University of Michigan–Flint
United States
UNIVERSITY OF MICHIGAN
University of Michigan Ann Arbor
United States
University of Michigan–Ann Arbor
United States

Keywords

Methodology (stat.ME), FOS: Computer and information sciences, FOS: Mathematics, Mathematics - Statistics Theory, Statistics Theory (math.ST), Statistics - Methodology

Impact byBIP!

	citations This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	36
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Top 10%
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Top 10%
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Top 10%

Found an issue? Give us feedback

Top 10%

Green

bronze

Fields of Science

engineering and technology

electrical engineering, electronic engineering, information engineering

Fields of Science

engineering and technology

electrical engineering, electronic engineering, information engineering

Funded by

NIH| Automatic Three Dimensional (3D) Registration for Enhanced Cancer Management