
pmid: 19644170
handle: 10092/2791
The Robinson-Foulds (RF) distance is by far the most widely used measure of dissimilarity between trees. Although the distribution of these distances has been investigated for twenty years, an algorithm that is explicitly polynomial time has yet to be described for computing this distribution (which is also the distribution of trees around a given tree under the popular Robinson-Foulds metric). In this paper we derive a polynomial-time algorithm for this distribution. We show how the distribution can be approximated by a Poisson distribution determined by the proportion of leaves that lie in `cherries' of the given tree. We also describe how our results can be used to derive normalization constants that are required in a recently-proposed maximum likelihood approach to supertree construction.
16 pages, 3 figures
Fields of Research::270000 Biological Sciences::270200 Genetics, Models, Genetic, Populations and Evolution (q-bio.PE), 511, trees, Quantitative Biology - Quantitative Methods, discrete mathematics applications, Fields of Research::230000 Mathematical Sciences::230200 Statistics::230204 Applied statistics, phylogenetics, Robinson-Foulds distance, normalization constant, FOS: Biological sciences, Poisson approximation, Poisson Distribution, biology and genetics, Quantitative Biology - Populations and Evolution, Algorithms, Phylogeny, Quantitative Methods (q-bio.QM)
Fields of Research::270000 Biological Sciences::270200 Genetics, Models, Genetic, Populations and Evolution (q-bio.PE), 511, trees, Quantitative Biology - Quantitative Methods, discrete mathematics applications, Fields of Research::230000 Mathematical Sciences::230200 Statistics::230204 Applied statistics, phylogenetics, Robinson-Foulds distance, normalization constant, FOS: Biological sciences, Poisson approximation, Poisson Distribution, biology and genetics, Quantitative Biology - Populations and Evolution, Algorithms, Phylogeny, Quantitative Methods (q-bio.QM)
| citations This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 39 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Top 10% | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Top 10% | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Average |
