
arXiv: 1112.0192
A measure called Physical Complexity is established and calculated for a population of sequences, based on statistical physics, automata theory, and information theory. It is a measure of the quantity of information in an organism's genome. It is based on Shannon's entropy, measuring the information in a population evolved in its environment, by using entropy to estimate the randomness in the genome. It is calculated from the difference between the maximal entropy of the population and the actual entropy of the population when in its environment, estimated by counting the number of fixed loci in the sequences of a population. Up to now, Physical Complexity has only been formulated for populations of sequences with the same length. Here, we investigate an extension to support variable length populations. We then build upon this to construct a measure for the efficiency of information storage, which we later use in understanding clustering within populations. Finally, we investigate our extended Physical Complexity through simulations, showing it to be consistent with the original.
8 pages, 7 figures, journal
Biological Physics (physics.bio-ph), Physics - Data Analysis, Statistics and Probability, FOS: Biological sciences, FOS: Physical sciences, Physics - Biological Physics, Quantitative Biology - Quantitative Methods, Data Analysis, Statistics and Probability (physics.data-an), Quantitative Methods (q-bio.QM)
Biological Physics (physics.bio-ph), Physics - Data Analysis, Statistics and Probability, FOS: Biological sciences, FOS: Physical sciences, Physics - Biological Physics, Quantitative Biology - Quantitative Methods, Data Analysis, Statistics and Probability (physics.data-an), Quantitative Methods (q-bio.QM)
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 2 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Average | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Average | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Average |
