
arXiv: 1909.13574
Evolution consists of distinct stages: cosmological, biological, linguistic. Since biology verges on natural sciences and linguistics, we expect that it shares structures and features from both forms of knowledge. Indeed, in DNA we encounter the biological "atoms", the four nucleotide molecules. At the same time these four nucleotides may be considered as the "letters" of an alphabet. These four "letters", through a genetic code, generate biological "words", "phrases", "sentences" (aminoacids, proteins, cells, living organisms). In this spirit we may consider equally well a DNA strand as a mathematical statement. Inspired by the work of Kurt G��del, we attach to each DNA strand a G��del's number, a product of prime numbers raised to appropriate powers. To each DNA chain corresponds a single G��del's number $G$, and inversely given a G��del's number $G$, we can specify the DNA chain it stands for. Next, considering a single DNA strand composed of $N$ bases, we study the statistical distribution of $g$, the logarithm of $G$. Our assumption is that the choice of the $m$-th term is random and with equal probability for the four possible outcomes. The "experiment", to some extent, appears as throwing $N$ times a four-faces die. Through the moment generating function we obtain the discrete and then the continuum distribution of $g$. There is an excellent agreement between our formalism and simulated data. At the end we compare our formalism to actual data, to specify the presence of traces of non-random dynamics.
11 pages, 2 figures
language representation of biological sequences, FOS: Biological sciences, language theory, nucleotide sequences, Gödel numbering, Other Quantitative Biology (q-bio.OT), Statistical mechanics, structure of matter, Quantitative Biology - Other Quantitative Biology, information theory
language representation of biological sequences, FOS: Biological sciences, language theory, nucleotide sequences, Gödel numbering, Other Quantitative Biology (q-bio.OT), Statistical mechanics, structure of matter, Quantitative Biology - Other Quantitative Biology, information theory
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 1 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Average | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Average | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Average |
