<script type="text/javascript">
<!--
document.write('<div id="oa_widget"></div>');
document.write('<script type="text/javascript" src="https://www.openaire.eu/index.php?option=com_openaire&view=widget&format=raw&projectId=undefined&type=result"></script>');
-->
</script>
pmid: 10057305
We extend the Zipf approach to analyzing linguistic texts to the statistical study of DNA base pair sequences and find that the noncoding regions are more similar to natural languages than the coding regions. We also adapt the Shannon approach to quantifying the "redundancy" of a linguistic text in terms of a measurable entropy function, and demonstrate that noncoding regions in eukaryotes display a smaller entropy and larger redundancy than coding regions, supporting the possibility that noncoding regions of DNA may carry biological information.
DNA, Bacterial, Base Sequence, Statistics as Topic, Linguistics, Sequence Analysis, DNA, DNA, Helminth, Eukaryotic Cells, Codon, Nonsense, Genetic Code, DNA, Viral, Animals, Humans, DNA, Fungal, Algorithms, Language
DNA, Bacterial, Base Sequence, Statistics as Topic, Linguistics, Sequence Analysis, DNA, DNA, Helminth, Eukaryotic Cells, Codon, Nonsense, Genetic Code, DNA, Viral, Animals, Humans, DNA, Fungal, Algorithms, Language
citations This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 250 | |
popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Top 10% | |
influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Top 1% | |
impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Top 1% |