A statistical physics perspective on alignment-independent protein sequence comparison

Article, Preprint English OPEN
Chattopadhyay, Amit K ; Nasiev, Diar ; Flower, Darren R (2016)
  • Publisher: Oxford University Press
  • Journal: Bioinformatics, volume 31, issue 15, pages 2,469-2,474 (issn: 1367-4803, eissn: 1367-4811)
  • Related identifiers: doi: 10.1093/bioinformatics/btv167, pmc: PMC4514925
  • Subject: Quantitative Biology - Biomolecules | Quantitative Biology - Quantitative Methods | Sequence Analysis | Original Papers

Motivation: Within bioinformatics, the textual alignment of amino acid sequences has long dominated the determination of similarity between proteins, with all that implies for shared structure, function and evolutionary descent. Despite the relative success of modern-day sequence alignment algorithms, so-called alignment-free approaches offer a complementary means of determining and expressing similarity, with potential benefits in certain key applications, such as regression analysis of protein structure-function studies, where alignment-base similarity has performed poorly. Results: Here, we offer a fresh, statistical physics-based perspective focusing on the question of alignment-free comparison, in the process adapting results from ‘first passage probability distribution’ to summarize statistics of ensemble averaged amino acid propensity values. In this article, we introduce and elaborate this approach. Contact: d.r.flower@aston.ac.uk
Share - Bookmark