Geometric Aspects of Biological Sequence Comparison

Publication: Article, Preprint, Other literature type
Date: 01 Apr 2009
Embargo end date: 01 Jan 2007
Language: English
Publisher: SAGE Publications
Journal: Journal of Computational Biology, volume 16, pages 579-610 (issn: 1066-5277, eissn: 1557-8666)

Authors: Aleksandar Stojmirovic; Yi-Kuo Yu

doi: 10.1089/cmb.2008.0100, 10.48550/arxiv.0710.2555

pmid: 19361329

pmc: PMC2801405

arXiv: 0710.2555


Abstract

We introduce a geometric framework suitable for studying the relationships among biological sequences. In contrast to previous works, our formulation allows asymmetric distances (quasi-metrics), originating from uneven weighting of strings, which may induce non-trivial partial orders on sets of biosequences. The distances considered are more general than traditional generalized string edit distances. In particular, our framework enables non-trivial conversion between sequence similarities, both local and global, and distances. Our constructions apply to a wide class of scoring schemes and require much less restrictive gap penalties than the ones regularly used. Numerous examples are provided to illustrate the concepts introduced and their potential applications.
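
As a rough illustration of the asymmetric distances mentioned in the abstract, the sketch below builds a distance from a small similarity table and checks the quasi-metric axioms by brute force. The score table, the function names, and the conversion d(x, y) = s(x, x) - s(x, y) are assumptions chosen for illustration only; this is one simple way to obtain an asymmetric distance from a similarity, and it is not presented as the construction used in the paper.

```python
from itertools import product

# Hypothetical toy similarity scores over a three-letter alphabet.
# These numbers are made up for illustration; they are not from the paper.
ALPHABET = "ABC"
SCORES = {
    ("A", "A"): 4, ("B", "B"): 5, ("C", "C"): 6,
    ("A", "B"): 1, ("A", "C"): 0, ("B", "C"): 2,
}


def similarity(x, y):
    """Look up the (symmetric) toy similarity score s(x, y)."""
    return SCORES[(x, y)] if (x, y) in SCORES else SCORES[(y, x)]


def distance(x, y):
    """Assumed conversion: d(x, y) = s(x, x) - s(x, y).

    Unequal self-similarities s(x, x) make d asymmetric.
    """
    return similarity(x, x) - similarity(x, y)


def is_quasi_metric(points, d):
    """Brute-force check of the quasi-metric axioms on a finite set."""
    for x, y in product(points, repeat=2):
        if d(x, y) < 0:
            return False
        # d(x, y) = d(y, x) = 0 must hold exactly when x == y.
        if (d(x, y) == 0 and d(y, x) == 0) != (x == y):
            return False
    for x, y, z in product(points, repeat=3):
        # Triangle inequality, with the argument order respected.
        if d(x, z) > d(x, y) + d(y, z):
            return False
    return True


def is_symmetric(points, d):
    """Return True if d(x, y) == d(y, x) for every pair."""
    return all(d(x, y) == d(y, x) for x, y in product(points, repeat=2))


if __name__ == "__main__":
    print("quasi-metric:", is_quasi_metric(ALPHABET, distance))
    print("symmetric:", is_symmetric(ALPHABET, distance))
    print("d(A, B) =", distance("A", "B"), " d(B, A) =", distance("B", "A"))
```

With this toy table the distance satisfies the triangle inequality but d(A, B) differs from d(B, A), so it is a quasi-metric rather than a metric.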

Related Organizations

National Institutes of Health, United States
National Institute of Health, Armenia

Keywords

Base Sequence, FOS: Biological sciences, Computational Biology, Sequence Homology, Amino Acid Sequence, Quantitative Biology - Quantitative Methods, Algorithms, Quantitative Methods (q-bio.QM)

Impact by BIP!

selected citations: 10. These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
popularity: Top 10%. This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.
influence: Top 10%. This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
impulse: Average. This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.

Green

hybrid

Fields of Science (3)

medical and health sciences

basic medicine
