Path Matching and Graph Matching in Biological Networks

Publication: Article, 01 Jan 2007, English
Publisher: SAGE Publications
Journal: Journal of Computational Biology, volume 14, pages 56-67 (ISSN: 1066-5277, eISSN: 1557-8666)

Authors: Qingwu Yang; Sing-Hoi Sze;

doi: 10.1089/cmb.2006.0076

pmid: 17381346


Abstract

We develop algorithms for the following path matching and graph matching problems: (i) given a query path p and a graph G, find a path p' that is most similar to p in G; (ii) given a query graph G0 and a graph G, find a graph G0' that is most similar to G0 in G. In these problems, p and G0 represent a given substructure of interest to a biologist, and G represents a large network in which the biologist desires to find a related substructure. These algorithms allow the study of common substructures in biological networks in order to understand how these networks evolve both within and between organisms. We reduce the path matching problem to finding a longest weighted path in a directed acyclic graph and show that the problem of finding top k suboptimal paths can be solved in polynomial time. This is in contrast with most previous approaches, which used exponential time algorithms to find simple paths and are practical only when the paths are short. We reduce the graph matching problem to finding highest scoring subgraphs in a graph and give an exact algorithm to solve the problem when the query graph G0 is of moderate size. This eliminates the need for less accurate heuristic or randomized algorithms. We show that our algorithms are able to extract biologically meaningful pathways from protein interaction networks in the DIP database and metabolic networks in the KEGG database. Software programs implementing these techniques (PathMatch and GraphMatch) are available at http://faculty.cs.tamu.edu/shsze/pathmatch and http://faculty.cs.tamu.edu/shsze/graphmatch.
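The reduction of path matching to a longest weighted path in a directed acyclic graph can be illustrated with a small sketch. This is not the PathMatch implementation: the function name `best_path_match`, the similarity function `sim`, the single-vertex skip rule, and the `gap_penalty` value are all assumptions made for illustration. Each DAG node (i, v) means that query position i is aligned with graph vertex v, and every edge goes from a lower query position to a higher one, so processing positions in increasing order is a valid topological order. The sketch does not force the matched path in G to be simple.

```python
def best_path_match(query, adj, sim, gap_penalty=-1.0):
    """Longest weighted path in a layered DAG (illustrative sketch only).

    query: list of query-path vertices
    adj:   dict mapping each graph vertex to a list of its neighbours in G
    sim:   hypothetical similarity function sim(query_vertex, graph_vertex)
    """
    n = len(query)
    vertices = list(adj)
    # A path may start at any node (i, v), so initialise with the node score.
    best = {(i, v): sim(query[i], v) for i in range(n) for v in vertices}
    back = {}
    # Layers are processed in increasing i, which is a topological order.
    for i in range(n):
        for v in vertices:
            for w in adj[v]:
                # skip = number of query vertices left unmatched (assumed 0 or 1)
                for skip in (0, 1):
                    j = i + 1 + skip
                    if j >= n:
                        continue
                    cand = best[(i, v)] + skip * gap_penalty + sim(query[j], w)
                    if cand > best[(j, w)]:
                        best[(j, w)] = cand
                        back[(j, w)] = (i, v)
    # Trace back from the highest scoring node.
    end = max(best, key=best.get)
    score = best[end]
    path = [end]
    while path[-1] in back:
        path.append(back[path[-1]])
    path.reverse()
    return score, path


# Toy example with made-up vertex names and scores.
query = ["a", "b", "c"]
adj = {"x": ["y"], "y": ["z"], "z": [], "u": ["z"]}
matches = {("a", "x"), ("b", "y"), ("c", "z")}
sim = lambda q, v: 1.0 if (q, v) in matches else -1.0

score, path = best_path_match(query, adj, sim)
```

In this toy example the query a, b, c aligns with the graph path x, y, z. The running time is polynomial in the query length and the size of G because each DAG edge is relaxed once.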

Related Organizations

The University of Texas System
United States
Texas A&M University
United States

Keywords

Drosophila melanogaster, Helicobacter pylori, Protein Interaction Mapping, Animals, Saccharomyces cerevisiae, Caenorhabditis elegans, Algorithms, Metabolic Networks and Pathways, Software

Impact by BIP!

	selected citations	64
	popularity	Top 10%
	influence	Top 10%
	impulse	Top 10%