
Abstract Motivation: In the last few years, a growing interest in biology has been shifting toward the problem of optimal information extraction from the huge amount of data generated via large-scale and high-throughput techniques. One of the most relevant issues has recently emerged that of correctly and reliably predicting the functions of a given protein with that of functions exploiting information coming from the whole network of proteins physically interacting with the functionally undetermined one. In the present work, we will refer to an ‘observed’ protein as the one present in the protein–protein interaction networks published in the literature. Methods: The method proposed in this paper is based on a message passing algorithm known as Belief Propagation, which accepts the network of protein's physical interactions and a catalog of known protein's functions as input, and returns the probabilities for each unclassified protein of having one chosen function. The implementation of the algorithm allows for fast online analysis, and can easily be generalized into more complex graph topologies taking into account hypergraphs, i.e. complexes of more than two interacting proteins. Results: Benchmarks of our method are the two Saccharomyces cerevisiae protein–protein interaction networks and the Database of Interacting Proteins. The validity of our approach is successfully tested against other available techniques. Contact: leone@isiosf.isi.it Supplementary information: http://isiosf.isi.it/~pagnani
Saccharomyces cerevisiae Proteins, Information Storage and Retrieval, Proteins, FOS: Physical sciences, Saccharomyces cerevisiae, Disordered Systems and Neural Networks (cond-mat.dis-nn), Condensed Matter - Disordered Systems and Neural Networks, Quantitative Biology - Quantitative Methods, Models, Biological, Structure-Activity Relationship, Models, Chemical, Sequence Analysis, Protein, FOS: Biological sciences, Protein Interaction Mapping, Computer Simulation, Algorithms, Quantitative Methods (q-bio.QM), Signal Transduction
Saccharomyces cerevisiae Proteins, Information Storage and Retrieval, Proteins, FOS: Physical sciences, Saccharomyces cerevisiae, Disordered Systems and Neural Networks (cond-mat.dis-nn), Condensed Matter - Disordered Systems and Neural Networks, Quantitative Biology - Quantitative Methods, Models, Biological, Structure-Activity Relationship, Models, Chemical, Sequence Analysis, Protein, FOS: Biological sciences, Protein Interaction Mapping, Computer Simulation, Algorithms, Quantitative Methods (q-bio.QM), Signal Transduction
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 23 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Average | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Top 10% | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Top 10% |
