
AbstractMotivationAccumulated clinical studies show that microbes living in humans interact closely with human hosts, and get involved in modulating drug efficacy and drug toxicity. Microbes have become novel targets for the development of antibacterial agents. Therefore, screening of microbe–drug associations can benefit greatly drug research and development. With the increase of microbial genomic and pharmacological datasets, we are greatly motivated to develop an effective computational method to identify new microbe–drug associations.ResultsIn this article, we proposed a novel method, Graph2MDA, to predict microbe–drug associations by using variational graph autoencoder (VGAE). We constructed multi-modal attributed graphs based on multiple features of microbes and drugs, such as molecular structures, microbe genetic sequences and function annotations. Taking as input the multi-modal attribute graphs, VGAE was trained to learn the informative and interpretable latent representations of each node and the whole graph, and then a deep neural network classifier was used to predict microbe–drug associations. The hyperparameter analysis and model ablation studies showed the sensitivity and robustness of our model. We evaluated our method on three independent datasets and the experimental results showed that our proposed method outperformed six existing state-of-the-art methods. We also explored the meaning of the learned latent representations of drugs and found that the drugs show obvious clustering patterns that are significantly consistent with drug ATC classification. Moreover, we conducted case studies on two microbes and two drugs and found 75–95% predicted associations have been reported in PubMed literature. Our extensive performance evaluations validated the effectiveness of our proposed method.Availability and implementationSource codes and preprocessed data are available at https://github.com/moen-hyb/Graph2MDA.Supplementary informationSupplementary data are available at Bioinformatics online.
FOS: Computer and information sciences, Computer Science - Machine Learning, FOS: Biological sciences, Humans, Computational Biology, Neural Networks, Computer, Quantitative Biology - Quantitative Methods, Algorithms, Software, Quantitative Methods (q-bio.QM), Machine Learning (cs.LG)
FOS: Computer and information sciences, Computer Science - Machine Learning, FOS: Biological sciences, Humans, Computational Biology, Neural Networks, Computer, Quantitative Biology - Quantitative Methods, Algorithms, Software, Quantitative Methods (q-bio.QM), Machine Learning (cs.LG)
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 67 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Top 1% | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Top 10% | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Top 1% |
