
Abstract This research paper investigates a machine learning based approach for Drug-Drug Interaction (DDI) extraction for determining the side effects of multi drugs when prescribed simultaneously. In our proposed model we used TAC 2017 Dataset, which has Adverse Drug Reactions (ADRs) data for the classification ofdrug-drug interaction. TAC 2017 Dataset has various types of information which are related to drugs and their interactions. Our method uses Term Frequency-Inverse Document Frequency (TF-IDF) to transform the textual descriptions ofside effects for DDI into numerical feature vectors, followed by a Random Forest Classifier, Gradient Boosting, BioBERT, Support Vector Machine (SVM) Algorithms to predict the potential interactions between drug pairs. One of the key strength of the Random Forest approach is its ability to provide feature importance scores, which allows us to interpret which side effects are most influential in predicting drug interactions. The key advantage of Gradient Boosting is its high predictive performance combined with interpretability. It is able to handle complex, structured data efficiently. Additionally, the model’s decisions are more transparent, which is necessary in the biomedical domain. The advantageof SVM is its ability to handle high-dimensional data, capture complex non-linear interactions using kernel functions, and generalize with datasets, making it robust to over-fitting. BioBERT (Bidirectional Encoder Representations from Transformers for Biomedical Text Mining) is advantageous for DDI prediction due to its biomedical domain knowledge, contextual understanding of complex drug-related texts.These method captures the relevance and importance of each side effect of multi drugs and also generate pairs of drugs from the dataset. Our model demonstrates competitive performance in DDI prediction, which highlights the utility of text-based feature extraction combined with an interpretable ensemble learning model.
Medical Neglect, Artificial intelligence, Faculty, Medical, Medical Errors, Education, Medical, Libraries, Medical, Genetics, Medical, modal, Medical genetics, Medical biotechnology, Psychology, Medical, Medical Overuse, Journalism, Medical, Medical ethics, Licensure, Medical, cross-modality, Medical Writing, medical, Electronics, Medical, Economics, Medical, Medical Tourism, Medical Countermeasures, Medical Secretaries, Medical Chaperones
Medical Neglect, Artificial intelligence, Faculty, Medical, Medical Errors, Education, Medical, Libraries, Medical, Genetics, Medical, modal, Medical genetics, Medical biotechnology, Psychology, Medical, Medical Overuse, Journalism, Medical, Medical ethics, Licensure, Medical, cross-modality, Medical Writing, medical, Electronics, Medical, Economics, Medical, Medical Tourism, Medical Countermeasures, Medical Secretaries, Medical Chaperones
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 0 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Average | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Average | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Average |
