Downloads provided by UsageCounts
Abstract - With the advancement in the Information Technology and by the use of various Machine Learning techniques several models were built for predicting DM but majority of the algorithms exhibited an accuracy rate of 70%- 90%. This clearly proclaims that still there is a need to build an efficient model capable of classifying distinctly. This paper aims at classifying the subjects into Diabetic and Non-Diabetic classes using the dataset drawn from the National Institute of Diabetic and Digestive and Kidney disease. SMOTE, an oversampling technique which overcomes the class imbalance problem is experimented on the dataset such that the classification dataset does not have a skewed proportion. The class balanced dataset is trained using the XGBoost algorithm, an ensemble technique akin to decision tree that makes use of Gradient Boosting framework out-turn an accuracy score of 97%..
Keywords – Diabetes Mellitus, XGBoost, SMOTE (Synthetic Minority Oversampling Technique)
Keywords – Diabetes Mellitus, XGBoost, SMOTE (Synthetic Minority Oversampling Technique)
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 0 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Average | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Average | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Average |
| views | 18 | |
| downloads | 17 |

Views provided by UsageCounts
Downloads provided by UsageCounts