
The advancement of information technology has introduced new challenges in cybersecurity, especially related to the Android platform which is the main target of malicious software (malware) attacks. The National Cyber and Crypto Agency (BSSN) of Indonesia reported millions of incidents involving Android Package Kit (.apk) files related to electronic wedding invitations. This study aims to develop a robust and efficient static analysis-based machine learning framework for early detection of Android malware. Six machine learning algorithms Support Vector Machine (SVM), Random Forest (RF), K-Nearest Neighbors (KNN), Naive Bayes, AdaBoost, and Gradient Boosting are used to identify malicious behavior in APK files. The dataset used consists of 2,084 Android applications, including 1,314 malware samples and 770 benign applications, obtained through a reverse engineering process. Data pre-processing, feature extraction, and training using supervised learning are carried out to optimize detection accuracy. The experimental results show that the Random Forest algorithm achieves the best performance with 97% accuracy and 95% precision, surpassing the performance of other algorithms.
malware android, machine learning, malware detection, Electrical engineering. Electronics. Nuclear engineering, Reverse engineering, static malware analysis, TK1-9971
malware android, machine learning, malware detection, Electrical engineering. Electronics. Nuclear engineering, Reverse engineering, static malware analysis, TK1-9971
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 0 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Average | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Average | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Average |
