descriptionPublicationkeyboard_double_arrow_right Article , Other literature type 01 Jan 2020Publisher:Institute of Electrical and Electronics Engineers (IEEE)Journal:IEEE Open Journal of the Computer Society, volume 1, pages 262-275 (eissn: 2644-1268,

Authors: Tzu-Ling Wan; Tao Ban; Shin-Ming Cheng; Yen-Ting Lee; Bo Sun; Ryoichi Isawa; Takeshi Takahashi; +1 Authors

doi: 10.1109/ojcs.2020.3033974

Efficient Detection and Classification of Internet-of-Things Malware Based on Byte Sequences from Executable Files

- Summary
- Subjects
- Related research
  (3)
- Metrics

Abstract

Simple implementation and autonomous operation features make the Internet-of-Things (IoT) vulnerable to malware attacks. Static analysis of IoT malware executable files is a feasible approach to understanding the behavior of IoT malware for mitigation and prevention. However, current analytic approaches based on opcodes or call graphs typically do not work well with diversity in central processing unit (CPU) architectures and are often resource intensive. In this paper, we propose an efficient method for leveraging machine learning methods to detect and classify IoT malware programs. We show that reliable and efficient detection and classification can be achieved by exploring the essential discriminating information stored in the byte sequences at the entry points of executable programs. We demonstrate the performance of the proposed method using a large-scale dataset consisting of 111K benignware and 111K malware programs from seven CPU architectures. The proposed method achieves near optimal generalization performance for malware detection (99.96% accuracy) and for malware family classification (98.47% accuracy). Moreover, when CPU architecture information is considered in learning, the proposed method combined with support vector machine classifiers can yield even higher generalization performance using fewer bytes from the executable files. The findings in this paper are promising for implementing light-weight malware protection on IoT devices with limited resources.

Related Organizations

Keywords

machine learning, static analysis, Computer security, Electronic computers. Computer science, QA75.5-76.95, Information technology, malware analysis, T58.5-58.64, binary code

3 Research products, page 1 of 1

pftools3 software on GitHub
IsRelatedTo
Mirai-Source-Code software on GitHub
IsRelatedTo
radare2 software on GitHub
IsRelatedTo

Impact byBIP!

	selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	24
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Top 10%
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Top 10%
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Top 10%