COMET: Coverage-guided Model Generation For Deep Learning Library Testing

descriptionPublicationkeyboard_double_arrow_right Article , Preprint 21 Jul 2023Embargo end date: 01 Jan 2022 China (People's Republic of) English Publisher:Association for Computing Machinery (ACM)Journal:ACM Transactions on Software Engineering and Methodology, volume 32, pages 1-34 (issn: 1049-331X, eissn: 1557-7392,

Copyright policy )

Authors: Meiziniu Li; Jialun Cao; Yongqiang Tian 0001; Tsz On Li; Ming Wen 0001; Shing-Chi Cheung;

doi: 10.1145/3583566 , 10.48550/arxiv.2208.01508

arXiv: 2208.01508

COMET: Coverage-guided Model Generation For Deep Learning Library Testing

- Summary
- Subjects
- Metrics

Abstract

Recent deep learning (DL) applications are mostly built on top of DL libraries. The quality assurance of these libraries is critical to the dependable deployment of DL applications. Techniques have been proposed to generate various DL models and apply them to test these libraries. However, their test effectiveness is constrained by the diversity of layer API calls in their generated DL models. Our study reveals that these techniques can cover at most 34.1% layer inputs, 25.9% layer parameter values, and 15.6% layer sequences. As a result, we find that many bugs arising from specific layer API calls (i.e., specific layer inputs, parameter values, or layer sequences) can be missed by existing techniques. Because of this limitation, we propose COMET to effectively generate DL models with diverse layer API calls for DL library testing. COMET: (1) designs a set of mutation operators and a coverage-based search algorithm to diversify layer inputs, layer parameter values, and layer sequences in DL models. (2) proposes a model synthesis method to boost the test efficiency without compromising the layer API call diversity. Our evaluation result shows that COMET outperforms baselines by covering twice as many layer inputs (69.7% vs. 34.1%), layer parameter values (50.2% vs. 25.9%), and layer sequences (39.0% vs. 15.6%) as those by the state-of-the-art. Moreover, COMET covers 3.4% more library branches than those by existing techniques. Finally, COMET detects 32 new bugs in the latest version of eight popular DL libraries, including TensorFlow and MXNet, with 21 of them confirmed by DL library developers and seven of those confirmed bugs have been fixed by developers.

Country

China (People's Republic of)

Related Organizations

Huazhong University of Science and Technology
China (People's Republic of)
The Hong Kong University of Science and Technology (Guangzhou)
China (People's Republic of)
Hong Kong University of Science and Technology
Hong Kong
University of Waterloo
Canada
Guangzhou HKUST Fok Ying Tung Research Institute
China (People's Republic of)

View all View all

Keywords

Software Engineering (cs.SE), FOS: Computer and information sciences, Computer Science - Software Engineering, Artificial Intelligence (cs.AI), I.2.5, Computer Science - Artificial Intelligence, D.2.5, D.2.5; I.2.5

Impact byBIP!

	selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	23
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Top 10%
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Top 10%
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Top 10%

Found an issue? Give us feedback

23

Top 10%

Green

Fields of Science (4) View all

engineering and technology

electrical engineering, electronic engineering, information engineering

Fields of Science

engineering and technology

electrical engineering, electronic engineering, information engineering

View all