NeuRI: Diversifying DNN Generation via Inductive Rule Inference

descriptionPublicationkeyboard_double_arrow_right Article , Preprint , Conference object 30 Nov 2023Embargo end date: 01 Jan 2023Publisher:ACMJournal:Proceedings of the 31st ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software EngineeringFunded by:NSF | SHF: Medium: Collaborativ..., NSF | CAREER: Maximal and Scala...

Authors: Jiawei Liu 0004; Jinjun Peng; Yuyao Wang; Lingming Zhang 0001;

doi: 10.1145/3611643.3616337 , 10.48550/arxiv.2302.02261

arXiv: 2302.02261

NeuRI: Diversifying DNN Generation via Inductive Rule Inference

- Summary
- Subjects
- Related research
  (5)
- Metrics

Abstract

Deep Learning (DL) is prevalently used in various industries to improve decision-making and automate processes, driven by the ever-evolving DL libraries and compilers. The correctness of DL systems is crucial for trust in DL applications. As such, the recent wave of research has been studying the automated synthesis of test-cases (i.e., DNN models and their inputs) for fuzzing DL systems. However, existing model generators only subsume a limited number of operators, lacking the ability to pervasively model operator constraints. To address this challenge, we propose NeuRI, a fully automated approach for generating valid and diverse DL models composed of hundreds of types of operators. NeuRI adopts a three-step process: (i) collecting valid and invalid API traces from various sources; (ii) applying inductive program synthesis over the traces to infer the constraints for constructing valid models; and (iii) using hybrid model generation which incorporates both symbolic and concrete operators. Our evaluation shows that NeuRI improves branch coverage of TensorFlow and PyTorch by 24% and 15% over the state-of-the-art model-level fuzzers. NeuRI finds 100 new bugs for PyTorch and TensorFlow in four months, with 81 already fixed or confirmed. Of these, 9 bugs are labelled as high priority or security vulnerability, constituting 10% of all high-priority bugs of the period. Open-source developers regard error-inducing tests reported by us as "high-quality" and "common in practice".

Related Organizations

Hebei University
China (People's Republic of)
Nanjing University
China (People's Republic of)
Nanjing University
China (People's Republic of)
Nanjing University
China (People's Republic of)
King’s University
United States

View all View all

Keywords

Software Engineering (cs.SE), FOS: Computer and information sciences, Computer Science - Software Engineering, Computer Science - Machine Learning, Machine Learning (cs.LG)

5 Research products, page 1 of 1

pytorch software on GitHub
IsRelatedTo
tensorflow software on GitHub
IsRelatedTo
pytorch software on GitHub
IsRelatedTo
pytorch software on GitHub
IsRelatedTo
neuri-artifact software on GitHub
IsRelatedTo

Impact byBIP!

	selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	16
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Top 10%
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Top 10%
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Top 10%