Debugging Debug Information With Neural Networks

descriptionPublicationkeyboard_double_arrow_right Article 01 Jan 2022 Italy Publisher:Institute of Electrical and Electronics Engineers (IEEE)Journal:IEEE Access, volume 10, pages 54,136-54,148 (eissn: 2169-3536,

Copyright policy )

Authors: Artuso F.; Di Luna G. A.; Querzoni L.;

doi: 10.1109/access.2022.3176617

handle: 11573/1639211

Debugging Debug Information With Neural Networks

- Summary
- Subjects
- Related research
  (2)
- Metrics

Abstract

The correctness of debug information included in optimized binaries has been the subject of recent attention by the research community. Indeed, it represents a practically important problem, as most of the software running in production is produced by an optimizing compiler. Current solutions rely on invariants, human-defined rules that embed the desired behavior, whose violation may indicate the presence of a bug. Although this approach proved to be effective in discovering several bugs, it is unable to identify bugs that do not trigger invariants. In this paper, we investigate the feasibility of using Deep Neural Networks (DNNs) to discover incorrect debug information. We trained a set of different models borrowed from the NLP community in an unsupervised way on a large dataset of debug traces and tested their performance on two novel datasets that we propose. Our results are positive and show that DNNs are capable of discovering bugs in both synthetic and real datasets. More interestingly, we performed a live analysis of our models by using them as bug detectors in a fuzzing system. We show that they were able to report 12 unknown bugs in the latest version of the widely used LLVM toolchain, 2 of which have been confirmed.

Country

Italy

Related Organizations

Roma Tre University
Italy
Sapienza University of Rome
Italy
National Institute for Nuclear Physics
Italy

Keywords

General Computer Science, debug information, compilers, General Engineering, General Materials Science, Bugs, Electrical engineering. Electronics. Nuclear engineering, Behavioral sciences; Bugs; Codes; Compilers; Computer bugs; Debug Information; Debugging; Neural Networks; Optimization; Software; Software Engineering; Testing, neural networks, software engineering, TK1-9971

2 Research products, page 1 of 1

NeuroDebug-2_Dataset software on GitHub
IsRelatedTo
yarpgen software on GitHub
IsRelatedTo

Impact byBIP!

	selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	3
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Top 10%
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Average
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Average