Exploiting Abstract Syntax Trees to Locate Software Defects

Name: Exploiting Abstract Syntax Trees to Locate Software Defects
Creator: Shippey, Thomas Joshua
Keywords: metrics, 13. Climate action, bug linking, code features, defect prediction, defects, code snippets, software engineering, software code constructs

Shippey, Thomas Joshua

Found an issue? Give us feedback

University of Hertfo...arrow_drop_down

University of Hertfordshire Research Archive

Doctoral thesis . 2015

Data sources: University of Hertfordshire Research Archive

University of Hertfordshire Research Archive

Doctoral thesis . 2015

Data sources: Datacite

DBLP

Doctoral thesis

Data sources: DBLP

https://dx.doi.org/10.18745/th...

Thesis

Data sources: Microsoft Academic Graph

Exploiting Abstract Syntax Trees to Locate Software Defects

descriptionPublicationkeyboard_double_arrow_right Doctoral thesis , Thesis 26 Aug 2015Embargo end date: 26 Aug 2015 United Kingdom English Publisher:UH Research Archive

Authors: Shippey, Thomas Joshua;

doi: 10.18745/th.16365

handle: 2299/16365

Exploiting Abstract Syntax Trees to Locate Software Defects

- Summary
- Subjects
- Metrics

Abstract

Context. Software defect prediction aims to reduce the large costs involved with faults in a software system. A wide range of traditional software metrics have been evaluated as potential defect indicators. These traditional metrics are derived from the source code or from the software development process. Studies have shown that no metric clearly out performs another and identifying defect-prone code using traditional metrics has reached a performance ceiling. Less traditional metrics have been studied, with these metrics being derived from the natural language of the source code. These newer, less traditional and finer grained metrics have shown promise within defect prediction. Aims. The aim of this dissertation is to study the relationship between short Java constructs and the faultiness of source code. To study this relationship this dissertation introduces the concept of a Java sequence and Java code snippet. Sequences are created by using the Java abstract syntax tree. The ordering of the nodes within the abstract syntax tree creates the sequences, while small sub sequences of this sequence are the code snippets. The dissertation tries to find a relationship between the code snippets and faulty and non-faulty code. This dissertation also looks at the evolution of the code snippets as a system matures, to discover whether code snippets significantly associated with faulty code change over time. Methods. To achieve the aims of the dissertation, two main techniques have been developed; finding defective code and extracting Java sequences and code snippets. Finding defective code has been split into two areas - finding the defect fix and defect insertion points. To find the defect fix points an implementation of the bug-linking algorithm has been developed, called S + e . Two algorithms were developed to extract the sequences and the code snippets. The code snippets are analysed using the binomial test to find which ones are significantly associated with faulty and non-faulty code. These techniques have been performed on five different Java datasets; ArgoUML, AspectJ and three releases of Eclipse.JDT.core Results. There are significant associations between some code snippets and faulty code. Frequently occurring fault-prone code snippets include those associated with identifiers, method calls and variables. There are some code snippets significantly associated with faults that are always in faulty code. There are 201 code snippets that are snippets significantly associated with faults across all five of the systems. The technique is unable to find any significant associations between code snippets and non-faulty code. The relationship between code snippets and faults seems to change as the system evolves with more snippets becoming fault-prone as Eclipse.JDT.core evolved over the three releases analysed. Conclusions. This dissertation has introduced the concept of code snippets into software engineering and defect prediction. The use of code snippets offers a promising approach to identifying potentially defective code. Unlike previous approaches, code snippets are based on a comprehensive analysis of low level code features and potentially allow the full set of code defects to be identified. Initial research into the relationship between code snippets and faults has shown that some code constructs or features are significantly related to software faults. The significant associations between code snippets and faults has provided additional empirical evidence to some already researched bad constructs within defect prediction. The code snippets have shown that some constructs significantly associated with faults are located in all five systems, and although this set is small finding any defect indicators that transfer successfully from one system to another is rare.

Country

United Kingdom

Related Organizations

University of Hertfordshire
United Kingdom

Keywords

metrics, bug linking, code features, defect prediction, defects, code snippets, software engineering, software code constructs

Impact byBIP!

	selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	0
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Average
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Average
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Average

Found an issue? Give us feedback

0

Average

Green

Beta

SDGs Suggest

13. Climate action

Beta

SDGs:

13. Climate action,

Related to Research communities

Knowmad Institut