
Abstract

Software testing is crucial in continuous integration (CI). Ideally, at every commit, all test cases should be executed and, moreover, new test cases should be generated for the new source code. This is especially true in a Continuous Test Generation (CTG) environment, where the automatic generation of test cases is integrated into the continuous integration pipeline. In this context, developers want to achieve a certain minimum level of coverage for every software build. However, executing all the test cases and, moreover, generating new ones for all the classes at every commit is not feasible. As a consequence, developers have to select which subset of classes should be tested and/or targeted by test-case generation. We argue that knowing a priori the branch coverage that can be achieved with test-data generation tools can help developers make informed decisions about those issues. In this paper, we investigate the possibility of using source-code metrics to predict the coverage achieved by test-data generation tools. We use four different categories of source-code features and assess the prediction on a large data set involving more than 3,000 Java classes. We compare different machine learning algorithms and conduct a fine-grained feature analysis aimed at investigating the factors that most impact prediction accuracy. Moreover, we extend our investigation to four different search budgets. Our evaluation shows that the best model achieves an average MAE of 0.15 and 0.21 on nested cross-validation over the different budgets on EvoSuite and Randoop, respectively. Finally, the discussion of the results demonstrates the relevance of coupling-related features for prediction accuracy.
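The evaluation setup described above — regressing branch coverage on source-code features and estimating error via nested cross-validation — can be sketched as follows. This is a minimal illustration, not the paper's implementation: the random-forest model, hyper-parameter grid, and synthetic stand-in data are all assumptions for demonstration.

```python
# Hypothetical sketch (not the paper's pipeline): predict branch coverage
# from per-class source-code metrics, estimating MAE via nested cross-validation.
import numpy as np
from sklearn.ensemble import RandomForestRegressor
from sklearn.model_selection import GridSearchCV, cross_val_score

rng = np.random.default_rng(0)

# Toy stand-in for per-class features (e.g. size, complexity, coupling metrics)
# and the branch coverage a test-generation tool achieved on each class ([0, 1]).
X = rng.random((200, 6))
y = np.clip(0.5 * X[:, 0] + 0.3 * X[:, 3] + rng.normal(0.0, 0.1, 200), 0.0, 1.0)

# Inner loop tunes hyper-parameters; outer loop yields an unbiased error estimate.
inner = GridSearchCV(
    RandomForestRegressor(random_state=0),
    param_grid={"n_estimators": [50, 100]},
    scoring="neg_mean_absolute_error",
    cv=3,
)
outer_scores = cross_val_score(inner, X, y, scoring="neg_mean_absolute_error", cv=5)
mae = -outer_scores.mean()
print(f"nested-CV MAE: {mae:.3f}")
```

The nested structure matters here: tuning and error estimation on the same folds would bias the reported MAE downward, which is presumably why the paper reports nested cross-validation scores.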
1712 Software, 10009 Department of Informatics, 005: Computerprogrammierung, Programme und Daten, 000 Computer science, knowledge & systems
Citation indicators:
| selected citations (derived from selected sources) | 21 |
| popularity (current attention in the research community, based on the citation network) | Top 10% |
| influence (overall/total impact, diachronic, based on the citation network) | Top 10% |
| impulse (initial momentum directly after publication) | Top 10% |
