ASPDup: AST-Sequence-based Progressive Duplicate Code Detection Tool for Onsite Programming Code

Yaoshen Yu; Zhiqiu Huang; Yu Zhou; Weiwei Li; Yichao Shao

Found an issue? Give us feedback

https://doi.org/10.1...arrow_drop_down

https://doi.org/10.1145/345791...

Article . 2020 . Peer-reviewed

License: https://www.acm.org/publications/policies/copyright_policy#Background

Data sources: Crossref

https://dx.doi.org/10.1145/345...

Article

Data sources: Microsoft Academic Graph

ASPDup: AST-Sequence-based Progressive Duplicate Code Detection Tool for Onsite Programming Code

descriptionPublicationkeyboard_double_arrow_right Article 01 Nov 2020Publisher:ACMJournal:12th Asia-Pacific Symposium on Internetware

Authors: Yaoshen Yu; Zhiqiu Huang; Yu Zhou; Weiwei Li; Yichao Shao;

doi: 10.1145/3457913.3457938

ASPDup: AST-Sequence-based Progressive Duplicate Code Detection Tool for Onsite Programming Code

- Summary
- Metrics

Abstract

Duplicate code is an example of bad smells, which are usually been refactored after the detection to improve the quality of programs. Locate the duplicate code at the programming phase may reduce the cost of maintenance, but the challenge is it need to detect duplicate code between an incomplete code fragment with complete files, which the existing tools are hard to be applied to this scenario. In this paper, we propose an AST-sequence-based duplicate code detection approach for onsite programming code. The abstract syntax tree (AST) is extracted from source code and then is transformed into an encoded sequence. A local sequence alignment algorithm is used to find highly similar subsequences. After the post-processing, similar regions will be found between two code fragments according to the subsequences. We have developed a prototype tool as a plugin for Visual Studio Code. Experimental results indicate that our approach is effective in finding highly similar regions between cross-granularity code fragments, which can facilitate duplicate code detection for incomplete onsite programming code.

Related Organizations

Nanjing University of Aeronautics and Astronautics
China (People's Republic of)

Impact byBIP!

	selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	1
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Average
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Average
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Average

Found an issue? Give us feedback

1

Average

Fields of Science

engineering and technology

electrical engineering, electronic engineering, information engineering

Fields of Science

engineering and technology

electrical engineering, electronic engineering, information engineering

Upload OA version

Are you the author of this publication? Upload your Open Access version to Zenodo!

It’s fast and easy, just two clicks!

uploadUpload now