Two Sides of the Same Coin: Exploiting the Impact of Identifiers in Neural Code Comprehension

Name: Two Sides of the Same Coin: Exploiting the Impact of Identifiers in Neural Code Comprehension
Keywords: Source codes, FOS: Computer and information sciences, Neural code, F1 scores, Multitask learning, Model prediction, 330, Code comprehension, Comprehension models, Software Engineering

Shuzheng Gao; Cuiyun Gao 0001; Chaozheng Wang; Jun Sun 0001; David Lo 0001; Yue Yu 0001

Found an issue? Give us feedback

downloadFull-Text

Institutional Knowle...arrow_drop_down

Institutional Knowledge (InK) at Singapore Management University

Article . 2023

License: CC BY NC ND

Full-Text: https://ink.library.smu.edu.sg/sis_research/9270

Data sources: Bielefeld Academic Search Engine (BASE)

arXiv.org e-Print Archive

Preprint . 2022

Data sources: arXiv.org e-Print Archive

https://doi.org/10.1109/icse48...

Article . 2023 . Peer-reviewed

License: STM Policy #29

Data sources: Crossref

https://dx.doi.org/10.48550/ar...

Article . 2022

License: arXiv Non-Exclusive Distribution

Data sources: Datacite

DBLP

Conference object

Data sources: DBLP

Two Sides of the Same Coin: Exploiting the Impact of Identifiers in Neural Code Comprehension

descriptionPublicationkeyboard_double_arrow_right Article , Preprint , Conference object 01 May 2023Embargo end date: 01 Jan 2022 Singapore Publisher:IEEEJournal:2023 IEEE/ACM 45th International Conference on Software Engineering (ICSE)

Authors: Shuzheng Gao; Cuiyun Gao 0001; Chaozheng Wang; Jun Sun 0001; David Lo 0001; Yue Yu 0001;

doi: 10.1109/icse48619.2023.00164 , 10.48550/arxiv.2207.11104

arXiv: 2207.11104

Two Sides of the Same Coin: Exploiting the Impact of Identifiers in Neural Code Comprehension

- Summary
- Subjects
- Related research
  (2)
- Metrics

Abstract

Previous studies have demonstrated that neural code comprehension models are vulnerable to identifier naming. By renaming as few as one identifier in the source code, the models would output completely irrelevant results, indicating that identifiers can be misleading for model prediction. However, identifiers are not completely detrimental to code comprehension, since the semantics of identifier names can be related to the program semantics. Well exploiting the two opposite impacts of identifiers is essential for enhancing the robustness and accuracy of neural code comprehension, and still remains under-explored. In this work, we propose to model the impact of identifiers from a novel causal perspective, and propose a counterfactual reasoning-based framework named CREAM. CREAM explicitly captures the misleading information of identifiers through multi-task learning in the training stage, and reduces the misleading impact by counterfactual inference in the inference stage. We evaluate CREAM on three popular neural code comprehension tasks, including function naming, defect detection and code classification. Experiment results show that CREAM not only significantly outperforms baselines in terms of robustness (e.g., +37.9% on the function naming task at F1 score), but also achieve improved results on the original datasets (e.g., +0.5% on the function naming task at F1 score).

Accepted to ICSE'2023

Country

Singapore

Related Organizations

Harbin Institute of Technology
China (People's Republic of)
Peng Cheng Laboratory
China (People's Republic of)
National University of Defense Technolog
National University of Defense Technolog
China (People's Republic of)
National University of Defense Technology

View all View all

Keywords

Source codes, FOS: Computer and information sciences, Neural code, F1 scores, Multitask learning, Model prediction, 330, Code comprehension, Comprehension models, Software Engineering, 004, Software Engineering (cs.SE), Computer Science - Software Engineering, Counterfactuals, Program semantics, Misleading informations

2 Research products, page 1 of 1

tree-sitter software on GitHub
IsRelatedTo
CREAM software on GitHub
IsRelatedTo

Impact byBIP!

	selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	3
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Top 10%
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Average
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Average