Powered by OpenAIRE graph
ZENODO · Dataset · 2025 · Data source: ZENODO

# UNICODE

Authors: Li, Songrui
## Abstract

## Contents

This repository provides the official implementation of the paper "Enhancing Code Model Robustness Against Identifier Renaming via Unified Code Normalization".

## Description

Deep code models (DCMs) are increasingly deployed in security-critical applications. Yet their vulnerability to adversarial perturbations, such as subtle identifier renaming, poses significant risks, as these changes can induce out-of-distribution inputs and cause insecure predictions. A key challenge lies in defending against such attacks without prior knowledge of adversarial patterns: the space of possible perturbations is virtually infinite, and conventional rule-based defenses fail to generalize. To address this challenge, we focus on defending against renaming-based adversarial attacks, which have the most significant impact on DCMs' security, and propose a novel two-stage defense framework named UniCode, which proactively normalizes adversarial inputs into uniformly in-distribution representations. Please refer to overview.jpg for a detailed overview of our method's structure. Specifically, the first stage strips all identifiers into placeholders, eliminating adversarial influence while preserving code structure and functionality; the second stage reconstructs semantically meaningful identifiers by leveraging contextual understanding from large language models, ensuring that the full code semantics are preserved. By fine-tuning code models on the normalized distribution, UniCode renders them inherently robust against diverse renaming attacks without requiring attack-specific adaptations. To evaluate our approach, we conducted a comprehensive comparison with state-of-the-art baseline methods.
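To make the first stage concrete, the snippet below is an illustrative Python sketch, not the repository's `abstract.py`: it masks every identifier in a code fragment with a `VAR_i` placeholder using the standard `tokenize` module, leaving keywords, literals, and structure untouched. The function name and placeholder scheme are assumptions for illustration only.

```python
import io
import keyword
import tokenize

def abstract_identifiers(code):
    """Stage-1 sketch: replace each distinct identifier with VAR_i,
    keeping keywords, literals, and code structure intact."""
    mapping = {}      # original identifier -> placeholder
    out_tokens = []
    for tok in tokenize.generate_tokens(io.StringIO(code).readline):
        if tok.type == tokenize.NAME and not keyword.iskeyword(tok.string):
            # First occurrence gets a fresh placeholder; repeats reuse it.
            placeholder = mapping.setdefault(tok.string, f"VAR_{len(mapping)}")
            out_tokens.append((tok.type, placeholder))
        else:
            out_tokens.append((tok.type, tok.string))
    return tokenize.untokenize(out_tokens), mapping
```

For example, `abstract_identifiers("def add(a, b):\n    return a + b\n")` maps `add`, `a`, and `b` to `VAR_0`, `VAR_1`, and `VAR_2`, so any adversarial renaming of those identifiers yields the same normalized form.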
The experimental results demonstrate that UniCode achieves the best defense performance on 82.22% of subjects, with average improvements over the baselines ranging from 9.86% to 46.1% in defense effectiveness, indicating the superior performance of UniCode.

## Structure

Here we briefly describe each directory in UNICODE:

```
├─ALERT          (baseline; for each baseline we provide code for three models/datasets due to space limits)
│ ├─codebert
│ ├─codet5
│ ├─graphcodebert
├─CDenoise       (baseline, CodeDenoise)
├─CODA           (baseline)
├─ITGen          (baseline)
├─MARVEL         (baseline)
├─CodeTAE        (baseline)
├─code           (our approach, UniCode)
│ ├─abstract.py                            (abstracts identifier names)
│ ├─replace_method_name.py                 (abstracts function names)
│ ├─normalization.py                       (code instantiation using an LLM)
│ ├─build_txt.py                           (data pre-processing)
│ ├─VulnerabilityPrediction_build_jsonl.py (data pre-processing)
├─python_parser  (parses code for further analysis)
```

## Datasets/Models

The models and datasets used in this paper are listed below.
All experimental datasets are provided in data.zip, and the corresponding model weights are stored in the current directory.

| Task | Model | Train/Val/Test | Acc. |
| :--: | :--: | :--: | :--: |
| Clone Detection | CodeBERT | 90,102/4,000/4,000 | 96.88% |
| | GCBERT | | 96.73% |
| | CodeT5 | | 96.40% |
| | CodeT5Plus | | 97.47% |
| Vulnerability Prediction | CodeBERT | 21,854/2,732/2,732 | 63.76% |
| | GCBERT | | 64.13% |
| | CodeT5 | | 59.99% |
| | CodeT5Plus | | 58.13% |
| Defect Prediction | CodeBERT | 27,058/–/6,764 | 84.37% |
| | GCBERT | | 84.89% |
| | CodeT5 | | 88.82% |
| | CodeT5Plus | | 88.99% |

## Requirements

- python==3.7.7
- transformers==4.8.2
- pytorch==1.5.1
- pandas==1.3.0

## Reproducibility

### Usage Instructions

To use our abstraction framework and instantiation methods, follow these steps:

1. **Configure paths.** Modify the following variables in the code:

```
input_path = "your/input/path"    # replace with your input directory
output_path = "your/output/path"  # specify the desired output location
```

2. **Set up the API key.** Replace the placeholder with your LLM provider's API key:

```
api_key = "your_api_key_here"  # e.g. DeepSeek, GPT-4o-mini
```

3. **Run the core pipeline:**

```
cd code
python abstract.py --dataset <dataset> --model <model>
python normalization.py --dataset <dataset> --model <model>
```

`--dataset` refers to the dataset used for evaluation; `--model` refers to the model used for evaluation.
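The instantiation step performed by `normalization.py` asks an LLM to propose names for the placeholders and substitutes them back into the abstracted code. The sketch below is a hypothetical illustration of that prompt-and-apply pattern; the `VAR_i: name` reply format, the function names, and the prompt wording are all assumptions, not the repository's actual implementation.

```python
import re

PROMPT_TEMPLATE = (
    "The identifiers in the code below were masked as VAR_i placeholders.\n"
    "Propose a meaningful name for each placeholder, one per line, "
    "in the form 'VAR_i: name'.\n\n{code}"
)

def build_prompt(abstract_code):
    # Prompt sent to the LLM provider (e.g. DeepSeek or GPT-4o-mini).
    return PROMPT_TEMPLATE.format(code=abstract_code)

def apply_reply(abstract_code, llm_reply):
    """Parse 'VAR_i: name' lines from the LLM reply and substitute
    each placeholder back into the abstracted code."""
    for line in llm_reply.splitlines():
        m = re.match(r"\s*(VAR_\d+)\s*:\s*([A-Za-z_]\w*)", line)
        if m:
            # \b guards keep VAR_1 from matching inside VAR_10.
            abstract_code = re.sub(rf"\b{m.group(1)}\b", m.group(2), abstract_code)
    return abstract_code
```

For instance, applying the reply `"VAR_0: square\nVAR_1: x"` to `"def VAR_0(VAR_1):\n    return VAR_1 * VAR_1\n"` yields a fully renamed, in-distribution function ready for fine-tuning.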
