descriptionPublicationkeyboard_double_arrow_right Article , Preprint 10 Oct 2022Embargo end date: 01 Jan 2022Publisher:ACMJournal:Proceedings of the 37th IEEE/ACM International Conference on Automated Software Engineering

Authors: Qing Huang; Zhiqiang Yuan; Zhenchang Xing; Xiwei Xu; Liming Zhu; Qinghua Lu;

doi: 10.1145/3551349.3556912 , 10.48550/arxiv.2208.05361

arXiv: 2208.05361

Prompt-tuned Code Language Model as a Neural Knowledge Base for Type Inference in Statically-Typed Partial Code

- Summary
- Subjects
- Related research
  (5)
- Metrics

Abstract

Partial code usually involves non-fully-qualified type names (non-FQNs) and undeclared receiving objects. Resolving the FQNs of these non-FQN types and undeclared receiving objects (referred to as type inference) is the prerequisite to effective search and reuse of partial code. Existing dictionary-lookup based methods build a symbolic knowledge base of API names and code contexts, which involve significant compilation overhead and are sensitive to unseen API names and code context variations. In this paper, we formulate type inference as a cloze-style fill-in-blank language task. Built on source code naturalness, our approach fine-tunes a code masked language model (MLM) as a neural knowledge base of code elements with a novel "pre-train, prompt and predict" paradigm from raw source code. Our approach is lightweight and has minimum requirements on code compilation. Unlike existing symbolic name and context matching for type inference, our prompt-tuned code MLM packs FQN syntax and usage in its parameters and supports fuzzy neural type inference. We systematically evaluate our approach on a large amount of source code from GitHub and Stack Overflow. Our results confirm the effectiveness of our approach design and the practicality for partial code type inference. As the first of its kind, our neural type inference method opens the door to many innovative ways of using partial code.

The submitted paper has been accepted by ASE 2022. If possible, please expedite the approval process. Thank you very much

Related Organizations

Commonwealth Science and Industrial Research Organisation, Oceans and Atmosphere
Australia
Jiangxi Normal University
China (People's Republic of)
Australian National University
Australia
Guangxi Normal University
China (People's Republic of)
Commonwealth Scientific and Industrial Research Organisation
Australia

View all View all

Keywords

Software Engineering (cs.SE), FOS: Computer and information sciences, Computer Science - Software Engineering

5 Research products, page 1 of 1

Input-Tuning: Adapting Unfamiliar Inputs to Frozen Pretrained Models
2022IsAmongTopNSimilarDocuments
Continuous Detection, Rapidly React: Unseen Rumors Detection based on Continual Prompt-Tuning
2022IsAmongTopNSimilarDocuments
Eliciting Knowledge from Pretrained Language Models for Prototypical Prompt Verbalizer
2022IsAmongTopNSimilarDocuments
Automating Method Naming with Context-Aware Prompt-Tuning
2023IsAmongTopNSimilarDocuments
PromptEM
2022IsAmongTopNSimilarDocuments

Impact byBIP!

	selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	34
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Top 10%
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Top 10%
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Top 1%

Found an issue? Give us feedback

Top 10%

Top 1%

Green

Fields of Science

engineering and technology

electrical engineering, electronic engineering, information engineering

Fields of Science

engineering and technology

electrical engineering, electronic engineering, information engineering

Prompt-tuned Code Language Model as a Neural Knowledge Base for Type Inference in Statically-Typed Partial Code

Prompt-tuned Code Language Model as a Neural Knowledge Base for Type Inference in Statically-Typed Partial Code

5 Research products, page 1 of 1

Input-Tuning: Adapting Unfamiliar Inputs to Frozen Pretrained Models

Continuous Detection, Rapidly React: Unseen Rumors Detection based on Continual Prompt-Tuning

Eliciting Knowledge from Pretrained Language Models for Prototypical Prompt Verbalizer

Automating Method Naming with Context-Aware Prompt-Tuning

PromptEM