Recently the prompt-tuning paradigm has attracted significant attention. By only tuning continuous prompts with a frozen pre-trained language model (PLM), prompt-tuning takes a step towards deploying a shared frozen PLM to serve numerous downstream tasks. Although prompt-tuning shows good performance on certain natural language understanding (NLU) tasks, its effectiveness on natural language generation (NLG) tasks is still under-explored. In this paper, we argue that one of the factors hindering the development of prompt-tuning on NLG tasks is the unfamiliar inputs (i.e., inputs are linguistically different from the pretraining corpus). For example, our preliminary exploration reveals a large performance gap between prompt-tuning and fine-tuning when unfamiliar inputs occur frequently in NLG tasks. This motivates us to propose input-tuning, which fine-tunes both the continuous prompts and the input representations, leading to a more effective way to adapt unfamiliar inputs to frozen PLMs. Our proposed input-tuning is conceptually simple and empirically powerful. Experimental results on seven NLG tasks demonstrate that input-tuning is significantly and consistently better than prompt-tuning. Furthermore, on three of these tasks, input-tuning can achieve a comparable or even better performance than fine-tuning.

Related Organizations

Peking University
China (People's Republic of)
Xi’an Jiaotong-Liverpool University
China (People's Republic of)
Hebei University
China (People's Republic of)
Peking University
China (People's Republic of)
Microsoft Research Asia (China)
China (People's Republic of)

View all View all

Keywords

FOS: Computer and information sciences, Computer Science - Computation and Language, Artificial Intelligence (cs.AI), Computer Science - Artificial Intelligence, Computation and Language (cs.CL)

10 Research products, page 1 of 1

Prompt-tuned Code Language Model as a Neural Knowledge Base for Type Inference in Statically-Typed Partial Code
2022IsAmongTopNSimilarDocuments
Eliciting Knowledge from Pretrained Language Models for Prototypical Prompt Verbalizer
2022IsAmongTopNSimilarDocuments
PanDa: Prompt Transfer Meets Knowledge Distillation for Efficient Model Adaptation
2024IsAmongTopNSimilarDocuments
Knowledgeable Prompt-tuning: Incorporating Knowledge into Prompt Verbalizer for Text Classification
2022IsAmongTopNSimilarDocuments
Enhancing Entity Representations with Prompt Learning for Biomedical Entity Linking
2022IsAmongTopNSimilarDocuments
Prompt-tuning in ASR systems for efficient domain-adaptation
2021IsAmongTopNSimilarDocuments
Continuous Detection, Rapidly React: Unseen Rumors Detection based on Continual Prompt-Tuning
2022IsAmongTopNSimilarDocuments
Automating Method Naming with Context-Aware Prompt-Tuning
2023IsAmongTopNSimilarDocuments
PromptEM
2022IsAmongTopNSimilarDocuments
Long: Knowledgeable Prompt-tuning: Incorporating Knowledge into Prompt Verbalizer for Text Classification
2022IsAmongTopNSimilarDocuments

Impact byBIP!

	selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	0
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Average
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Average
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Average

Found an issue? Give us feedback

Average

Green

Fields of Science (4) View all

natural sciences

Fields of Science

natural sciences

View all