Prompt-tuning in ASR systems for efficient domain-adaptation

Automatic Speech Recognition (ASR) systems have found their use in numerous industrial applications in very diverse domains. Since domain-specific systems perform better than their generic counterparts on in-domain evaluation, the need for memory and compute-efficient domain adaptation is obvious. Particularly, adapting parameter-heavy transformer-based language models used for rescoring ASR hypothesis is challenging. In this work, we overcome the problem using prompt-tuning, a methodology that trains a small number of domain token embedding parameters to prime a transformer-based LM to a particular domain. With just a handful of extra parameters per domain, we achieve much better perplexity scores over the baseline of using an unadapted LM. Despite being parameter-efficient, these improvements are comparable to those of fully-fine-tuned models with hundreds of millions of parameters. We replicate our findings in perplexity numbers to Word Error Rate in a domain-specific ASR system for one such domain.

WeCNLP 2021 camera-ready

Keywords

FOS: Computer and information sciences, Computer Science - Computation and Language, Computation and Language (cs.CL)

5 Research products, page 1 of 1

Input-Tuning: Adapting Unfamiliar Inputs to Frozen Pretrained Models
2022IsAmongTopNSimilarDocuments
Eliciting Knowledge from Pretrained Language Models for Prototypical Prompt Verbalizer
2022IsAmongTopNSimilarDocuments
Continuous Detection, Rapidly React: Unseen Rumors Detection based on Continual Prompt-Tuning
2022IsAmongTopNSimilarDocuments
PromptEM
2022IsAmongTopNSimilarDocuments
Automating Method Naming with Context-Aware Prompt-Tuning
2023IsAmongTopNSimilarDocuments

Impact byBIP!

	selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	0
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Average
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Average
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Average