Discriminator-Enhanced Knowledge-Distillation Networks

descriptionPublicationkeyboard_double_arrow_right Article , Other literature type 10 Jul 2023 Singapore English Publisher:MDPI AGJournal:Applied Sciences, volume 13, page 8,041 (eissn: 2076-3417,

Copyright policy )

Authors: Zhenping Li; Zhen Cao; Pengfei Li; Yong Zhong; Shaobo Li;

doi: 10.3390/app13148041

handle: 10356/171765

Discriminator-Enhanced Knowledge-Distillation Networks

- Summary
- Subjects
- Metrics

Abstract

Query auto-completion (QAC) serves as a critical functionality in contemporary textual search systems by generating real-time query completion suggestions based on a user’s input prefix. Despite the prevalent use of language models (LMs) in QAC candidate generation, LM-based approaches frequently suffer from overcorrection issues during pair-wise loss training and efficiency deficiencies. To address these challenges, this paper presents a novel framework—discriminator-enhanced knowledge distillation (Dis-KD)—for the QAC task. This framework combines three core components: a large-scale pre-trained teacher model, a lightweight student model, and a discriminator for adversarial learning. Specifically, the discriminator aids in discerning generative-level differences between the teacher and the student models. An additional discriminator score loss is amalgamated with the traditional knowledge-distillation loss, resulting in enhanced performance of the student model. Contrary to the stepwise evaluation of each generated word, our approach assesses the entire generation sequence. This method alleviates the prevalent overcorrection issue in the generation process. Consequently, our proposed framework boasts improvements in model accuracy and a reduction in parameter size. Empirical results highlight the superiority of Dis-KD over established baseline methods, with the student model surpassing the teacher model in QAC tasks for sub-word languages.

Country

Singapore

Related Organizations

Nanyang Technological University
Singapore
Guizhou University
China (People's Republic of)
Chinese Academy of Sciences
China (People's Republic of)
Chinese Academy of Sciences (中国科学院)
China (People's Republic of)
University of Chinese Academy of Sciences
China (People's Republic of)

View all View all

Keywords

reinforcement learning, Technology, Engineering::Electrical and electronic engineering, QH301-705.5, T, Physics, QC1-999, Engineering (General). Civil engineering (General), Reinforcement Learning, Chemistry, knowledge distillation, query auto-completion, TA1-2040, Biology (General), QD1-999, Knowledge Distillation

Impact byBIP!

	selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	0
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Average
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Average
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Average

Found an issue? Give us feedback

0

Average

gold

Fields of Science

engineering and technology

electrical engineering, electronic engineering, information engineering

Fields of Science

engineering and technology

electrical engineering, electronic engineering, information engineering