Rectification-Based Knowledge Retention for Task Incremental Learning

descriptionPublicationkeyboard_double_arrow_right Article 01 Mar 2024 United Kingdom Publisher:Institute of Electrical and Electronics Engineers (IEEE)Journal:IEEE Transactions on Pattern Analysis and Machine Intelligence, volume 46, pages 1,561-1,575 (issn: 0162-8828, eissn: 1939-3539,

Copyright policy )

Authors: Pratik Mazumder; Pravendra Singh; Piyush Rai; Vinay P. Namboodiri;

doi: 10.1109/tpami.2022.3225310

pmid: 36449592

Rectification-Based Knowledge Retention for Task Incremental Learning

- Summary
- Subjects
- Metrics

Abstract

In the task incremental learning problem, deep learning models suffer from catastrophic forgetting of previously seen classes/tasks as they are trained on new classes/tasks. This problem becomes even harder when some of the test classes do not belong to the training class set, i.e., the task incremental generalized zero-shot learning problem. We propose a novel approach to address the task incremental learning problem for both the non zero-shot and zero-shot settings. Our proposed approach, called Rectification-based Knowledge Retention (RKR), applies weight rectifications and affine transformations for adapting the model to any task. During testing, our approach can use the task label information (task-aware) to quickly adapt the network to that task. We also extend our approach to make it task-agnostic so that it can work even when the task label information is not available during testing. Specifically, given a continuum of test data, our approach predicts the task and quickly adapts the network to the predicted task. We experimentally show that our proposed approach achieves state-of-the-art results on several benchmark datasets for both non zero-shot and zero-shot task incremental learning.

Country

United Kingdom

Related Organizations

University of Bath
United Kingdom

Keywords

Continual Learning, /dk/atira/pure/subjectarea/asjc/1700/1702; name=Artificial Intelligence, /dk/atira/pure/subjectarea/asjc/1700/1703; name=Computational Theory and Mathematics, Deep Learning, Image Classification, Generalized Zero-Shot Classification, Task Incremental Learning, /dk/atira/pure/subjectarea/asjc/2600/2604; name=Applied Mathematics, /dk/atira/pure/subjectarea/asjc/1700/1712; name=Software, /dk/atira/pure/subjectarea/asjc/1700/1707; name=Computer Vision and Pattern Recognition

Impact byBIP!

	selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	1
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Average
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Average
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Average

Found an issue? Give us feedback

1

Average

Green

Fields of Science

engineering and technology

electrical engineering, electronic engineering, information engineering

Fields of Science

engineering and technology

electrical engineering, electronic engineering, information engineering