
pmid: 36449592
In the task incremental learning problem, deep learning models suffer from catastrophic forgetting of previously seen classes/tasks as they are trained on new classes/tasks. This problem becomes even harder when some of the test classes do not belong to the training class set, i.e., the task incremental generalized zero-shot learning problem. We propose a novel approach to address the task incremental learning problem for both the non zero-shot and zero-shot settings. Our proposed approach, called Rectification-based Knowledge Retention (RKR), applies weight rectifications and affine transformations for adapting the model to any task. During testing, our approach can use the task label information (task-aware) to quickly adapt the network to that task. We also extend our approach to make it task-agnostic so that it can work even when the task label information is not available during testing. Specifically, given a continuum of test data, our approach predicts the task and quickly adapts the network to the predicted task. We experimentally show that our proposed approach achieves state-of-the-art results on several benchmark datasets for both non zero-shot and zero-shot task incremental learning.
Continual Learning, /dk/atira/pure/subjectarea/asjc/1700/1702; name=Artificial Intelligence, /dk/atira/pure/subjectarea/asjc/1700/1703; name=Computational Theory and Mathematics, Deep Learning, Image Classification, Generalized Zero-Shot Classification, Task Incremental Learning, /dk/atira/pure/subjectarea/asjc/2600/2604; name=Applied Mathematics, /dk/atira/pure/subjectarea/asjc/1700/1712; name=Software, /dk/atira/pure/subjectarea/asjc/1700/1707; name=Computer Vision and Pattern Recognition
Continual Learning, /dk/atira/pure/subjectarea/asjc/1700/1702; name=Artificial Intelligence, /dk/atira/pure/subjectarea/asjc/1700/1703; name=Computational Theory and Mathematics, Deep Learning, Image Classification, Generalized Zero-Shot Classification, Task Incremental Learning, /dk/atira/pure/subjectarea/asjc/2600/2604; name=Applied Mathematics, /dk/atira/pure/subjectarea/asjc/1700/1712; name=Software, /dk/atira/pure/subjectarea/asjc/1700/1707; name=Computer Vision and Pattern Recognition
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 1 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Average | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Average | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Average |
