CURLoRA: Stable LLM Continual Fine-Tuning and Catastrophic Forgetting Mitigation

descriptionPublicationkeyboard_double_arrow_right Article , Preprint 01 Jan 2024Embargo end date: 01 Jan 2024Publisher:ZenodoJournal:CoRR, volume abs/2408.14572

Authors: Muhammad Fawi;

doi: 10.48550/arxiv.2408.14572 , 10.5281/zenodo.12730055 , 10.5281/zenodo.13376790

arXiv: 2408.14572

CURLoRA: Stable LLM Continual Fine-Tuning and Catastrophic Forgetting Mitigation

- Summary
- Subjects
- Metrics

Abstract

This paper introduces CURLoRA, a novel approach to fine-tuning large language models (LLMs) that leverages CUR matrix decomposition in the context of Low-Rank Adaptation (LoRA). Our method addresses two critical challenges in LLM fine-tuning: mitigating catastrophic forgetting during continual learning and reducing the number of trainable parameters. We propose a unique modification to the CUR decomposition process, utilizing inverted probabilities for column and row selection which acts as an implicit regularization, and initializing the $U$ matrix as a zero matrix, and only fine-tuning it. We demonstrate through experiments on multiple datasets that CURLoRA outperforms standard LoRA in mitigating catastrophic forgetting. It maintains model stability and performance across tasks while significantly reducing the number of trainable parameters. Our results show that CURLoRA achieves very good and stable task accuracy while maintaining base model's perplexity scores fixed compared to LoRA upon continual fine-tuning, particularly in scenarios with limited data.

Code available at https://github.com/MNoorFawi/curlora

Keywords

FOS: Computer and information sciences, Computer Science - Machine Learning, Artificial intelligence, Computer Science - Computation and Language, Large Language Models, Artificial Intelligence (cs.AI), Computer Science - Artificial Intelligence, Low-Rank Approximation, Catastrophic Forgetting, CUR Matrix Decomposition, Computation and Language (cs.CL), Machine Learning (cs.LG)

Impact byBIP!

	selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	0
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Average
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Average
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Average

Found an issue? Give us feedback

0

Average

Green

Related to Research communities

Knowmad Institut