ELASTIC: Energy-based latent space alignment with second-order influences of memory selection for continual learning
Author affiliations
DOI:
https://doi.org/10.15625/1813-9663/22071Keywords:
Replay-based continual learning, influence functions, energy-based model.Abstract
Continual learning models encounter a significant challenge, namely catastrophic forgetting. After training on a new task they forget previously learned knowledge, causing a significant drop in models’ performance on previous tasks. Replay-based methods, the most efficient approach for addressing catastrophic forgetting, select a small number of data (the coreset) from each learned task to store into an episodic memory buffer for rehearsal during subsequent stages. This selection significantly impacts the models’ ability to perform well on new tasks while maintaining performance on previous tasks. In this paper, we propose a two-phase method to address catastrophic forgetting in continual learning. The first phase utilizes the second-order influence function to select an effective coreset from previously learned tasks. Even with this effective selection, there is still a problem that the hidden feature space is unexpectedly transformed across each task, causing the model to forget optimal hidden representations on previously learned tasks. To address this issue, the second phase employs an energy-based latent aligner (ELI) to re-align the hidden feature representations of tasks towards the optimal region. Extensive experiments on three continual learning benchmark datasets, i.e. CIFAR-10, CIFAR-100, and Split miniImageNet, demonstrate that our proposed method outperforms several existing state-of-the-art continual learning models.
Downloads
Published
How to Cite
Issue
Section
License
1. We hereby assign copyright of our article (the Work) in all forms of media, whether now known or hereafter developed, to the Journal of Computer Science and Cybernetics. We understand that the Journal of Computer Science and Cybernetics will act on my/our behalf to publish, reproduce, distribute and transmit the Work.2. This assignment of copyright to the Journal of Computer Science and Cybernetics is done so on the understanding that permission from the Journal of Computer Science and Cybernetics is not required for me/us to reproduce, republish or distribute copies of the Work in whole or in part. We will ensure that all such copies carry a notice of copyright ownership and reference to the original journal publication.
3. We warrant that the Work is our results and has not been published before in its current or a substantially similar form and is not under consideration for another publication, does not contain any unlawful statements and does not infringe any existing copyright.
4. We also warrant that We have obtained the necessary permission from the copyright holder/s to reproduce in the article any materials including tables, diagrams or photographs not owned by me/us.
						
			
		
			
			
				
