KMS
ATLAS: Adapter-Based Multi-Modal Continual Learning with a Two-Stage Learning Strategy
2024-10-14
状态已发表
摘要While vision-and-language models significantly advance in many fields, the challenge of continual learning is unsolved. Parameter-efficient modules like adapters and prompts present a promising way to alleviate catastrophic forgetting. However, existing works usually learn individual adapters for each task, which may result in redundant knowledge among adapters. Moreover, they continue to use the original pre-trained model to initialize the downstream model, leading to negligible changes in the model's generalization compared to the original model. In addition, there is still a lack of research investigating the consequences of integrating a multi-modal model into the updating procedure for both uni-modal and multi-modal tasks and the subsequent impacts it has on downstream tasks. In this paper, we propose an adapter-based two-stage learning paradigm, a multi-modal continual learning scheme that consists of experience-based learning and novel knowledge expansion, which helps the model fully use experience knowledge and compensate for novel knowledge. Extensive experiments demonstrate that our method is proficient for continual learning. It expands the distribution of representation upstream while also minimizing the negative impact of forgetting previous tasks. Additionally, it enhances the generalization capability for downstream tasks. Furthermore, we incorporate both multi-modal and uni-modal tasks into upstream continual learning. We observe that learning from upstream tasks can help with downstream tasks.
语种英语
DOIarXiv:2410.10923
相关网址查看原文
出处Arxiv
收录类别PPRN.PPRN
WOS记录号PPRN:113088837
WOS类目Computer Science, Artificial Intelligence ; Computer Science, Software Engineering
文献类型预印本
条目标识符https://kms.shanghaitech.edu.cn/handle/2MSLDSTB/446058
专题上海科技大学
通讯作者Huang, Weiran
作者单位
1.Shanghai Jiao Tong Univ, Qing Yuan Res Inst, MIFA Lab, SEIEE, Shanghai, Peoples R China
2.Tsinghua Univ, Dept Math Sci, Beijing, Peoples R China
3.Lin Gang Lab, Shanghai, Peoples R China
4.ShanghaiTech Univ, Shanghai, Peoples R China
推荐引用方式
GB/T 7714
Li, Hong,Tan, Zhiquan,Li, Xingyu,et al. ATLAS: Adapter-Based Multi-Modal Continual Learning with a Two-Stage Learning Strategy. 2024.
条目包含的文件
条目无相关文件。
个性服务
查看访问统计
谷歌学术
谷歌学术中相似的文章
[Li, Hong]的文章
[Tan, Zhiquan]的文章
[Li, Xingyu]的文章
百度学术
百度学术中相似的文章
[Li, Hong]的文章
[Tan, Zhiquan]的文章
[Li, Xingyu]的文章
必应学术
必应学术中相似的文章
[Li, Hong]的文章
[Tan, Zhiquan]的文章
[Li, Xingyu]的文章
相关权益政策
暂无数据
收藏/分享
所有评论 (0)
暂无评论
 

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。