基于语义路由的可逆终身模型编辑LoRA

📄 中文摘要

随着现实世界的动态演变,针对大型语言模型的模型编辑需求日益增加。现有方法虽然探讨了模块化隔离或参数高效策略,但在持续更新过程中仍然面临语义漂移或知识遗忘的问题。为了解决这些挑战,提出了一种基于语义路由的LoRA框架SoLA,实现终身模型编辑。在SoLA中,每次编辑被封装为一个独立的LoRA模块,训练后被冻结,并通过语义路由映射到输入,从而允许通过语义匹配动态激活LoRA模块。这一机制有效避免了由于聚类更新引起的语义漂移,并减轻了参数共享带来的灾难性遗忘。更重要的是,SoLA支持精确的模型编辑,提升了模型的灵活性和适应性。

📄 English Summary

Reversible Lifelong Model Editing via Semantic Routing-Based LoRA

The dynamic evolution of the real world necessitates model editing within Large Language Models. Existing methods, while exploring modular isolation or parameter-efficient strategies, still suffer from semantic drift or knowledge forgetting due to continual updates. To address these challenges, a Semantic routing-based LoRA framework, SoLA, is proposed for lifelong model editing. In SoLA, each edit is encapsulated as an independent LoRA module, which is frozen after training and mapped to input via semantic routing, allowing for dynamic activation of LoRA modules through semantic matching. This mechanism prevents semantic drift caused by cluster updates and mitigates catastrophic forgetting from parameter sharing. Importantly, SoLA supports precise model editing, enhancing the model's flexibility and adaptability.

Powered by Cloudflare Workers + Payload CMS + Claude 3.5

数据源: OpenAI, Google AI, DeepMind, AWS ML Blog, HuggingFace 等