Roots Beneath the Cut: Uncovering the Risk of Concept Revival in Pruning-Based Unlearning for Diffusion Models

📄 Summary

Pruning-based unlearning has emerged as a fast, training-free, and data-independent method for removing unwanted concepts from diffusion models. Its efficiency and robustness make it an attractive alternative to traditional fine-tuning or editing-based unlearning. However, a hidden danger lurks within this promising paradigm: the locations of the weights zeroed out during pruning can act as side-channel signals that leak critical information about the erased concepts. To verify this vulnerability, the authors design a novel attack framework that revives erased concepts from pruned diffusion models in a fully data-free and training-free manner. Experimental results confirm that this risk is real.

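To make the side channel concrete, here is a minimal NumPy sketch of the leakage mechanism described above. It is an illustration under an assumed threat model, not the paper's actual attack framework: the function names are invented, and it assumes the attacker holds the public base checkpoint, so zeroed positions in the pruned model pinpoint exactly which weights were erased and can simply be copied back.

```python
import numpy as np

def extract_side_channel_mask(w_pruned, w_base, atol=0.0):
    """Locate weights zeroed by pruning-based unlearning.

    Assumption (not from the paper): the attacker also holds the public
    base checkpoint `w_base`, so any position that is zero in the pruned
    model but nonzero in the base was almost certainly pruned. The zero
    pattern itself is the side-channel signal.
    """
    return (np.abs(w_pruned) <= atol) & (np.abs(w_base) > atol)

def revive_concept(w_pruned, w_base):
    """Toy data-free, training-free 'revival': copy the base model's
    values back into the pruned positions, undoing the unlearning."""
    mask = extract_side_channel_mask(w_pruned, w_base)
    return np.where(mask, w_base, w_pruned)

# --- toy demonstration on a random 4x4 weight matrix ---
rng = np.random.default_rng(0)
w_base = rng.normal(size=(4, 4))            # publicly released base weights
pruned_idx = np.abs(w_base) > 1.0           # pretend these weights encode the concept
w_pruned = np.where(pruned_idx, 0.0, w_base)  # unlearning zeroes them out

mask = extract_side_channel_mask(w_pruned, w_base)
w_revived = revive_concept(w_pruned, w_base)
```

The point of the sketch is that the pruned model alone already discloses *where* the erased knowledge lived; any extra reference (here, an assumed public base checkpoint) turns that location information into full recovery.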

