Beyond the Covariance Trap: Unlocking Generalization in Same-Subject Knowledge Editing for Large Language Models

📅 2026-03-16
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This work addresses the limited generalization of large language models after subject-centric knowledge editing, where updated knowledge is only accurately recalled under the exact phrasing used during editing and fails to transfer to diverse instruction-following prompts. The authors identify a “covariance trap” caused by covariance constraints that destabilize the geometry of the activation space. To mitigate this, they propose a robust knowledge editing method that minimizes representational drift through isotropic geometric alignment and smooths the optimization landscape via a hierarchical knowledge fusion mechanism, thereby avoiding sharp minima. This approach substantially enhances the model’s ability to generalize edited knowledge across varied prompts, offering a novel pathway toward reliable, interactive parametric memory.

Technology Category

Application Category

📝 Abstract
While locate-then-edit knowledge editing efficiently updates knowledge encoded within Large Language Models (LLMs), a critical generalization failure mode emerges in the practical same-subject knowledge editing scenario: models fail to recall the updated knowledge when following user instructions, despite successfully recalling it in the original edited form. This paper identifies the geometric root of this generalization collapse as a fundamental conflict where the inner activation drifts induced by prompt variations exceed the model's geometric tolerance for generalization after editing. We attribute this instability to a dual pathology: (1) The joint optimization with orthogonal gradients collapses solutions into sharp minima with narrow stability, and (2) the standard covariance constraint paradoxically acts as a Covariance Trap that amplifies input perturbations. To resolve this, we introduce RoSE (Robust Same-subject Editing), which employs Isotropic Geometric Alignment to minimize representational deviation and Hierarchical Knowledge Integration to smooth the optimization landscape. Extensive experiments demonstrate that RoSE significantly improves instruction-following capabilities, laying the foundation for robust interactive parametric memory of LLM agents.
Problem

Research questions and friction points this paper is trying to address.

knowledge editing
generalization
large language models
same-subject editing
instruction following
Innovation

Methods, ideas, or system contributions that make the work stand out.

Same-subject Knowledge Editing
Generalization Collapse
Covariance Trap
Isotropic Geometric Alignment
Hierarchical Knowledge Integration
🔎 Similar Papers
No similar papers found.
X
Xiyu Liu
Institute of Information Engineering, Chinese Academy of Sciences, Beijing, China; School of Cyber Security, University of Chinese Academy of Sciences, Beijing, China
Q
Qingyi Si
JD.com, Beijing, China
Z
Zhengxiao Liu
Institute of Information Engineering, Chinese Academy of Sciences, Beijing, China; School of Cyber Security, University of Chinese Academy of Sciences, Beijing, China
Chenxu Yang
Chenxu Yang
Institute of Information Engineering, Chinese Academy of Sciences
NLPDialogue Generation
N
Naibin Gu
Institute of Information Engineering, Chinese Academy of Sciences, Beijing, China; School of Cyber Security, University of Chinese Academy of Sciences, Beijing, China
Zheng Lin
Zheng Lin
Institute of Information Engineering, CAS
NLP