AdapterSwap: Continuous Training of LLMs with Data Removal and Access-Control Guarantees

📅 2024-04-12
🏛️ arXiv.org
📈 Citations: 1
Influential: 0
📄 PDF
🤖 AI Summary
To address the triple challenges of knowledge updating, access control, and verifiable forgetting faced by large language models (LLMs) in dynamic data environments, this paper proposes a modular knowledge management framework based on dynamic adapter swapping. The method integrates low-rank adaptation (LoRA), incremental training, access-policy embedding, and forgetting verification to enable runtime, on-demand composition and fine-grained governance of knowledge. Key contributions include: (i) the first support for millisecond-level, user-specific data view switching; (ii) document-level verifiable knowledge deletion—achieving near-zero recall post-deletion; and (iii) negligible performance degradation (<1.2% accuracy drop) on prior tasks, substantially mitigating catastrophic forgetting. Experiments demonstrate that the framework simultaneously preserves model stability, ensures data controllability and update efficiency, and satisfies privacy compliance requirements—establishing a novel paradigm for continual learning and trustworthy LLM deployment.

Technology Category

Application Category

📝 Abstract
Large language models (LLMs) are increasingly capable of completing knowledge intensive tasks by recalling information from a static pretraining corpus. Here we are concerned with LLMs in the context of evolving data requirements. For instance: batches of new data that are introduced periodically; subsets of data with user-based access controls; or requirements on dynamic removal of documents with guarantees that associated knowledge cannot be recalled. We wish to satisfy these requirements while at the same time ensuring a model does not forget old information when new data becomes available. To address these issues, we introduce AdapterSwap, a training and inference scheme that organizes knowledge from a data collection into a set of low-rank adapters, which are dynamically composed during inference. Our experiments demonstrate AdapterSwap's ability to support efficient continual learning, while also enabling organizations to have fine-grained control over data access and deletion.
Problem

Research questions and friction points this paper is trying to address.

continuous training of LLMs
data removal guarantees
access-control guarantees
Innovation

Methods, ideas, or system contributions that make the work stand out.

Dynamic low-rank adapter composition
Efficient continual learning support
Fine-grained data access control
🔎 Similar Papers
No similar papers found.