AdapterSwap: Continuous Training of LLMs with Data Removal and Access-Control Guarantees

📅 2024-04-12

🏛️ arXiv.org

📈 Citations: 1

✨ Influential: 0

career value

156K/year

🤖 AI Summary

To address the triple challenges of knowledge updating, access control, and verifiable forgetting faced by large language models (LLMs) in dynamic data environments, this paper proposes a modular knowledge management framework based on dynamic adapter swapping. The method integrates low-rank adaptation (LoRA), incremental training, access-policy embedding, and forgetting verification to enable runtime, on-demand composition and fine-grained governance of knowledge. Key contributions include: (i) the first support for millisecond-level, user-specific data view switching; (ii) document-level verifiable knowledge deletion—achieving near-zero recall post-deletion; and (iii) negligible performance degradation (<1.2% accuracy drop) on prior tasks, substantially mitigating catastrophic forgetting. Experiments demonstrate that the framework simultaneously preserves model stability, ensures data controllability and update efficiency, and satisfies privacy compliance requirements—establishing a novel paradigm for continual learning and trustworthy LLM deployment.

Technology Category

Application Category

📝 Abstract

Large language models (LLMs) are increasingly capable of completing knowledge intensive tasks by recalling information from a static pretraining corpus. Here we are concerned with LLMs in the context of evolving data requirements. For instance: batches of new data that are introduced periodically; subsets of data with user-based access controls; or requirements on dynamic removal of documents with guarantees that associated knowledge cannot be recalled. We wish to satisfy these requirements while at the same time ensuring a model does not forget old information when new data becomes available. To address these issues, we introduce AdapterSwap, a training and inference scheme that organizes knowledge from a data collection into a set of low-rank adapters, which are dynamically composed during inference. Our experiments demonstrate AdapterSwap's ability to support efficient continual learning, while also enabling organizations to have fine-grained control over data access and deletion.

Problem

Research questions and friction points this paper is trying to address.

continuous training of LLMs

data removal guarantees

access-control guarantees

Innovation

Methods, ideas, or system contributions that make the work stand out.

Dynamic low-rank adapter composition

Efficient continual learning support

Fine-grained data access control

🔎 Similar Papers

Preserving Privacy in Large Language Models: A Survey on Current Threats and Solutions