FedRAG: A Framework for Fine-Tuning Retrieval-Augmented Generation Systems

๐Ÿ“… 2025-06-10
๐Ÿ“ˆ Citations: 0
โœจ Influential: 0
๐Ÿ“„ PDF

career value

198K/year
๐Ÿค– AI Summary
Current RAG systems lack a unified framework supporting end-to-end, joint fine-tuning of retrievers and generators under both centralized and federated architectures, while existing toolchains struggle to accommodate privacy-sensitive and data-isolated scenarios. This paper introduces FedRAGโ€”the first open-source framework enabling dual-mode (centralized/federated) end-to-end RAG fine-tuning, jointly optimizing dual-encoder retrievers and generator models via supervised fine-tuning. Its key contributions are: (1) a unified architectural abstraction with one-click mode switching; (2) deep integration of mainstream RAG libraries (e.g., LlamaIndex, Haystack), enabling modular, customizable pipelines; and (3) significant improvements in factual consistency and response quality across cross-domain, low-resource, and distributed settings. Experiments demonstrate FedRAGโ€™s superior training efficiency and robust performance across multiple benchmarks, while seamlessly integrating with existing RAG workflows.

Technology Category

Application Category

๐Ÿ“ Abstract
Retrieval-augmented generation (RAG) systems have been shown to be effective in addressing many of the drawbacks of relying solely on the parametric memory of large language models. Recent work has demonstrated that RAG systems can be improved via fine-tuning of their retriever and generator models. In this work, we introduce FedRAG, a framework for fine-tuning RAG systems across centralized and federated architectures. FedRAG supports state-of-the-art fine-tuning methods, offering a simple and intuitive interface and a seamless conversion from centralized to federated training tasks. FedRAG is also deeply integrated with the modern RAG ecosystem, filling a critical gap in available tools.
Problem

Research questions and friction points this paper is trying to address.

Fine-tuning RAG systems across centralized and federated architectures
Improving retriever and generator models in RAG systems
Integrating modern RAG ecosystem with fine-tuning tools
Innovation

Methods, ideas, or system contributions that make the work stand out.

Fine-tunes RAG systems across architectures
Supports state-of-the-art fine-tuning methods
Integrates deeply with modern RAG ecosystem
๐Ÿ”Ž Similar Papers
No similar papers found.