Trans-RAG: Query-Centric Vector Transformation for Secure Cross-Organizational Retrieval

📅 2026-04-10
📈 Citations: 0
Influential: 0
📄 PDF

career value

194K/year
🤖 AI Summary
This study addresses the challenge of simultaneously achieving security, accuracy, and efficiency in cross-organizational Retrieval-Augmented Generation (RAG) systems, where existing encrypted approaches require plaintext exposure upon decryption and federated methods suffer from fragmented resources and high overhead. The authors propose a secure retrieval framework grounded in a vector-space linguistic paradigm, introducing a novel query-centric multi-stage vector transformation mechanism (vector2Trans). This approach constructs a near-orthogonal semantic space and enables cross-organizational semantic alignment, facilitating efficient ciphertext retrieval without decryption. Experimental results across eight retrievers, three datasets, and three large language models demonstrate that the method incurs only a 3.5% drop in nDCG@10 while achieving a 99.81% isolation rate and an 89.90° inter-space angle, significantly outperforming homomorphic encryption schemes in computational efficiency.

Technology Category

Application Category

📝 Abstract
Retrieval Augmented Generation (RAG) systems deployed across organizational boundaries face fundamental tensions between security, accuracy, and efficiency. Current encryption methods expose plaintext during decryption, while federated architectures prevent resource integration and incur substantial overhead. We introduce Trans-RAG, implementing a novel vector space language paradigm where each organization's knowledge exists in a mathematically isolated semantic space. At the core lies vector2Trans, a multi-stage transformation technique that enables queries to dynamically"speak"each organization's vector space"language"through query-centric transformations, eliminating decryption overhead while maintaining native retrieval efficiency. Security evaluations demonstrate near-orthogonal vector spaces with 89.90{\deg} angular separation and 99.81% isolation rates. Experiments across 8 retrievers, 3 datasets, and 3 LLMs show minimal accuracy degradation (3.5% decrease in nDCG@10) and significant efficiency improvements over homomorphic encryption.
Problem

Research questions and friction points this paper is trying to address.

Retrieval Augmented Generation
cross-organizational retrieval
security
efficiency
vector space
Innovation

Methods, ideas, or system contributions that make the work stand out.

Trans-RAG
vector space transformation
secure cross-organizational retrieval
query-centric adaptation
semantic space isolation
🔎 Similar Papers
No similar papers found.
Y
Yu Liu
Institute of Information Engineering, Chinese Academy of Sciences, Beijing, China
K
Kun Peng
Institute of Information Engineering, Chinese Academy of Sciences, Beijing, China
Wenxiao Zhang
Wenxiao Zhang
University of Western Australia
LLMRoboticsEmbodied AISecurity
F
Fangfang Yuan
Institute of Information Engineering, Chinese Academy of Sciences, Beijing, China
C
Cong Cao
Institute of Information Engineering, Chinese Academy of Sciences, Beijing, China
W
Wenxuan Lu
Institute of Information Engineering, Chinese Academy of Sciences, Beijing, China
Y
Yanbing Liu
Institute of Information Engineering, Chinese Academy of Sciences, Beijing, China