ESG: Elastic Graphs for Range-Filtering Approximate k-Nearest Neighbor Search

📅 2025-04-05
📈 Citations: 0
✨ Influential: 0
🤖 AI Summary
This paper addresses the Range-Filtering Approximate k-Nearest Neighbor (RFAKNN) problem: efficiently retrieving the *k* approximate nearest neighbors of a query vector in high-dimensional space, subject to a numeric range constraint. Existing approaches incur an *O*(log *N*) query overhead because their index structures must split the query range into subranges that match it exactly. To overcome this, the authors propose an elastic graph index coupled with a range relaxation strategy. They prove that any query range can be covered by at most two elastic subranges, which eliminates the overhead of divide-and-conquer range matching. On real-world datasets, the design achieves a 1.5× to 6× speedup over state-of-the-art methods while maintaining high accuracy, giving a better trade-off between accuracy and efficiency for RFAKNN.
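To make the problem statement concrete, here is a minimal exact baseline in Python. It is not the paper's method: it scans every point, keeps those whose attribute lies in the query range, and returns the *k* closest. The function name `range_filtered_knn` and the `(vector, attribute)` data layout are assumptions made for illustration only.

```python
import heapq
import math


def range_filtered_knn(points, query_vec, lo, hi, k):
    """Exact brute-force reference for range-filtered k-NN.

    points: list of (vector, attribute) pairs (hypothetical layout).
    Returns the ids of the k in-range points closest to query_vec.
    """
    candidates = (
        (math.dist(vec, query_vec), idx)
        for idx, (vec, attr) in enumerate(points)
        if lo <= attr <= hi  # range filter on the numeric attribute
    )
    return [idx for _, idx in heapq.nsmallest(k, candidates)]


points = [
    ((0.0, 0.0), 5),
    ((1.0, 0.0), 12),
    ((0.0, 2.0), 20),
    ((3.0, 3.0), 15),
    ((0.1, 0.1), 40),
]
# Points 0 and 4 are closest to the origin but fall outside [10, 25].
result = range_filtered_knn(points, (0.0, 0.0), 10, 25, 2)
print(result)  # [1, 2]
```

An approximate method trades some of this exactness for speed, which is the accuracy-efficiency balance the summary refers to.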

📝 Abstract
Range-filtering approximate $k$-nearest neighbor (RFAKNN) search takes as input a vector and a numeric range, returning $k$ points from a database of $N$ high-dimensional points. The returned points must satisfy two criteria: their numeric values must lie within the specified query range, and they must be approximately the $k$ nearest points to the query vector. To strike a better balance between query accuracy and efficiency, we propose novel methods that relax the strict requirement for subranges to *exactly* match the query range. This elastic relaxation is based on a theoretical insight: allowing the controlled inclusion of out-of-range points during the search does not compromise the bounded complexity of the search process. Building on this insight, we prove that our methods reduce the number of required subranges to at most *two*, eliminating the $O(\log N)$ query overhead inherent in existing methods. Extensive experiments on real-world datasets demonstrate that our proposed methods outperform state-of-the-art approaches, achieving performance improvements of 1.5x to 6x while maintaining high accuracy.
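The contrast between exact subrange matching and a relaxed cover can be sketched on a toy structure. The Python below assumes points are sorted by attribute and indexed by rank `0..N-1`, with `N` a power of two and a balanced binary partition over ranks. `canonical_nodes` is the usual segment-tree decomposition, which can need on the order of $\log N$ nodes. `elastic_cover` is an illustrative construction, not necessarily the paper's: it returns at most two nodes whose union contains the query, each more than half in range. All names here are hypothetical.

```python
def canonical_nodes(l, r, lo, hi):
    """Exact decomposition of ranks [l, r] into tree nodes within [lo, hi]."""
    if l <= lo and hi <= r:
        return [(lo, hi)]
    mid = (lo + hi) // 2
    out = []
    if l <= mid:
        out += canonical_nodes(l, min(r, mid), lo, mid)
    if r > mid:
        out += canonical_nodes(max(l, mid + 1), r, mid + 1, hi)
    return out


def elastic_cover(l, r, lo, hi):
    """At most two tree nodes whose union contains [l, r] (toy construction)."""
    # Descend while the query fits entirely inside one child.
    while lo < hi:
        mid = (lo + hi) // 2
        if r <= mid:
            hi = mid
        elif l > mid:
            lo = mid + 1
        else:
            break  # the query straddles mid
    if lo == hi:
        return [(lo, hi)]
    mid = (lo + hi) // 2
    # Left part [l, mid]: shrink to the smallest right-aligned node containing l.
    a, b = lo, mid
    while a < b and l > (a + b) // 2:
        a = (a + b) // 2 + 1
    left = (a, b)
    # Right part [mid + 1, r]: shrink to the smallest left-aligned node containing r.
    a, b = mid + 1, hi
    while a < b and r <= (a + b) // 2:
        b = (a + b) // 2
    return [left, (a, b)]


def elastic_factor(l, r, cover):
    """Fraction of covered ranks that actually lie inside the query range."""
    return (r - l + 1) / sum(b - a + 1 for a, b in cover)


exact = canonical_nodes(1, 14, 0, 15)
cover = elastic_cover(1, 14, 0, 15)
print(len(exact), cover, elastic_factor(1, 14, cover))
# 6 [(0, 7), (8, 15)] 0.875
```

In this toy setting, searching the two relaxed nodes touches a few out-of-range points (ranks 0 and 15 here) in exchange for avoiding six separate exact-match searches.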
Problem

Research questions and friction points this paper is trying to address.

Improving the balance between accuracy and efficiency in range-filtering approximate k-nearest neighbor search
Reducing query overhead by relaxing strict range requirements
Enhancing performance while maintaining high search accuracy
Innovation

Methods, ideas, or system contributions that make the work stand out.

Elastic relaxation for range-filtering search
Reduces required subranges to at most two
Achieves 1.5x to 6x performance improvements over state-of-the-art methods while maintaining high accuracy
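One way to picture elastic relaxation during graph search is sketched below in Python, under stated assumptions. The graph is built over a subrange that is a superset of the query range. Out-of-range nodes are still visited for navigation, but only in-range nodes can be returned. This is a generic filtered best-first search, not the paper's ESG algorithm. The function name, the `ef` beam-width parameter, and the toy complete graph are all hypothetical.

```python
import heapq
import math


def filtered_graph_search(vectors, attrs, neighbors, entry, query, lo, hi, k, ef=8):
    """Best-first graph search that navigates all nodes but returns only in-range ones."""
    def d(i):
        return math.dist(vectors[i], query)

    visited = {entry}
    candidates = [(d(entry), entry)]  # min-heap: next node to expand
    nav = [(-d(entry), entry)]        # max-heap: ef closest nodes seen, in range or not
    in_range = []                     # (distance, id) of visited nodes passing the filter
    if lo <= attrs[entry] <= hi:
        in_range.append((d(entry), entry))

    while candidates:
        dist_c, c = heapq.heappop(candidates)
        if len(nav) >= ef and dist_c > -nav[0][0]:
            break  # nothing closer left to expand
        for n in neighbors[c]:
            if n in visited:
                continue
            visited.add(n)
            dist_n = d(n)
            if lo <= attrs[n] <= hi:
                in_range.append((dist_n, n))
            if len(nav) < ef or dist_n < -nav[0][0]:
                heapq.heappush(candidates, (dist_n, n))
                heapq.heappush(nav, (-dist_n, n))
                if len(nav) > ef:
                    heapq.heappop(nav)
    return [i for _, i in heapq.nsmallest(k, in_range)]


vectors = [(0.0, 0.0), (1.0, 0.0), (0.0, 2.0), (3.0, 3.0), (0.1, 0.1), (5.0, 5.0)]
attrs = [5, 12, 20, 15, 40, 18]
# Toy complete graph: every node links to every other node.
neighbors = {i: [j for j in range(len(vectors)) if j != i] for i in range(len(vectors))}
result = filtered_graph_search(vectors, attrs, neighbors, 5, (0.0, 0.0), 10, 25, 2)
print(result)  # [1, 2]
```

The more out-of-range nodes the subrange contains, the more navigation work is spent on points that can never be returned, which is why the inclusion has to be controlled.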
Authors

Mingyu Yang, The Hong Kong University of Science and Technology (Guangzhou), China
Wentao Li, University of Leicester, UK
Zhitao Shen, Ant Group (database, data storage)
Chuan Xiao, Associate Professor, Osaka University (Agent-Based Modeling, Computer Simulation, Data Preprocessing, Data Management, Data Science)
Wei Wang, The Hong Kong University of Science and Technology, China