Approximate Nearest Neighbor Search for Modern AI: A Projection-Augmented Graph Approach

๐Ÿ“… 2026-03-01
๐Ÿ“ˆ Citations: 0
โœจ Influential: 0
๐Ÿ“„ PDF
๐Ÿค– AI Summary
Existing approximate nearest neighbor (ANN) search methods struggle to simultaneously achieve high query efficiency, fast indexing, low memory consumption, scalability to high dimensions, robustness across varying data scales, and support for online insertionsโ€”key requirements in modern AI applications. This work proposes the Projection-Augmented Graph (PAG) framework, which uniquely integrates projection mechanisms into graph-based indexing. By leveraging statistically grounded asymmetric distance comparisons, PAG drastically reduces the need for exact distance computations while jointly optimizing index construction and query processing within a unified architecture. The framework natively supports online insertions and, across six real-world datasets, demonstrates up to 5ร— higher QPS-recall performance than HNSW, while maintaining efficient indexing, moderate memory overhead, and consistent robustness in both high-dimensional and large-scale settings.

Technology Category

Application Category

๐Ÿ“ Abstract
Approximate Nearest Neighbor Search (ANNS) is fundamental to modern AI applications. Most existing solutions optimize query efficiency but fail to align with the practical requirements of modern workloads. In this paper, we outline six critical demands of modern AI applications: high query efficiency, fast indexing, low memory footprint, scalability to high dimensionality, robustness across varying retrieval sizes, and support for online insertions. To satisfy all these demands, we introduce Projection-Augmented Graph (PAG), a new ANNS framework that integrates projection techniques into a graph index. PAG reduces unnecessary exact distance computations through asymmetric comparisons between exact and approximate distances as guided by projection-based statistical tests. Three key components are designed and unified to the graph index to optimize indexing and searching. Experiments on six modern datasets demonstrate that PAG consistently achieves superior query per second (QPS)-recall performance -- up to 5x faster than HNSW -- while offering fast indexing speed and moderate memory footprint. PAG remains robust as dimensionality and retrieval size increase and naturally supports online insertions.
Problem

Research questions and friction points this paper is trying to address.

Approximate Nearest Neighbor Search
modern AI applications
query efficiency
high dimensionality
online insertions
Innovation

Methods, ideas, or system contributions that make the work stand out.

Approximate Nearest Neighbor Search
Projection-Augmented Graph
Asymmetric Distance Comparison
Online Insertion
High-Dimensional Scalability
๐Ÿ”Ž Similar Papers
No similar papers found.