Efficient Algorithms for Adversarially Robust Approximate Nearest Neighbor Search

📅 2026-01-01
🏛️ arXiv.org
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This work addresses approximate nearest neighbor (ANN) search in adaptive adversarial settings, distinguishing between high-dimensional regimes (\(d = \omega(\sqrt{Q})\)) and low-dimensional ones (\(d = O(\sqrt{Q})\)). The paper proposes a novel algorithm that integrates differential privacy, fairness, and locality-sensitive hashing (LSH). Its key contributions include establishing the first theoretical connection between adaptive security and fairness, introducing a concentric-ring LSH structure that overcomes the traditional \(\sqrt{n}\) query time barrier, and incorporating robust decision primitives with metric covering techniques. In high dimensions, the method achieves information-theoretically secure and efficient search; in low dimensions, it guarantees— with high probability—“all-or-nothing” correctness for arbitrary queries, substantially outperforming existing approaches.

Technology Category

Application Category

📝 Abstract
We study the Approximate Nearest Neighbor (ANN) problem under a powerful adaptive adversary that controls both the dataset and a sequence of $Q$ queries. Primarily, for the high-dimensional regime of $d = \omega(\sqrt{Q})$, we introduce a sequence of algorithms with progressively stronger guarantees. We first establish a novel connection between adaptive security and \textit{fairness}, leveraging fair ANN search to hide internal randomness from the adversary with information-theoretic guarantees. To achieve data-independent performance, we then reduce the search problem to a robust decision primitive, solved using a differentially private mechanism on a Locality-Sensitive Hashing (LSH) data structure. This approach, however, faces an inherent $\sqrt{n}$ query time barrier. To break the barrier, we propose a novel concentric-annuli LSH construction that synthesizes these fairness and differential privacy techniques. The analysis introduces a new method for robustly releasing timing information from the underlying algorithm instances and, as a corollary, also improves existing results for fair ANN. In addition, for the low-dimensional regime $d = O(\sqrt{Q})$, we propose specialized algorithms that provide a strong ``for-all''guarantee: correctness on \textit{every} possible query with high probability. We introduce novel metric covering constructions that simplify and improve prior approaches for ANN in Hamming and $\ell_p$ spaces.
Problem

Research questions and friction points this paper is trying to address.

Adversarially Robust
Approximate Nearest Neighbor
Adaptive Adversary
High-dimensional Search
Fairness
Innovation

Methods, ideas, or system contributions that make the work stand out.

adversarially robust
fairness
differential privacy
locality-sensitive hashing
metric covering
🔎 Similar Papers
No similar papers found.
Alexandr Andoni
Alexandr Andoni
Associate Professor, Columbia University
AlgorithmsTheoretical Computer Science
T
Themistoklis Haris
Computer Science Department, Boston University
E
Esty Kelman
Department of Computer Science and Department of Computing & Data Sciences, Boston University and CSAIL, Massachusetts Institute of Technology
Krzysztof Onak
Krzysztof Onak
Boston University
Algorithms