A Query-Driven Approach to Space-Efficient Range Searching

📅 2025-02-19
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This work addresses the problem of constructing space-efficient partition trees for range search under unknown query distributions accessible only via sampling, aiming to minimize expected query cost—defined as the average number of visited nodes multiplied by per-node processing overhead. We propose a query-driven tree construction paradigm: first, we formulate node partitioning as a geometric separation problem and introduce sparse geometric separators to minimize the expected number of query “piercings”; second, we cast node routing decisions as lightweight classification tasks, accelerated in real time via shallow neural networks. Theoretically, our approach guarantees near-optimal tree construction with near-linear sample complexity. Experiments demonstrate substantial reductions in both average node accesses and end-to-end query latency, yielding significant improvements in overall query efficiency.

Technology Category

Application Category

📝 Abstract
We initiate a study of a query-driven approach to designing partition trees for range-searching problems. Our model assumes that a data structure is to be built for an unknown query distribution that we can access through a sampling oracle, and must be selected such that it optimizes a meaningful performance parameter on expectation. Our first contribution is to show that a near-linear sample of queries allows the construction of a partition tree with a near-optimal expected number of nodes visited during querying. We enhance this approach by treating node processing as a classification problem, leveraging fast classifiers like shallow neural networks to obtain experimentally efficient query times. Our second contribution is to develop partition trees using sparse geometric separators. Our preprocessing algorithm, based on a sample of queries, builds a balanced tree with nodes associated with separators that minimize query stabs on expectation; this yields both fast processing of each node and a small number of visited nodes, significantly reducing query time.
Problem

Research questions and friction points this paper is trying to address.

Designing partition trees for range-searching problems.
Optimizing performance with unknown query distributions.
Reducing query time using sparse geometric separators.
Innovation

Methods, ideas, or system contributions that make the work stand out.

Query-driven partition tree design
Shallow neural networks for classification
Sparse geometric separators optimization
🔎 Similar Papers
No similar papers found.