Optimal Static Dictionary with Worst-Case Constant Query Time

📅 2024-12-14
🏛️ arXiv.org
📈 Citations: 0
Influential: 0
📄 PDF

career value

200K/year
🤖 AI Summary
This work addresses the long-standing trade-off between space efficiency and deterministic query performance in static dictionaries. We propose a novel data structure achieving near-information-theoretic optimal space—namely, OPT + n^ε bits—and worst-case O(1) query time. Our construction integrates hierarchical hashing, combinatorial design, precomputed lookup tables, and information-theoretic encoding, augmented by randomized construction techniques. For the full universe [U] and value domain [σ], it is the first to simultaneously achieve OPT + polylog n bits of space and strictly constant-time queries. In contrast to prior approaches—such as Patrascu (2008) and Yu (2020)—which only guarantee expected constant-time queries or incur larger space overhead, our result breaks the conventional impossibility barrier between deterministic low-latency access and near-optimal space usage. This establishes a new theoretical and practical benchmark for static dictionary design.

Technology Category

Application Category

📝 Abstract
In this paper, we design a new succinct static dictionary with worst-case constant query time. A dictionary data structure stores a set of key-value pairs with distinct keys in $[U]$ and values in $[sigma]$, such that given a query $xin [U]$, it quickly returns if $x$ is one of the input keys, and if so, also returns its associated value. The textbook solution to dictionaries is hash tables. On the other hand, the (information-theoretical) optimal space to encode such a set of key-value pairs is only $ ext{OPT} := loginom{U}{n}+nlog sigma$. We construct a dictionary that uses $ ext{OPT} + n^{epsilon}$ bits of space, and answers queries in constant time in worst case. Previously, constant-time dictionaries are only known with $ ext{OPT} + n/ ext{poly}log n$ space [Pv{a}trac{s}cu 2008], or with $ ext{OPT}+n^{epsilon}$ space but expected constant query time [Yu 2020]. We emphasize that most of the extra $n^{epsilon}$ bits are used to store a lookup table that does not depend on the input, and random bits for hash functions. The"main"data structure only occupies $ ext{OPT}+ ext{poly}log n$ bits.
Problem

Research questions and friction points this paper is trying to address.

Design succinct static dictionary with worst-case constant query time
Achieve optimal space usage with minimal extra bits
Improve upon previous space-time tradeoffs in dictionary structures
Innovation

Methods, ideas, or system contributions that make the work stand out.

Succinct static dictionary with constant query time
Optimal space usage with minimal extra bits
Lookup table and hash functions for efficiency
🔎 Similar Papers