🤖 AI Summary
This paper resolves a long-standing open problem regarding the space complexity of dynamic filters: whether an optimal-space construction exists that avoids fingerprinting techniques. By integrating combinatorial mathematics and information theory, we establish—for the first time—that the space lower bound implicitly imposed by fingerprinting is in fact a universal lower bound for *all* dynamic filters supporting insertions, deletions, and low-false-positive membership queries. Our core contribution is a tight space lower bound in the regime where the false positive rate $varepsilon o 0$: $n log varepsilon^{-1} + n log e - o(n)$ bits. This bound is independent of query/update time and asymptotically optimal. Consequently, the result settles the theoretical debate on the necessity of fingerprinting, demonstrating its inevitability for space-optimality, and establishes a fundamental limit on the succinct representation of dynamic sets.
📝 Abstract
Dynamic filters are data structures supporting approximate membership queries to a dynamic set $S$ of $n$ keys, allowing a small false-positive error rate $varepsilon$, under insertions and deletions to the set $S$. Essentially all known constructions for dynamic filters use a technique known as fingerprinting. This technique, which was first introduced by Carter et al. in 1978, inherently requires $$log inom{n varepsilon^{-1}}{n} = n log varepsilon^{-1} + n log e - o(n)$$ bits of space when $varepsilon = o(1)$. Whether or not this bound is optimal for all dynamic filters (rather than just for fingerprint filters) has remained for decades as one of the central open questions in the area. We resolve this question by proving a sharp lower bound of $n log varepsilon^{-1} + n log e - o(n)$ bits for $varepsilon = o(1)$, regardless of operation time.