Jaguar: A Primal Algorithm for Conjunctive Query Evaluation in Submodular-Width Time

๐Ÿ“… 2026-03-13
๐Ÿ“ˆ Citations: 0
โœจ Influential: 0
๐Ÿ“„ PDF
๐Ÿค– AI Summary
This work addresses the problem of efficiently evaluating conjunctive queries (CQs) within time complexity approaching the theoretical lower bound. To this end, it introduces Jaguar, a native-space algorithm that dynamically controls computational cost directly on the input query through information-theoretically guided polyhedral modeling, adaptive join scheduling, and guided query decomposition. Jaguar is the first native-space algorithm to achieve a running time of \(O(N^{\text{subw}(Q) + \varepsilon})\) for any Boolean CQ \(Q\) and arbitrary \(\varepsilon > 0\), thereby asymptotically matching the submodular-width lower bound. The approach also supports a generalized formulation incorporating degree constraints. Compared to existing methods, Jaguarโ€™s design and analysis are significantly simplified while preserving optimality.

Technology Category

Application Category

๐Ÿ“ Abstract
The submodular width is a complexity measure of conjunctive queries (CQs), which assigns a nonnegative real number, subw(Q), to each CQ Q. An existing algorithm, called PAND, performs CQ evaluation in polynomial time where the exponent is essentially subw(Q). Formally, for every Boolean CQ Q, PANDA evaluates Q in time $O(N^{\mathsf{subw}(Q)} \cdot \mathsf{polylog}(N))$, where N denotes the input size; moreover, there is complexity-theoretic evidence that, for a number of Boolean CQs, no exponent strictly below subw(Q) can be achieved by combinatorial algorithms. On a high level, the submodular width of a CQ Q can be described as the maximum over all polymatroids, which are set functions on the variables of Q that satisfy Shannon inequalities. The PANDA algorithm in a sense works in the dual space of this maximization problem, makes use of information theory, and transforms a CQ into a set of disjunctive datalog programs which are individually solved. In this article, we introduce a new algorithm for CQ evaluation which achieves, for each Boolean CQ Q and for all epsilon > 0, a running time of $O(N^{\mathsf{subw}(Q)+ฮต})$. This new algorithm's description and analysis are, in our view, significantly simpler than those of PANDA. We refer to it as a "primal" algorithm as it operates in the primal space of the described maximization problem, by maintaining a feasible primal solution, namely, a polymatroid. Indeed, this algorithm deals directly with the input CQ and adaptively computes a sequence of joins, in a guided fashion, so that the cost of these join computations is bounded. Additionally, this algorithm can achieve the stated runtime for the generalization of the submodular width incorporating degree constraints. We dub our algorithm Jaguar, as it is a join-adaptive guided algorithm.
Problem

Research questions and friction points this paper is trying to address.

conjunctive query
submodular width
query evaluation
degree constraints
polymatroid
Innovation

Methods, ideas, or system contributions that make the work stand out.

conjunctive query evaluation
submodular width
primal algorithm
polymatroid
adaptive join
๐Ÿ”Ž Similar Papers
2024-04-15Annual Meeting of the Association for Computational LinguisticsCitations: 4