The Complexity of Finding Missing Answer Repairs

šŸ“… 2026-01-11
šŸ›ļø arXiv.org
šŸ“ˆ Citations: 0
✨ Influential: 0
šŸ“„ PDF
šŸ¤– AI Summary
This study investigates the existence and computational complexity of missing query answers in database repair. Focusing on unions of conjunctive queries with negated atoms, it delineates the boundary of polynomial-time solvability for repair problems under combined complexity when weak projection and selection are permitted, and demonstrates that the coexistence of projection and join induces NP-hardness as well as set-cover-type approximation hardness. The main contributions include establishing the OptP[log n]-completeness of computing the size of a minimal repair and an EXP lower bound in the recursive setting; proving that semi-positive Datalog programs can identify minimal repairs in polynomial time under data complexity; and presenting a repair construction algorithm requiring only O(n²) calls to an NP oracle.

Technology Category

Application Category

šŸ“ Abstract
We investigate the problem of identifying database repairs for missing tuples in query answers. We show that when the query is part of the input - the combined complexity setting - determining whether or not a repair exists is polynomial-time is equivalent to the satisfiability problem for classes of queries admitting a weak form of projection and selection. We then identify the sub-classes of unions of conjunctive queries with negated atoms, defined by the relational algebra operations permitted to appear in the query, for which the minimal repair problem can be solved in polynomial time. In contrast, we show that the problem is NP-hard, as well as set cover-hard to approximate via strict reductions, whenever both projection and join are permitted in the input query. Additionally, we show that finding the size of a minimal repair for unions of conjunctive queries (with negated atoms permitted) is OptP[log(n)]-complete, while computing a minimal repair is possible with O($n^2$) queries to an NP oracle. With recursion permitted, the combined complexity of all of these variants increases significantly, with an EXP lower bound. However, from the data complexity perspective, we show that minimal repairs can be identified in polynomial time for all queries expressible as semi-positive datalog programs.
Problem

Research questions and friction points this paper is trying to address.

missing answer repairs
database repair
conjunctive queries
data complexity
combined complexity
Innovation

Methods, ideas, or system contributions that make the work stand out.

database repair
missing answers
conjunctive queries with negation
combined complexity
data complexity
šŸ”Ž Similar Papers
No similar papers found.
J
Jesse Comer
University of Pennsylvania, United States
Val Tannen
Val Tannen
University of Pennsylvania