๐ค AI Summary
This work investigates how to quantify and reduce the query burden on human supervisors when verifying complex computational tasks within AI debate frameworks. It introduces Debate Query Complexity (DQC) as a formal measure of the minimal information a human must inspect to make correct judgments, and for the first time systematically connects supervision costs in AI debates to computational complexity theory. The main contributions include proving that problems in PSPACE/poly correspond precisely to those with DQC O(log n), establishing a tight relationship between DQC and circuit complexity by showing DQC โค log(s) + 3 where s is the circuit size, and demonstrating that improving DQC lower bounds would yield new circuit complexity lower boundsโthereby revealing a profound link between AI safety and fundamental limits in computational complexity.
๐ Abstract
AI safety via debate uses two competing models to help a human judge verify complex computational tasks. Previous work has established what problems debate can solve in principle, but has not analysed the practical cost of human oversight: how many queries must the judge make to the debate transcript? We introduce Debate Query Complexity}(DQC), the minimum number of bits a verifier must inspect to correctly decide a debate. Surprisingly, we find that PSPACE/poly (the class of problems which debate can efficiently decide) is precisely the class of functions decidable with O(log n) queries. This characterisation shows that debate is remarkably query-efficient: even for highly complex problems, logarithmic oversight suffices. We also establish that functions depending on all their input bits require Omega(log n) queries, and that any function computable by a circuit of size s satisfies DQC(f)<= log(s) + 3. Interestingly, this last result implies that proving DQC lower bounds of log(n) + 6 for languages in P would yield new circuit lower bounds, connecting debate query complexity to central questions in circuit complexity.