🤖 AI Summary
This study investigates whether AI systems can genuinely discover novel knowledge through iterative self-improvement and elucidates their fundamental limitations and associated costs. To this end, the authors propose the NOVA framework, which models the “generate–verify–accumulate–retrain” loop as an adaptive sampling process over a knowledge space. The work formally characterizes the theoretical limits of AI-driven knowledge discovery for the first time, identifying four primary failure modes—including the contamination trap—and establishes a power-law scaling law for cumulative discovery cost. Under the assumption of a Zipf-distributed knowledge space with exponent α > 1, the cumulative cost of obtaining D verified discoveries scales as Θ(c_gen D^α). The analysis further highlights the critical role of expert intervention in overcoming exploration bottlenecks inherent to autonomous discovery processes.
📝 Abstract
Can AI systems discover genuinely new knowledge through iterative self improvement, and if so, at what cost? We introduce the NOVA framework, which models the common ``generate, verify, accumulate, retrain''loop as an adaptive sampling process over a knowledge space. We identify sufficient conditions under which accumulated genuine knowledge eventually covers a finite domain, and show how their violations produce distinct failure modes: contamination, forgetting, exploration failure, and acceptance failure. We then analyze imperfect verification and identify a contamination trap: as easy-to-find knowledge is exhausted, the model mass assigned to new valid artifacts shrinks, so even small false-positive rates can cause invalid artifacts to enter the knowledge base faster than genuine discoveries. We clarify that Good--Turing estimation is a local batch-diversity diagnostic, not an estimator of the historically undiscovered valid mass that governs long-term discovery. Under a separate tail-equivalence assumption relating the model's effective discovery distribution to a Zipf law with exponent $\alpha>1$, we prove that the cumulative generation cost required to obtain $D$ distinct genuine discoveries satisfies $R_{\mathrm{cum}}(D)=\Theta(c_{\mathrm{gen}}D^\alpha)$, where $c_{\mathrm{gen}}$ is the per-candidate generation cost. This scaling law quantifies asymptotic diminishing returns as the discovery frontier advances. Finally, we formalize human amplification through guidance, generation, and verification, explaining why expert input is most valuable near autonomous exploration barriers.