Neural Concept Verifier: Scaling Prover-Verifier Games via Concept Encodings

📅 2025-07-10

📈 Citations: 0

✨ Influential: 0

career value

232K/year

🤖 AI Summary

Existing Prover-Verifier Games (PVGs) struggle with high-dimensional inputs (e.g., images), while Concept Bottleneck Models (CBMs) rely on low-capacity linear predictors, limiting both interpretability and formal verifiability for nonlinear classification. Method: We propose Neural Concept Verifier (NCV), the first framework to tightly integrate PVG mechanisms with concept bottleneck modeling. NCV employs weakly supervised concept discovery to extract structured semantic concepts; a prover selects salient concepts, and a verifier performs nonlinear, concept-level classification and formal verification. Contribution/Results: NCV effectively mitigates shortcut learning, achieving superior classification accuracy and verifiability over conventional CBMs and pixel-level PVGs across multiple high-dimensional benchmarks. It advances both predictive performance and rigorous, concept-grounded verification for complex visual data.

Technology Category

Application Category

📝 Abstract

While Prover-Verifier Games (PVGs) offer a promising path toward verifiability in nonlinear classification models, they have not yet been applied to complex inputs such as high-dimensional images. Conversely, Concept Bottleneck Models (CBMs) effectively translate such data into interpretable concepts but are limited by their reliance on low-capacity linear predictors. In this work, we introduce the Neural Concept Verifier (NCV), a unified framework combining PVGs with concept encodings for interpretable, nonlinear classification in high-dimensional settings. NCV achieves this by utilizing recent minimally supervised concept discovery models to extract structured concept encodings from raw inputs. A prover then selects a subset of these encodings, which a verifier -- implemented as a nonlinear predictor -- uses exclusively for decision-making. Our evaluations show that NCV outperforms CBM and pixel-based PVG classifier baselines on high-dimensional, logically complex datasets and also helps mitigate shortcut behavior. Overall, we demonstrate NCV as a promising step toward performative, verifiable AI.

Problem

Research questions and friction points this paper is trying to address.

Extending Prover-Verifier Games to high-dimensional complex inputs

Combining concept encodings with nonlinear predictors for interpretability

Mitigating shortcut behavior in high-dimensional logically complex datasets

Innovation

Methods, ideas, or system contributions that make the work stand out.

Combines PVGs with concept encodings

Uses minimally supervised concept discovery

Employs nonlinear predictor for verification

🔎 Similar Papers

No similar papers found.