Active Multiple-Prediction-Powered Inference

📅 2026-05-08

📈 Citations: 0

✨ Influential: 0

career value

167K/year

🤖 AI Summary

This work addresses the challenge of post-deployment monitoring in medical AI, where acquiring clinical gold-standard labels is costly and existing methods support only single predictors, limiting applicability in multi-model collaborative settings. The authors propose AM-PPI, a framework that, under a unified annotation budget, dynamically selects an optimal subset of predictors per instance and samples labels proportionally to residual uncertainty, combined with a reweighting strategy to minimize estimation variance. AM-PPI is the first to integrate multiple predictors into active statistical inference, enabling instance-adaptive routing. Theoretically, the authors prove the existence of a global optimum for the resulting non-jointly convex optimization problem and derive a closed-form criterion for quantifying multi-predictor gains. Experiments on synthetic data and three clinical tasks show that AM-PPI narrows confidence intervals by 10%–40% over single-predictor ASI methods within critical budget regimes, while matching the best baseline performance otherwise.

📝 Abstract

Post-deployment monitoring of healthcare AI requires statistically valid, label-efficient methods, but gold-standard labels from clinician chart review are expensive. Prediction-powered inference (PPI) and active statistical inference (ASI) reduce label cost by combining a small labeled sample with abundant model predictions, but both are restricted to a single predictor, a poor fit for modern clinical pipelines that have multiple predictors of differing cost and accuracy available at inference time. We propose Active Multiple-Prediction-Powered Inference (AM-PPI), which routes each instance to a cost-appropriate predictor subset, samples gold-standard labels in proportion to the chosen subset's residual uncertainty, and reweights predictions to minimize estimator variance, all under a single deployment-time budget. AM-PPI generalizes ASI to leverage multiple predictors and extends Multiple-PPI from global per-predictor allocation to per-instance adaptive routing. We derive closed-form Karush-Kuhn-Tucker (KKT) conditions for all three decisions and prove, via biconvexity and strong duality, that the resulting fixed point is a global optimum despite the joint problem being non-jointly-convex. We establish asymptotic normality with valid coverage, minimum-variance unbiasedness within the linear-prediction augmented inverse propensity weighted (AIPW) class, and a closed-form criterion identifying when multiple predictors help. On synthetic data and three healthcare monitoring tasks, AM-PPI produces 10 to 40 percent narrower confidence intervals (CIs) than single-predictor ASI in the budget regime where routing matters, and matches the better baseline elsewhere.

Problem

Research questions and friction points this paper is trying to address.

healthcare AI monitoring

label-efficient inference

multiple predictors

prediction-powered inference

active statistical inference

Innovation

Methods, ideas, or system contributions that make the work stand out.

Active Multiple-Prediction-Powered Inference

prediction-powered inference

adaptive routing