๐ค AI Summary
This paper addresses the critical issue of excessively wide prediction intervals in cross-conformal prediction, which severely limits practical utility. Under the strict theoretical guarantee of marginal coverage at least $1-2alpha$, we propose a novel randomized $p$-value aggregation method. Grounded in exchangeability theory, our approach introduces a randomization mechanism to optimize $p$-value combination and reformulates both the cross-conformal framework and folding strategy, thereby breaking the conventional widthโcoverage trade-off bottleneck. Theoretical analysis confirms tight coverage guarantees, while simulations demonstrate a 15โ30% reduction in average prediction interval width, substantially improving statistical efficiency. The core innovation lies in the principled integration of randomization with exchangeability, enabling more efficient and compact conformal inference.
๐ Abstract
Vovk (2015) introduced cross-conformal prediction, a modification of split conformal designed to improve the width of prediction sets. The method, when trained with a miscoverage rate equal to $alpha$ and $n gg K$, ensures a marginal coverage of at least $1 - 2alpha - 2(1-alpha)(K-1)/(n+K)$, where $n$ is the number of observations and $K$ denotes the number of folds. A simple modification of the method achieves coverage of at least $1-2alpha$. In this work, we propose new variants of both methods that yield smaller prediction sets without compromising the latter theoretical guarantee. The proposed methods are based on recent results deriving more statistically efficient combination of p-values that leverage exchangeability and randomization. Simulations confirm the theoretical findings and bring out some important tradeoffs.