🤖 AI Summary
This paper addresses the applicability of training-conditional conformal prediction (CP) for statistical safety certification of dynamic systems, identifying its failure to guarantee strict coverage in binomial proportion confidence interval (BPCI) estimation—rendering it unsuitable for safety-critical certification. Method: Through rigorous theoretical analysis and hypothesis testing, the authors formally examine coverage guarantees under finite-sample settings. Contribution/Results: The work provides the first formal proof that training-conditional CP lacks distribution-free coverage guarantees in finite samples, whereas classical BPCI methods—including Clopper–Pearson and Wilson intervals—deliver tight, distribution-agnostic safety assurances. The paper exposes a fundamental limitation of CP in discrete proportion inference, advocating a return to statistically validated classical methods for safety-sensitive control tasks. It establishes a theoretical criterion and methodological guideline for safety certification in control systems.
📝 Abstract
Estimating the expectation of a Bernoulli random variable based on N independent trials is a classical problem in statistics, typically addressed using Binomial Proportion Confidence Intervals (BPCI). In the control systems community, many critical tasks-such as certifying the statistical safety of dynamical systems-can be formulated as BPCI problems. Conformal Prediction (CP), a distribution-free technique for uncertainty quantification, has gained significant attention in recent years and has been applied to various control systems problems, particularly to address uncertainties in learned dynamics or controllers. A variant known as training-conditional CP was recently employed to tackle the problem of safety certification. In this note, we highlight that the use of training-conditional CP in this context does not provide valid safety guarantees. We demonstrate why CP is unsuitable for BPCI problems and argue that traditional BPCI methods are better suited for statistical safety certification.