🤖 AI Summary
This paper challenges the conflation of certified robustness with holistic AI safety, exposing three structural deficiencies: (1) detection mechanisms that flag anomalies without discriminating among them, (2) the absence of standardized criteria for evaluating certification schemes, and (3) security risks arising from misplaced user trust in “guaranteed” robustness claims. To address these, we advance the core thesis that certification ≠ safety and propose a cross-layer critical analysis framework integrating human-factors engineering, formal boundary analysis of verification guarantees, and empirical deployment case studies. We introduce systematic principles for evaluating the practical utility of certification schemes, shifting emphasis from theoretical assurances to deployable, operationally sound security practices. Our work establishes a methodological foundation for community-wide norms governing responsible robustness claims and supports the construction of trustworthy AI safety stacks.
📝 Abstract
While certified robustness is widely promoted as a solution to adversarial examples in artificial intelligence systems, significant challenges remain before these techniques can be meaningfully deployed in real-world applications. We identify critical gaps in current research, including the paradox of detection without distinction, the lack of clear criteria for practitioners to evaluate certification schemes, and the potential security risks arising from users' expectations surrounding “guaranteed” robustness claims. This position paper is a call to arms for the certification research community, proposing concrete steps to address these fundamental challenges and advance the field toward practical applicability.
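To make the “guarantee” and the detection-without-distinction paradox concrete, below is a minimal sketch of a randomized-smoothing certificate in the style of Cohen et al. (2019). The toy `base_classifier`, the noise level `sigma`, and the sample counts are illustrative assumptions, not the paper's method; the point is that the certified radius bounds how far the prediction is stable, while an abstention flags uncertainty without saying whether the input was benign or adversarial.

```python
# Illustrative sketch only: a smoothed classifier with a Clopper-Pearson
# lower bound on the top-class probability, as in randomized smoothing.
import numpy as np
from scipy.stats import norm
from statsmodels.stats.proportion import proportion_confint


def base_classifier(x: np.ndarray) -> int:
    """Toy stand-in for a trained network: class 1 iff the feature sum > 0."""
    return int(x.sum() > 0)


def certify(x: np.ndarray, sigma: float = 0.5, n: int = 1000,
            alpha: float = 0.001) -> tuple[int, float]:
    """Return (predicted class, certified L2 radius), or (-1, 0.0) on abstain.

    The radius R = sigma * Phi^{-1}(p_A) guarantees the smoothed prediction
    is constant within an L2 ball of radius R around x. Note the guarantee
    says nothing about whether x itself was benign or adversarial, and an
    abstention detects trouble without distinguishing its cause.
    """
    rng = np.random.default_rng(0)
    noisy = x + sigma * rng.standard_normal((n, x.shape[0]))
    votes = np.array([base_classifier(z) for z in noisy])
    top_class = int(np.bincount(votes, minlength=2).argmax())
    count_top = int((votes == top_class).sum())
    # One-sided lower confidence bound on the top-class probability p_A.
    p_lower = proportion_confint(count_top, n, alpha=2 * alpha,
                                 method="beta")[0]
    if p_lower <= 0.5:
        return -1, 0.0  # abstain: detection without distinction
    return top_class, sigma * norm.ppf(p_lower)


if __name__ == "__main__":
    x = np.array([0.3, -0.1, 0.5])
    cls, radius = certify(x)
    print(f"class={cls}, certified L2 radius={radius:.3f}")
```

Even in this simplified form, the certificate is conditional on the noise model and the statistical confidence level, which is precisely the gap between a formal guarantee and the unconditional safety that users may read into it.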