🤖 AI Summary
Critical vision systems—e.g., autonomous driving perception modules—are prone to failures such as false detections, adversarial vulnerability, and hallucinations under rare or unseen scenarios, yet lack interpretable failure identification and modeling capabilities. To address this, we propose CAIRO: the first context-aware, human-judgment-incentivized, ontology-driven, knowledge-graph-based “human-in-the-loop” failure discovery framework. CAIRO formalizes black-box model failures as OWL/XML-encoded knowledge graphs, enabling shareable, logically inferable, and auditable failure representations. Technically, it integrates ontology engineering, human–machine collaborative testing, robustness analysis for object detection, and explainable AI (XAI). Evaluated on real-world autonomous driving perception systems, CAIRO significantly improves the efficiency of critical failure mode discovery, enhances semantic interpretability of failures, and enables cross-team reuse of failure models and insights.
📝 Abstract
Vision systems are increasingly deployed in critical domains such as surveillance, law enforcement, and transportation. However, their vulnerabilities to rare or unforeseen scenarios pose significant safety risks. To address these challenges, we introduce Context-Awareness and Interpretability of Rare Occurrences (CAIRO), an ontology-based human-assistive discovery framework for failure cases (or CP - Critical Phenomena) detection and formalization. CAIRO by design incentivizes human-in-the-loop for testing and evaluation of criticality that arises from misdetections, adversarial attacks, and hallucinations in AI black-box models. Our robust analysis of object detection model(s) failures in automated driving systems (ADS) showcases scalable and interpretable ways of formalizing the observed gaps between camera perception and real-world contexts, resulting in test cases stored as explicit knowledge graphs (in OWL/XML format) amenable for sharing, downstream analysis, logical reasoning, and accountability.