Toward Faithfulness-guided Ensemble Interpretation of Neural Network

📅 2025-09-04

📈 Citations: 0

✨ Influential: 0

🤖 AI Summary

Existing neural network explanation methods suffer from insufficient faithfulness and lack effective means to evaluate intermediate-layer reasoning processes. Method: We propose FEI (Faithfulness-Enhanced Integrated), a novel explanation framework comprising: (i) smoothed approximation techniques to improve quantitative faithfulness scores; (ii) a new qualitative faithfulness metric for hidden layers—enabling, for the first time, interpretable assessment of intermediate reasoning paths; and (iii) integration of ensemble-based explanation, hidden-layer encoding analysis, and multi-granularity visualization to establish a synergistic, multi-dimensional evaluation system combining quantitative and qualitative analysis. Results: Extensive experiments demonstrate that FEI significantly outperforms state-of-the-art methods across mainstream benchmarks, achieving substantial gains in visualization fidelity and faithfulness metrics—e.g., 32.7% reduction in Infidelity and 28.4% improvement in Erasure. FEI establishes a verifiable, diagnosable explanation paradigm for trustworthy AI.

Technology Category

Application Category

📝 Abstract

Interpretable and faithful explanations for specific neural inferences are crucial for understanding and evaluating model behavior. Our work introduces extbf{F}aithfulness-guided extbf{E}nsemble extbf{I}nterpretation ( extbf{FEI}), an innovative framework that enhances the breadth and effectiveness of faithfulness, advancing interpretability by providing superior visualization. Through an analysis of existing evaluation benchmarks, extbf{FEI} employs a smooth approximation to elevate quantitative faithfulness scores. Diverse variations of extbf{FEI} target enhanced faithfulness in hidden layer encodings, expanding interpretability. Additionally, we propose a novel qualitative metric that assesses hidden layer faithfulness. In extensive experiments, extbf{FEI} surpasses existing methods, demonstrating substantial advances in qualitative visualization and quantitative faithfulness scores. Our research establishes a comprehensive framework for elevating faithfulness in neural network explanations, emphasizing both breadth and precision

Problem

Research questions and friction points this paper is trying to address.

Enhancing faithfulness in neural network interpretation

Improving interpretability through superior visualization techniques

Advancing quantitative and qualitative faithfulness evaluation metrics

Innovation

Methods, ideas, or system contributions that make the work stand out.

Faithfulness-guided Ensemble Interpretation framework

Smooth approximation for quantitative faithfulness scores

Novel qualitative metric for hidden layer assessment

🔎 Similar Papers

No similar papers found.

Authors to Follow