SHAPoint: Task-Agnostic, Efficient, and Interpretable Point-Based Risk Scoring via Shapley Values

📅 2025-09-28
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Traditional interpretable risk scoring methods rely on manual preprocessing, task-specific modeling, and strong assumptions—limiting flexibility and predictive performance. To address this, we propose SHAPoint: a task-agnostic, efficient point-based risk scoring framework that systematically integrates Shapley values into point-based scoring for the first time. SHAPoint synergizes the high predictive accuracy of gradient-boosted trees with the theoretically grounded, local interpretability of Shapley values. It requires no feature engineering, natively handles missing data and monotonicity constraints, and applies seamlessly to classification, regression, and survival analysis. Experiments demonstrate that SHAPoint matches the predictive accuracy of state-of-the-art models while achieving substantially faster inference. Moreover, it yields concise, compact, and clinically interpretable scoring rules. By unifying rigorous attribution theory with practical risk modeling, SHAPoint establishes a general, robust paradigm for interpretable risk assessment.

Technology Category

Application Category

📝 Abstract
Interpretable risk scores play a vital role in clinical decision support, yet traditional methods for deriving such scores often rely on manual preprocessing, task-specific modeling, and simplified assumptions that limit their flexibility and predictive power. We present SHAPoint, a novel, task-agnostic framework that integrates the predictive accuracy of gradient boosted trees with the interpretability of point-based risk scores. SHAPoint supports classification, regression, and survival tasks, while also inheriting valuable properties from tree-based models, such as native handling of missing data and support for monotonic constraints. Compared to existing frameworks, SHAPoint offers superior flexibility, reduced reliance on manual preprocessing, and faster runtime performance. Empirical results show that SHAPoint produces compact and interpretable scores with predictive performance comparable to state-of-the-art methods, but at a fraction of the runtime, making it a powerful tool for transparent and scalable risk stratification.
Problem

Research questions and friction points this paper is trying to address.

Develops task-agnostic interpretable risk scoring via Shapley values
Integrates gradient boosted trees accuracy with point-based interpretability
Enables transparent risk stratification with minimal preprocessing and faster runtime
Innovation

Methods, ideas, or system contributions that make the work stand out.

Integrates gradient boosted trees with interpretable risk scores
Supports classification, regression, and survival analysis tasks
Provides fast runtime with native missing data handling
🔎 Similar Papers
No similar papers found.
T
Tomer D. Meirman
Faculty of Computer and Information Sciences, Ben-Gurion University of the Negev, Beer–Sheva, Israel
Bracha Shapira
Bracha Shapira
Ben-Gurion University of the Negev
Machine LearningRecommender SystemsCyber Security
Noa Dagan
Noa Dagan
Clalit Research Institute and Ben-Gurion University, Israel
Clinical prediction modelsCausal inferenceAlgorithmic fairness
L
Lior S. Rokach
Faculty of Computer and Information Sciences, Ben-Gurion University of the Negev, Beer–Sheva, Israel