STRIKE: Additive Feature-Group-Aware Stacking Framework for Credit Default Prediction

📅 2026-04-19

📈 Citations: 0

✨ Influential: 0

career value

190K/year

🤖 AI Summary

This study addresses the challenges posed by high-dimensional, heterogeneous, and noisy credit data, which often lead traditional monolithic models to overfit and exhibit poor robustness under distributional shifts. To mitigate these issues, the authors propose a feature-group-aware stacking framework that partitions the feature space into semantically coherent groups, trains dedicated learners on each group, and aggregates their predictions via a meta-learner to construct a modular and interpretable additive risk model. Rather than increasing model complexity alone, this approach enhances performance through meaningful feature decomposition. Empirical evaluation on three real-world credit risk datasets demonstrates that the proposed method achieves significantly higher AUC-ROC scores than strong baseline tree-based models and conventional stacking approaches, confirming its superior generalization ability and stability.

Technology Category

Application Category

📝 Abstract

Credit risk default prediction remains a cornerstone of risk management in the financial industry. The task involves estimating the likelihood that a borrower will fail to meet debt obligations, an objective critical for lending decisions, portfolio optimization, and regulatory compliance. Traditional machine learning models such as logistic regression and tree-based ensembles are widely adopted for their interpretability and strong empirical performance. However, modern credit datasets are high-dimensional, heterogeneous, and noisy, increasing overfitting risk in monolithic models and reducing robustness under distributional shift. We introduce STRIKE (Stacking via Targeted Representations of Isolated Knowledge Extractors), a feature-group-aware stacking framework for structured tabular credit risk data. Rather than training a single monolithic model on the complete dataset, STRIKE partitions the feature space into semantically coherent groups and trains independent learners within each group. This decomposition is motivated by an additive perspective on risk modeling, where distinct feature sources contribute complementary evidence that can be combined through a structured aggregation. The resulting group-specific predictions are integrated through a meta-learner that aggregates signals while maintaining robustness and modularity. We evaluate STRIKE on three real-world datasets spanning corporate bankruptcy and consumer lending scenarios. Across all settings, STRIKE consistently outperforms strong tree-based baselines and conventional stacking approaches in terms of AUC-ROC. Ablation studies confirm that performance gains stem from meaningful feature decomposition rather than increased model complexity. Our findings demonstrate that STRIKE is a stable, scalable, and interpretable framework for credit risk default prediction tasks.

Problem

Research questions and friction points this paper is trying to address.

credit default prediction

high-dimensional data

distributional shift

overfitting

feature heterogeneity

Innovation

Methods, ideas, or system contributions that make the work stand out.

feature-group-aware

stacking framework

credit default prediction