An Optimised Greedy-Weighted Ensemble Framework for Financial Loan Default Prediction

📅 2026-03-19
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This work proposes BlendNet, a particle swarm optimization (PSO)-based weighted ensemble framework designed to address the limitations of traditional models in financial loan default prediction, which stem from nonlinear relationships, class imbalance, and dynamic shifts in borrower behavior. BlendNet integrates tree-based models and neural networks, employing recursive feature elimination for feature selection and a dynamic greedy weighting mechanism that assigns base model weights based on empirical performance. To capture higher-order interactions among model outputs while ensuring both predictive accuracy and well-calibrated probabilities, a neural network meta-learner is introduced in a stacking architecture. Evaluated on the Lending Club dataset, BlendNet achieves an AUC of 0.80, a macro-averaged F1-score of 0.73, and a default recall of 0.81, significantly outperforming individual baseline models.

Technology Category

Application Category

📝 Abstract
Accurate prediction of loan defaults is a central challenge in credit risk management, particularly in modern financial datasets characterised by nonlinear relationships, class imbalance, and evolving borrower behaviour. Traditional statistical models and static ensemble methods often struggle to maintain reliable performance under such conditions. This study proposes an Optimised Greedy-Weighted Ensemble framework for loan default prediction that dynamically allocates model weights based on empirical predictive performance. The framework integrates multiple machine learning classifiers, with their hyperparameters first optimised using Particle Swarm Optimisation. Model predictions are then combined via a regularised greedy weighting mechanism. At the same time, a neural-network-based meta-learner is employed within stacked-ensemble to capture higher-order relationships among model outputs. Experiments conducted on the Lending Club dataset demonstrate that the proposed framework improves predictive performance compared with individual classifiers. The BlendNet ensemble achieved the strongest results with an AUC of 0.80, a macro-average F1-score of 0.73, and a default recall of 0.81. Calibration analysis further shows that tree-based ensembles such as Extra Trees and Gradient Boosting provide the most reliable probability estimates, while the stacked ensemble offers superior ranking capability. Feature analysis using Recursive Feature Elimination identifies revolving utilisation, annual income, and debt-to-income ratio as the most influential predictors of loan default. These findings demonstrate that performance-driven ensemble weighting can improve both predictive accuracy and interpretability in credit risk modelling. The proposed framework provides a scalable data-driven approach to support institutional credit assessment, risk monitoring, and financial decision-making.
Problem

Research questions and friction points this paper is trying to address.

loan default prediction
credit risk management
class imbalance
nonlinear relationships
ensemble learning
Innovation

Methods, ideas, or system contributions that make the work stand out.

Greedy-Weighted Ensemble
Particle Swarm Optimisation
Stacked Ensemble
Neural Network Meta-Learner
Credit Risk Prediction
🔎 Similar Papers
No similar papers found.
E
Ezekiel Nii Noye Nortey
Department of Statistics and Actuarial Science, University of Ghana, P. O. Box LG 115, Legon, Accra, Ghana.
J
Jones Asante-Koranteng
Department of Statistics, Tali Graduate school, Dominion University College, P. O. Box LG 80, Legon, Accra, Ghana.
Marcellin Atemkeng
Marcellin Atemkeng
Associate Professor of Applied Mathematics & Machine Learning, Rhodes University
Big DataStatistical Signal ProcessingeXplainable AIDeep LearningRadio Astronomy
T
Theophilus Ansah-Narh
Ghana Space Science and Technology Institute, Ghana Atomic Energy Commission, P. O. Box LG 80, Legon, Accra, Ghana.
D
David Mensah
Department of Statistics, Tali Graduate school, Dominion University College, P. O. Box LG 80, Legon, Accra, Ghana.
R
Rebecca Davis
Department of Mathematics and Actuarial Science, Pentecost University, P.O. Box KN 1739, Kaneshie, Accra, Ghana.
R
Ravenhill Adjetey Laryea
Department of Economics and Actuarial Science, University of Professional Studies, P. O. Box LG 149, Legon, Accra, Ghana.