A Reproducible Framework for Bias-Resistant Machine Learning on Small-Sample Neuroimaging Data.

📅 2026-02-02
🏛️ arXiv.org
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This study addresses the optimistic bias and compromised reproducibility in few-shot neuroimaging analysis, where conventional cross-validation often conflates model selection and evaluation by sharing data splits. To mitigate this, the authors propose a bias-resistant, interpretable machine learning framework that integrates domain knowledge–driven feature engineering, importance-guided feature selection, rigorous nested cross-validation, and calibrated decision threshold optimization. Applied to high-dimensional structural MRI data for predicting cognitive outcomes following deep brain stimulation, the approach achieves a nested cross-validated balanced accuracy of 0.660 ± 0.068 using a compact and interpretable feature subset, thereby significantly enhancing the reliability and transparency of small-sample biomedical modeling.

Technology Category

Application Category

📝 Abstract
We introduce a reproducible, bias-resistant machine learning framework that integrates domain-informed feature engineering, nested cross-validation, and calibrated decision-threshold optimization for small-sample neuroimaging data. Conventional cross-validation frameworks that reuse the same folds for both model selection and performance estimation yield optimistically biased results, limiting reproducibility and generalization. Demonstrated on a high-dimensional structural MRI dataset of deep brain stimulation cognitive outcomes, the framework achieved a nested-CV balanced accuracy of 0.660\,$\pm$\,0.068 using a compact, interpretable subset selected via importance-guided ranking. By combining interpretability and unbiased evaluation, this work provides a generalizable computational blueprint for reliable machine learning in data-limited biomedical domains.
Problem

Research questions and friction points this paper is trying to address.

bias
small-sample
neuroimaging
reproducibility
machine learning
Innovation

Methods, ideas, or system contributions that make the work stand out.

bias-resistant machine learning
nested cross-validation
small-sample neuroimaging
interpretable feature selection
calibrated decision-threshold optimization
🔎 Similar Papers
No similar papers found.
J
Jagan Mohan Reddy Dwarampudi
Department of Electrical and Computer Engineering, University of Houston
J
Jennifer L Purks
Department of Neurology, College of Medicine, University of Florida
Joshua Wong
Joshua Wong
University of Florida
Movement disordersneuromodulationdeep brain stimulation
Renjie Hu
Renjie Hu
University of Houston
Machine Learning
Tania Banerjee
Tania Banerjee
University of Houston
AI/MLSmart HealthcareData ScienceHPCIntelligent Transportation Systems