H-CNN-ViT: A Hierarchical Gated Attention Multi-Branch Model for Bladder Cancer Recurrence Prediction

📅 2025-11-17
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Bladder cancer exhibits a high postoperative recurrence rate (78%), yet multiparametric contrast-enhanced MRI interpretation remains challenging due to post-surgical confounders—including fibrosis and edema—and the absence of dedicated, publicly available datasets for AI-driven recurrence prediction. To address these limitations, we propose a hierarchical gated-attention multi-branch network that jointly leverages CNN and Vision Transformer (ViT) pathways to model each MRI sequence independently, followed by context-aware dynamic weighting for global–local feature fusion. We introduce and publicly release the first multimodal MRI dataset specifically designed for bladder cancer recurrence prediction. Evaluated on our curated dataset, our model achieves an AUC of 78.6%, significantly outperforming existing methods. The source code and dataset are fully open-sourced. Moreover, the model provides clinically interpretable attention maps, demonstrating strong translational potential for clinical deployment.

Technology Category

Application Category

📝 Abstract
Bladder cancer is one of the most prevalent malignancies worldwide, with a recurrence rate of up to 78%, necessitating accurate post-operative monitoring for effective patient management. Multi-sequence contrast-enhanced MRI is commonly used for recurrence detection; however, interpreting these scans remains challenging, even for experienced radiologists, due to post-surgical alterations such as scarring, swelling, and tissue remodeling. AI-assisted diagnostic tools have shown promise in improving bladder cancer recurrence prediction, yet progress in this field is hindered by the lack of dedicated multi-sequence MRI datasets for recurrence assessment study. In this work, we first introduce a curated multi-sequence, multi-modal MRI dataset specifically designed for bladder cancer recurrence prediction, establishing a valuable benchmark for future research. We then propose H-CNN-ViT, a new Hierarchical Gated Attention Multi-Branch model that enables selective weighting of features from the global (ViT) and local (CNN) paths based on contextual demands, achieving a balanced and targeted feature fusion. Our multi-branch architecture processes each modality independently, ensuring that the unique properties of each imaging channel are optimally captured and integrated. Evaluated on our dataset, H-CNN-ViT achieves an AUC of 78.6%, surpassing state-of-the-art models. Our model is publicly available at https://github.com/XLIAaron/H-CNN-ViT}.
Problem

Research questions and friction points this paper is trying to address.

Predict bladder cancer recurrence using multi-sequence MRI scans
Address challenges in interpreting post-surgical MRI alterations
Overcome lack of dedicated datasets for recurrence assessment
Innovation

Methods, ideas, or system contributions that make the work stand out.

Hierarchical Gated Attention Multi-Branch model
Selective weighting of global and local features
Independent multi-modal processing for optimal integration
🔎 Similar Papers
No similar papers found.
Xueyang Li
Xueyang Li
University of Notre Dame
Medical Image
Z
Zongren Wang
Department of Urology, The First Affiliated Hospital, Sun Yat-sen University, China
Y
Yuliang Zhang
Department of Urology, The First Affiliated Hospital, Sun Yat-sen University, China
Z
Zixuan Pan
Computer Science and Engineering, University of Notre Dame, USA
Y
Yu-Jen Chen
Computer Science and Engineering, University of Notre Dame, USA
Nishchal Sapkota
Nishchal Sapkota
Ph.D Candidate, University of Notre Dame
Computer VisionDeep LearningSelf-supervised LearningAI for HealthcareMathematical Modeling
Gelei Xu
Gelei Xu
University of Notre Dame
D
Danny Z. Chen
Computer Science and Engineering, University of Notre Dame, USA
Yiyu Shi
Yiyu Shi
Full Professor, University of Notre Dame
hardware/software co-designdeep learning accelerationon-device AIAI for healthcare