UniCAD: Efficient and Extendable Architecture for Multi-Task Computer-Aided Diagnosis System

📅 2025-05-14
📈 Citations: 0
Influential: 0
🤖 AI Summary
Multi-task computer-aided diagnosis (CAD) systems for medical imaging are hard to build because visual pre-training is costly and open-source platform support is lacking. UniCAD addresses this with a unified multi-task architecture for both 2D and 3D medical images. It freezes a vision foundation model and fine-tunes it with low-rank adaptation (LoRA), which introduces only 0.17% trainable parameters, and it adds plug-and-play task-specific expert modules so that new functions can be attached flexibly. The design reduces task-adaptation overhead while maintaining diagnostic accuracy and deployment efficiency. Across 12 medical imaging datasets, the method is reported to consistently outperform existing approaches. The authors also release the source code and an open-source platform for sharing lightweight expert models.

📝 Abstract
The growing complexity and scale of visual model pre-training have made developing and deploying multi-task computer-aided diagnosis (CAD) systems increasingly challenging and resource-intensive. Furthermore, the medical imaging community lacks an open-source CAD platform to enable the rapid creation of efficient and extendable diagnostic models. To address these issues, we propose UniCAD, a unified architecture that leverages the robust capabilities of pre-trained vision foundation models to seamlessly handle both 2D and 3D medical images while requiring only minimal task-specific parameters. UniCAD introduces two key innovations: (1) Efficiency: A low-rank adaptation strategy is employed to adapt a pre-trained visual model to the medical image domain, achieving performance on par with fully fine-tuned counterparts while introducing only 0.17% trainable parameters. (2) Plug-and-Play: A modular architecture that combines a frozen foundation model with multiple plug-and-play experts, enabling diverse tasks and seamless functionality expansion. Building on this unified CAD architecture, we establish an open-source platform where researchers can share and access lightweight CAD experts, fostering a more equitable and efficient research ecosystem. Comprehensive experiments across 12 diverse medical datasets demonstrate that UniCAD consistently outperforms existing methods in both accuracy and deployment efficiency. The source code and project page are available at https://mii-laboratory.github.io/UniCAD/.
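The low-rank adaptation idea described in the abstract can be sketched as follows. This is a minimal NumPy illustration under stated assumptions, not UniCAD's implementation: the class name `LoRALinear`, the layer sizes, the rank `r`, and the scaling `alpha` are all hypothetical values chosen for the example. A frozen weight `W` is augmented with a trainable update `B @ A` of rank `r`, so only `A` and `B` would be updated during fine-tuning.

```python
import numpy as np


class LoRALinear:
    """Frozen linear layer plus a trainable low-rank update (illustrative only)."""

    def __init__(self, d_in, d_out, r=4, alpha=8.0, seed=0):
        rng = np.random.default_rng(seed)
        # Frozen pre-trained weight; random values stand in for real weights.
        self.W = rng.standard_normal((d_out, d_in)) / np.sqrt(d_in)
        # Trainable low-rank factors. B starts at zero, so the adapted layer
        # initially produces exactly the frozen layer's output.
        self.A = rng.standard_normal((r, d_in)) * 0.01
        self.B = np.zeros((d_out, r))
        self.scale = alpha / r

    def forward(self, x):
        frozen = x @ self.W.T
        update = (x @ self.A.T) @ self.B.T
        return frozen + self.scale * update

    def trainable_fraction(self):
        # Share of parameters that would receive gradients (A and B only).
        trainable = self.A.size + self.B.size
        return trainable / (self.W.size + trainable)


# Hypothetical sizes for a single layer.
layer = LoRALinear(d_in=768, d_out=768, r=4)
x = np.ones((2, 768))
y = layer.forward(x)
frac = layer.trainable_fraction()
```

With these hypothetical sizes the trainable share is about 1% of this single layer. The 0.17% figure in the abstract refers to the whole model and depends on the authors' actual configuration, which is not given here.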
Problem

Research questions and friction points this paper is trying to address.

Developing and deploying multi-task CAD systems is resource-intensive
The medical imaging community lacks an open-source platform for extendable diagnostic models
2D and 3D medical images need unified handling with minimal task-specific parameters
Innovation

Methods, ideas, or system contributions that make the work stand out.

Leverages pre-trained vision foundation models
Uses low-rank adaptation for efficiency
Adopts a modular plug-and-play architecture for extensibility
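The plug-and-play idea listed above can be sketched as a registry of lightweight experts that share one frozen feature extractor. This is a simplified illustration, not the authors' design: `frozen_backbone`, `Expert`, `ExpertRegistry`, the task names, and the thresholds are all hypothetical, and each expert here is reduced to a small head on shared features rather than an adapter inside the foundation model.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List


def frozen_backbone(image: List[float]) -> List[float]:
    """Stand-in for a frozen foundation model: returns [sum, max] as features."""
    return [sum(image), max(image)]


@dataclass
class Expert:
    """A lightweight task-specific module applied to the shared features."""
    name: str
    head: Callable[[List[float]], str]


@dataclass
class ExpertRegistry:
    experts: Dict[str, Expert] = field(default_factory=dict)

    def register(self, expert: Expert) -> None:
        # Adding a task does not touch the backbone or the other experts.
        self.experts[expert.name] = expert

    def diagnose(self, task: str, image: List[float]) -> str:
        if task not in self.experts:
            raise KeyError(f"no expert registered for task {task!r}")
        features = frozen_backbone(image)
        return self.experts[task].head(features)


registry = ExpertRegistry()
# Two toy experts with made-up decision rules.
registry.register(Expert("bright_lesion", lambda f: "positive" if f[1] > 0.8 else "negative"))
registry.register(Expert("total_intensity", lambda f: "high" if f[0] > 2.0 else "low"))

result_a = registry.diagnose("bright_lesion", [0.1, 0.9, 0.3])
result_b = registry.diagnose("total_intensity", [0.1, 0.9, 0.3])
```

The point of the sketch is the separation of concerns: the backbone is computed the same way for every task, and supporting a new task only requires registering one more small expert.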
Yitao Zhu
Hong Kong Polytechnic University
Medical Image Analysis, Computer Vision, Foundation Model
Yuan Yin
School of Biomedical Engineering & State Key Laboratory of Advanced Medical Materials and Devices, ShanghaiTech University, Shanghai, 201210, Shanghai, China
Zhenrong Shen
Shanghai Jiao Tong University
Medical Image Computing, Medical Image Analysis, Computer Vision, Deep Learning
Zihao Zhao
School of Biomedical Engineering & State Key Laboratory of Advanced Medical Materials and Devices, ShanghaiTech University, Shanghai, 201210, Shanghai, China
Haiyu Song
School of Biomedical Engineering & State Key Laboratory of Advanced Medical Materials and Devices, ShanghaiTech University, Shanghai, 201210, Shanghai, China
Sheng Wang
School of Biomedical Engineering, Shanghai Jiao Tong University, Shanghai, 200030, Shanghai, China
Dinggang Shen
Prof. and Founding Dean, School of BME, ShanghaiTech University; Co-CEO, United Imaging Intelligence
Medical Image Analysis, Medical Image Computing, Biomedical Image Analysis, Image Registration
Qian Wang
School of Biomedical Engineering & State Key Laboratory of Advanced Medical Materials and Devices, ShanghaiTech University, Shanghai, 201210, Shanghai, China; Shanghai Clinical Research and Trial Center, Shanghai, 201210, Shanghai, China