A generalizable foundation model for intraoperative understanding across surgical procedures

πŸ“… 2026-02-14
πŸ“ˆ Citations: 0
✨ Influential: 0
πŸ“„ PDF
πŸ€– AI Summary
This work addresses the challenge of limited generalization in existing AI models for minimally invasive surgery, which stems from variations in surgical procedures and practitioners across institutions. To overcome this, the authors propose ZEN, the first intraoperative video foundation model capable of unified representation across multiple surgical procedures. Trained on a diverse dataset of over four million frames spanning more than 21 procedure types, ZEN leverages a self-supervised multi-teacher distillation framework and introduces a standardized benchmark for downstream tasks. It consistently outperforms current methods across zero-shot, few-shot, frozen-backbone, and full fine-tuning settings, achieving state-of-the-art performance on 20 downstream tasks. These results demonstrate ZEN’s strong cross-procedure and cross-institutional generalization, establishing a new paradigm for intraoperative assistance and intelligent surgical training evaluation.

Technology Category

Application Category

πŸ“ Abstract
In minimally invasive surgery, clinical decisions depend on real-time visual interpretation, yet intraoperative perception varies substantially across surgeons and procedures. This variability limits consistent assessment, training, and the development of reliable artificial intelligence systems, as most surgical AI models are designed for narrowly defined tasks and do not generalize across procedures or institutions. Here we introduce ZEN, a generalizable foundation model for intraoperative surgical video understanding trained on more than 4 million frames from over 21 procedures using a self-supervised multi-teacher distillation framework. We curated a large and diverse dataset and systematically evaluated multiple representation learning strategies within a unified benchmark. Across 20 downstream tasks and full fine-tuning, frozen-backbone, few-shot and zero-shot settings, ZEN consistently outperforms existing surgical foundation models and demonstrates robust cross-procedure generalization. These results suggest a step toward unified representations for surgical scene understanding and support future applications in intraoperative assistance and surgical training assessment.
Problem

Research questions and friction points this paper is trying to address.

intraoperative understanding
surgical generalization
foundation model
visual interpretation variability
surgical AI
Innovation

Methods, ideas, or system contributions that make the work stand out.

foundation model
surgical video understanding
self-supervised learning
multi-teacher distillation
cross-procedure generalization
K
Kanggil Park
Department of Surgery, Samsung Medical Center, Seoul, South Korea.
Y
Yongjun Jeon
Clinical Robotics and Embodied AI Research Center, Smart Healthcare Research Institute, Research Institute for Future Medicine, Samsung Medical Center, Seoul, South Korea.
S
Soyoung Lim
Department of Surgery, Samsung Medical Center, Seoul, South Korea.
S
Seonmin Park
Department of Surgery, Samsung Medical Center, Seoul, South Korea.
Jongmin Shin
Jongmin Shin
Samsung Electronics
Computer Science
J
Jung Yong Kim
Department of Surgery, Samsung Medical Center, Seoul, South Korea.
S
Sehyeon An
Department of Medical Device Management and Research, Samsung Advanced Institute for Health Sciences & Technology (SAIHST), Sungkyunkwan University, Seoul, South Korea.
Jinsoo Rhu
Jinsoo Rhu
Department of Surgery, Samsung Medical Center, Sungkyunkwan University School of Medicine
Transplantationhepatologyhepatobiliary surgeryartificial intelligence
Jongman Kim
Jongman Kim
Department of Surgery, Samsung Medical Center, Sungkyunkwan University School of Medicine
Liver transplantationHepatocellular carcinomaMinimal invasive surgeryLiver SurgeryImmunosuppression
G
Gyu-Seong Choi
Department of Surgery, Samsung Medical Center, Seoul, South Korea.
Namkee Oh
Namkee Oh
Department of Surgery, Samsung Medical Center
minimally invasive liver surgerytransplantationAI in surgery
Kyu-Hwan Jung
Kyu-Hwan Jung
Samsung Advanced Institute for Health Sciences and Technology, Sungkyunkwan University
Deep LearningMachine LearningArtificial IntelligenceMedical Image AnalysisDigital Health