PanTS: The Pancreatic Tumor Segmentation Dataset

📅 2025-07-01
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Limited by small-scale data and insufficient anatomical context, pancreatic tumor detection, localization, and segmentation remain challenging. To address this, we introduce PanTS—the largest and most comprehensive multicenter pancreatic CT dataset to date—comprising 36,390 contrast-enhanced CT scans and over 993,000 voxel-level annotations covering tumors and 24 surrounding anatomical structures. We propose a novel multi-institutional collaborative annotation framework, incorporating expert-validated fine-grained labeling and rich metadata (e.g., age, sex, diagnosis, contrast phase). PanTS contains 16× more tumor annotations than all existing public datasets combined and is the first to systematically support tumor–anatomy co-modeling. Models trained on PanTS achieve statistically significant improvements over state-of-the-art baselines on detection, localization, and segmentation tasks, establishing new performance benchmarks. This work demonstrates that large-scale, high-fidelity anatomical context data is critical for advancing AI-driven pancreatic tumor analysis.

Technology Category

Application Category

📝 Abstract
PanTS is a large-scale, multi-institutional dataset curated to advance research in pancreatic CT analysis. It contains 36,390 CT scans from 145 medical centers, with expert-validated, voxel-wise annotations of over 993,000 anatomical structures, covering pancreatic tumors, pancreas head, body, and tail, and 24 surrounding anatomical structures such as vascular/skeletal structures and abdominal/thoracic organs. Each scan includes metadata such as patient age, sex, diagnosis, contrast phase, in-plane spacing, slice thickness, etc. AI models trained on PanTS achieve significantly better performance in pancreatic tumor detection, localization, and segmentation compared to those trained on existing public datasets. Our analysis indicates that these gains are directly attributable to the 16x larger-scale tumor annotations and indirectly supported by the 24 additional surrounding anatomical structures. As the largest and most comprehensive resource of its kind, PanTS offers a new benchmark for developing and evaluating AI models in pancreatic CT analysis.
Problem

Research questions and friction points this paper is trying to address.

Advancing pancreatic tumor segmentation research with large-scale CT dataset
Improving AI model performance in pancreatic tumor detection and localization
Providing comprehensive benchmark for pancreatic CT analysis AI development
Innovation

Methods, ideas, or system contributions that make the work stand out.

Large-scale multi-institutional pancreatic CT dataset
Expert-validated voxel-wise annotations for 993k structures
AI models show 16x better tumor detection
🔎 Similar Papers
No similar papers found.
Wenxuan Li
Wenxuan Li
Johns Hopkins University
Imaging InformaticsComputer-aided Diagnosis
X
Xinze Zhou
Department of Computer Science, Johns Hopkins University
Q
Qi Chen
Department of Computer Science, Johns Hopkins University
Tianyu Lin
Tianyu Lin
Johns Hopkins University
Medical Image AnalysisComputer Vision
P
Pedro R. A. S. Bassi
Department of Pharmacy and Biotechnology, University of Bologna
Szymon Płotka
Szymon Płotka
Jagiellonian University
Machine LearningDeep LearningComputer VisionMedical Imaging
J
Jarosław B. Ćwikła
Department of Cardiology and Internal Medicine, University of Warmia and Mazury
Xiaoxi Chen
Xiaoxi Chen
University of Illinois Urbana-Champaign
Diagnostic RadiologyTranslational MedicineQuantitative Medical ImagingAI in Medical Imaging
C
Chen Ye
Department of General Surgery, Peking University Third Hospital
Z
Zheren Zhu
Department of Radiology & Biomedical Imaging, University of California, San Francisco
K
Kai Ding
Department of Radiation Oncology, Johns Hopkins School of Medicine
H
Heng Li
Department of Radiation Oncology, Johns Hopkins School of Medicine
K
Kang Wang
Department of Radiology & Biomedical Imaging, University of California, San Francisco
Y
Yang Yang
Department of Radiology & Biomedical Imaging, University of California, San Francisco
Yucheng Tang
Yucheng Tang
Sr. Research Scientist at NVIDIA
3D Computer VisionVision-Language ModelHealthcare AIAccelerated Computing
Daguang Xu
Daguang Xu
Senior Research Manager at NVIDIA
Deep LearningMachine LearningMedical Image AnalysisCompressive SensingSparse coding
A
Alan L. Yuille
Department of Computer Science, Johns Hopkins University
Z
Zongwei Zhou
Department of Computer Science, Johns Hopkins University