MieDB-100k: A Comprehensive Dataset for Medical Image Editing

📅 2026-02-10
🤖 AI Summary
This work addresses the scarcity of high-quality, large-scale multimodal datasets that jointly support understanding and generation in medical image editing, a key bottleneck hindering the advancement of generative models in this domain. To bridge this gap, the study introduces the first systematic categorization of medical image editing tasks into three types: perception, modification, and transformation. Building upon this framework, the authors construct MieDB-100k, a dataset comprising 100,000 samples generated via modality-specific expert models and rule-driven synthesis, followed by rigorous human validation to ensure clinical fidelity and diversity. Models trained on MieDB-100k demonstrate significantly superior performance and generalization compared to existing open- and closed-source alternatives, establishing a robust foundation for future research in medical image editing.

πŸ“ Abstract
The scarcity of high-quality data remains a primary bottleneck in adapting multimodal generative models for medical image editing. Existing medical image editing datasets often suffer from limited diversity, neglect of medical image understanding, and an inability to balance quality with scalability. To address these gaps, we propose MieDB-100k, a large-scale, high-quality, and diverse dataset for text-guided medical image editing. It organizes editing tasks into three perspectives, Perception, Modification, and Transformation, covering both understanding and generation abilities. We construct MieDB-100k via a data curation pipeline that leverages both modality-specific expert models and rule-based data synthesis methods, followed by rigorous manual inspection to ensure clinical fidelity. Extensive experiments demonstrate that models trained with MieDB-100k consistently outperform both open-source and proprietary models while exhibiting strong generalization ability. We anticipate that this dataset will serve as a cornerstone for future advancements in specialized medical image editing.
Problem

Research questions and friction points this paper is trying to address.

medical image editing
data scarcity
dataset diversity
multimodal generative models
clinical fidelity
Innovation

Methods, ideas, or system contributions that make the work stand out.

medical image editing
text-guided generation
MieDB-100k
data curation
multimodal generative models
Authors

Yongfan Lai
Peking University

Wen Qian
DAMO Academy, Alibaba Group, Zhejiang, China; Hupan Lab, Zhejiang Province, China; Zhejiang University, Zhejiang, China

Bo Liu
State Key Laboratory of General Artificial Intelligence, Beijing, China; School of Intelligence Science and Technology, Peking University, Beijing, China

Hongyan Li
State Key Laboratory of General Artificial Intelligence, Beijing, China; School of Intelligence Science and Technology, Peking University, Beijing, China

Hao Luo
Alibaba DAMO Academy
Computer Vision

Fan Wang
Alibaba DAMO Academy
Computer Vision, Machine Learning

Bohan Zhuang
Zhejiang University
Efficient AI, MLSys

Shenda Hong
Assistant Professor, Peking University
AI ECG, Biosignal, AI for Digital Health, Health Data Science, AI for Healthcare