SkinCaRe: A Multimodal Dermatology Dataset Annotated with Medical Caption and Chain-of-Thought Reasoning

📅 2024-05-28
🤖 AI Summary
Existing dermatology datasets lack concept-level metadata and clinically grounded natural language descriptions, which hinders the interpretability of multimodal large models in skin diagnosis. To address this, we introduce SkinCaRe, the first dermatology-specific multimodal dataset integrating medical images, fine-grained natural language descriptions, and clinical chain-of-thought (CoT) reasoning. It comprises 7,041 cases annotated by board-certified dermatologists and validated through multiple clinical review rounds. We propose a dual-component framework (SkinCAP + SkinCoT) with a hierarchical, clinically verifiable CoT annotation paradigm, supported by a six-dimensional quality assessment and an iterative refinement pipeline. Built upon Fitzpatrick 17k and Diverse Dermatology Images with targeted expansion, SkinCaRe is publicly released on Hugging Face. Experiments demonstrate substantial improvements in vision-language large models’ lesion description accuracy, diagnostic logical interpretability, and clinical credibility, establishing a new benchmark for healthcare multimodal AI.

📝 Abstract
With the widespread application of artificial intelligence (AI), particularly deep learning (DL) and vision large language models (VLLMs), in skin disease diagnosis, the need for interpretability becomes crucial. However, existing dermatology datasets are limited in their inclusion of concept-level meta-labels, and none offer rich medical descriptions in natural language. This deficiency impedes the advancement of LLM-based methods in dermatologic diagnosis. To address this gap and provide a meticulously annotated dermatology dataset with comprehensive natural language descriptions, we introduce SkinCaRe, a comprehensive multimodal resource that unifies SkinCAP and SkinCoT. SkinCAP comprises 4,000 images sourced from the Fitzpatrick 17k skin disease dataset and the Diverse Dermatology Images dataset, annotated by board-certified dermatologists to provide extensive medical descriptions and captions. In addition, we introduce SkinCoT, a curated dataset pairing 3,041 dermatologic images with clinician-verified, hierarchical chain-of-thought (CoT) diagnoses. Each diagnostic narrative is rigorously evaluated against six quality criteria and iteratively refined until it meets a predefined standard of clinical accuracy and explanatory depth. Together, SkinCAP (captioning) and SkinCoT (reasoning), collectively referred to as SkinCaRe, encompass 7,041 expertly curated dermatologic cases and provide a unified and trustworthy resource for training multimodal models that both describe and explain dermatologic images. SkinCaRe is publicly available at https://huggingface.co/datasets/yuhos16/SkinCaRe.
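
The evaluate-and-refine loop described for SkinCoT narratives can be illustrated with a minimal Python sketch. Everything below is an assumption made for illustration: the abstract does not name the six criteria, the score scale, the acceptance threshold, or who performs the revision, so the criterion names, `threshold`, `max_rounds`, and the `score` and `revise` callables are hypothetical placeholders rather than the authors' pipeline.

```python
from dataclasses import dataclass
from typing import Callable, Dict

# Hypothetical placeholder names: the source states that there are six
# quality criteria but does not list them here.
CRITERIA = tuple(f"criterion_{i}" for i in range(1, 7))

Scores = Dict[str, float]


@dataclass
class RefinementResult:
    narrative: str   # final text after the last revision
    scores: Scores   # per-criterion scores of the final text
    rounds: int      # number of revisions that were applied
    accepted: bool   # True if every criterion reached the threshold


def meets_standard(scores: Scores, threshold: float) -> bool:
    """A narrative passes only if all six criteria reach the threshold."""
    return all(scores[name] >= threshold for name in CRITERIA)


def refine_until_accepted(
    narrative: str,
    score: Callable[[str], Scores],
    revise: Callable[[str, Scores], str],
    threshold: float = 0.8,   # assumed value, not taken from the paper
    max_rounds: int = 5,      # assumed cap so the loop always terminates
) -> RefinementResult:
    """Score a narrative, revise it while it fails, and stop at the cap."""
    scores = score(narrative)
    rounds = 0
    while not meets_standard(scores, threshold) and rounds < max_rounds:
        narrative = revise(narrative, scores)
        scores = score(narrative)
        rounds += 1
    accepted = meets_standard(scores, threshold)
    return RefinementResult(narrative, scores, rounds, accepted)
```

In practice `score` and `revise` would stand in for clinician review or a model-assisted step; the sketch only fixes the control flow, in which a narrative is kept when all criteria pass and is otherwise flagged as not accepted once the round limit is reached.
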
Problem

Research questions and friction points this paper is trying to address.

Addresses lack of dermatology datasets with medical descriptions
Provides annotated skin disease images with expert captions
Introduces chain-of-thought reasoning for diagnostic explanations
Innovation

Methods, ideas, or system contributions that make the work stand out.

Created multimodal dataset with medical captions
Introduced chain-of-thought reasoning for diagnosis
Combined expert-curated images with clinical explanations
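
One way to picture how captioning and reasoning records could sit in a single resource is the following Python sketch. The field names and the `subset` label are hypothetical and are not the schema of the released dataset; only the subset sizes (4,000 and 3,041, totalling 7,041) come from the abstract.

```python
from collections import Counter
from dataclasses import dataclass, field
from typing import Dict, Iterable, List, Optional

# Subset sizes as stated in the abstract.
EXPECTED_COUNTS: Dict[str, int] = {"SkinCAP": 4000, "SkinCoT": 3041}


@dataclass
class SkinCaReRecord:
    """Hypothetical unified record; field names are illustrative only."""
    image_path: str
    subset: str                                  # "SkinCAP" or "SkinCoT"
    caption: Optional[str] = None                # expert description (SkinCAP)
    reasoning_steps: List[str] = field(default_factory=list)  # CoT (SkinCoT)


def subset_counts(records: Iterable[SkinCaReRecord]) -> Dict[str, int]:
    """Count records per subset, e.g. to compare against EXPECTED_COUNTS."""
    return dict(Counter(record.subset for record in records))


def total_cases(counts: Dict[str, int]) -> int:
    """Total number of cases across all subsets."""
    return sum(counts.values())
```

A loader for the real files would need to map the dataset's actual columns onto such a record; comparing `subset_counts` against `EXPECTED_COUNTS` is a simple sanity check after loading.
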
Juexiao Zhou
Assistant Professor, The Chinese University of Hong Kong, Shenzhen
AI for Healthcare, Ethical AI, Bioinformatics, Privacy, AGI
Liyuan Sun
Department of Dermatology, Beijing AnZhen Hospital, Affiliated to Capital Medical University, Beijing 100029, China
Yan Xu
Department of Dermatology, Tianjin Institute of Integrative Dermatology, Tianjin Academy of Traditional Chinese Medicine Affiliated Hospital, China
Wenbin Liu
Department of Dermatology, Beijing Aerospace General Hospital, China
Shawn Afvari
DermAssure, LLC, New York, NY, USA
Zhongyi Han
Professor, Shandong University
Machine Learning, Agentic AI, AI for Science
Jiaoyan Song
Capital Medical University, Beijing 100029, China
Yongzhi Ji
Department of Dermatology, Second Hospital of Jilin University, 218 Ziqiang Street, Changchun 130041, China
Xiaonan He
Emergency Critical Care Center, Beijing AnZhen Hospital, Affiliated to Capital Medical University, Beijing 100029, China
Xin Gao
Computer Science Program, Computer, Electrical and Mathematical Sciences and Engineering Division, King Abdullah University of Science and Technology (KAUST), Thuwal 23955-6900, Kingdom of Saudi Arabia