DermAgent: A Self-Reflective Agentic System for Dermatological Image Analysis with Multi-Tool Reasoning and Traceable Decision-Making

📅 2026-05-14
📈 Citations: 0
Influential: 0
📄 PDF

career value

177K/year
🤖 AI Summary
Current multimodal large language models face limitations in dermatological image analysis due to insufficient domain knowledge and susceptibility to hallucinations, hindering reliable and traceable diagnoses. This work proposes the first multi-tool collaborative agent system equipped with a planning–execution–reflection mechanism, integrating visual perception, bimodal evidence retrieval—leveraging a corpus of 413,210 images and 3,199 clinical guidelines—and a certainty-aware critique module that evaluates confidence, coverage, and conflict detection. The resulting framework enables fine-grained, interpretable, and hallucination-resistant decision-making. Evaluated across five dermatology benchmarks, the method substantially outperforms existing models, achieving a 17.6% improvement in diagnostic accuracy over GPT-4o and a 3.15% gain in ROUGE-L score.
📝 Abstract
Dermatological diagnosis requires integrating fine-grained visual perception with expert clinical knowledge. Although Multimodal Large Language Models (MLLMs) facilitate interactive medical image analysis, their application in dermatology is hindered by insufficient domain-specific grounding and hallucinations. To address these issues, we propose DermAgent, a collaborative multi-tool agent that orchestrates seven specialized vision and language modules within a Plan-Execute-Reflect framework. DermAgent delivers stepwise, traceable diagnostic reasoning through three core components. First, it employs complementary visual perception tools for comprehensive morphological description, dermoscopic concept annotation, and disease diagnosis. Second, to overcome the lack of domain prior, a dual-modality retrieval module anchors every prediction in external evidence by cross-referencing 413,210 diagnosed image cases and 3,199 clinical guideline chunks. To further mitigate hallucinations, a deterministic critic module conducts strict post-hoc auditing via confidence, coverage, and conflict gates, automatically detecting inter-source disagreements to trigger targeted self-correction. Extensive experiments on five dermatology benchmarks demonstrate that DermAgent consistently outperforms state-of-the-art MLLMs and medical agent baselines across zero-shot fine-grained disease diagnosis, concept annotation, and clinical captioning tasks, exceeding GPT-4o by 17.6% in skin disease diagnostic accuracy and 3.15% in captioning ROUGE-L. Our code is available at https://github.com/YizeezLiu/DermAgent.
Problem

Research questions and friction points this paper is trying to address.

dermatological diagnosis
multimodal large language models
hallucination
domain-specific grounding
medical image analysis
Innovation

Methods, ideas, or system contributions that make the work stand out.

self-reflective agentic system
multi-tool reasoning
traceable decision-making
dual-modality retrieval
deterministic critic
Y
Yize Liu
AIM for Health Lab, Faculty of Information Technology, Monash University, Melbourne, Australia; Faculty of Information Technology, Monash University, Melbourne, Australia
Siyuan Yan
Siyuan Yan
Research Fellow@Monash University
AI for MedicineFoundation Model
Ming Hu
Ming Hu
Monash University | Shanghai AI Laboratory
Lie Ju
Lie Ju
University College London; Moorfields Eye Hospital; Monash University
Computer VisionMedical Image AnalysisOphthalmology
X
Xieji Li
AIM for Health Lab, Faculty of Information Technology, Monash University, Melbourne, Australia; Faculty of Information Technology, Monash University, Melbourne, Australia
Feilong Tang
Feilong Tang
Monash University
Computer VisionFoundation ModelMedical Image Analysis
W
Wei Feng
AIM for Health Lab, Faculty of Information Technology, Monash University, Melbourne, Australia; Faculty of Information Technology, Monash University, Melbourne, Australia
Z
Zongyuan Ge
AIM for Health Lab, Faculty of Information Technology, Monash University, Melbourne, Australia; Faculty of Information Technology, Monash University, Melbourne, Australia