CardAIc-Agents: A Multimodal Framework with Hierarchical Adaptation for Cardiac Care Support

📅 2025-08-18
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Cardiovascular diseases impose a substantial global health burden, yet early intervention is hindered by healthcare resource shortages and the limited clinical adaptability of existing AI agents. To address this, we propose a multimodal adaptive intelligent agent framework tailored for cardiac diagnosis and treatment. Methodologically, it features: (1) a hierarchical dynamic decision-making architecture enabling personalized pathway planning based on real-time detection outputs; (2) integrated CardiacRAG—a retrieval-augmented knowledge base—complemented by multidisciplinary discussion tools and an interactive visualization review panel to support clinical-feedback-driven continual optimization; and (3) synergistic fusion of vision-language models, external tool invocation, multi-agent collaboration, and on-demand multimodal generation—including medical image synthesis. Extensive experiments across three clinical datasets demonstrate significant improvements in task execution efficiency and clinical alignment, outperforming state-of-the-art vision-language models, advanced agent systems, and fine-tuned baseline models.

Technology Category

Application Category

📝 Abstract
Cardiovascular diseases (CVDs) remain the foremost cause of mortality worldwide, a burden worsened by a severe deficit of healthcare workers. Artificial intelligence (AI) agents have shown potential to alleviate this gap via automated early detection and proactive screening, yet their clinical application remains limited by: 1) prompt-based clinical role assignment that relies on intrinsic model capabilities without domain-specific tool support; or 2) rigid sequential workflows, whereas clinical care often requires adaptive reasoning that orders specific tests and, based on their results, guides personalised next steps; 3) general and static knowledge bases without continuous learning capability; and 4) fixed unimodal or bimodal inputs and lack of on-demand visual outputs when further clarification is needed. In response, a multimodal framework, CardAIc-Agents, was proposed to augment models with external tools and adaptively support diverse cardiac tasks. Specifically, a CardiacRAG agent generated general plans from updatable cardiac knowledge, while the chief agent integrated tools to autonomously execute these plans and deliver decisions. To enable adaptive and case-specific customization, a stepwise update strategy was proposed to dynamically refine plans based on preceding execution results, once the task was assessed as complex. In addition, a multidisciplinary discussion tool was introduced to interpret challenging cases, thereby supporting further adaptation. When clinicians raised concerns, visual review panels were provided to assist final validation. Experiments across three datasets showed the efficiency of CardAIc-Agents compared to mainstream Vision-Language Models (VLMs), state-of-the-art agentic systems, and fine-tuned VLMs.
Problem

Research questions and friction points this paper is trying to address.

Addressing healthcare worker shortage via AI for cardiac care
Overcoming limitations in clinical AI role assignment and workflows
Enhancing adaptive reasoning and multimodal capabilities in diagnosis
Innovation

Methods, ideas, or system contributions that make the work stand out.

Multimodal framework integrating external tools
Hierarchical agents with adaptive stepwise updates
Dynamic visual review panels for validation
🔎 Similar Papers
No similar papers found.
Yuting Zhang
Yuting Zhang
HKUST(GZ)
rPPGComputer Vision
K
Karina V. Bunting
Department of Cardiovascular Sciences, University of Birmingham, Birmingham, UK
A
Asgher Champsi
Department of Cardiovascular Sciences, University of Birmingham, Birmingham, UK
X
Xiaoxia Wang
Department of Cardiovascular Sciences, University of Birmingham, Birmingham, UK; NIHR Birmingham Biomedical Research Centre and West Midlands NHS Secure Data Environment, University Hospitals Birmingham NHS Foundation Trust, Birmingham, UK
Wenqi Lu
Wenqi Lu
Manchester Metropolitan University
medical image analysiscomputational pathologydeep learninginverse problem
A
Alexander Thorley
School of Computer Science, University of Birmingham, Birmingham, UK
S
Sandeep S Hothi
Department of Cardiology, Heart and Lung Centre, Royal Wolverhampton NHS Trust, Wolverhampton, UK
Z
Zhaowen Qiu
College of Computer and Control Engineering, Northeast Forestry University, Harbin, China
Dipak Kotecha
Dipak Kotecha
Professor of Cardiology
cardiology
J
Jinming Duan
School of Computer Science, University of Birmingham, Birmingham, UK; Division of Informatics, Imaging and Data Sciences, University of Manchester, Manchester, UK