End-to-End Framework Integrating Generative AI and Deep Reinforcement Learning for Autonomous Ultrasound Scanning

📅 2025-10-30
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Cardiac ultrasound diagnosis suffers from operator dependence, geographic disparities in resource availability, and human-induced variability—necessitating reproducible, scalable automation. This paper proposes the first end-to-end autonomous scanning framework integrating generative AI with deep reinforcement learning (DRL). Specifically, we co-train a conditional generative adversarial network (cGAN) coupled with a variational autoencoder (VAE) to synthesize high-fidelity simulated ultrasound images, jointly optimized with a DRL policy for closed-loop, real-time scanning path planning. An integrated image quality assessment and classification module ensures output consistency and diagnostic relevance. We further release the first publicly available, annotated real-world cardiac ultrasound dataset. Experiments demonstrate robust generation of high-quality scanning trajectories across diverse configurations, substantially reducing reliance on expert knowledge. The framework exhibits strong cross-organ generalizability, reproducibility, and clinical deployability.

Technology Category

Application Category

📝 Abstract
Cardiac ultrasound (US) is among the most widely used diagnostic tools in cardiology for assessing heart health, but its effectiveness is limited by operator dependence, time constraints, and human error. The shortage of trained professionals, especially in remote areas, further restricts access. These issues underscore the need for automated solutions that can ensure consistent, and accessible cardiac imaging regardless of operator skill or location. Recent progress in artificial intelligence (AI), especially in deep reinforcement learning (DRL), has gained attention for enabling autonomous decision-making. However, existing DRL-based approaches to cardiac US scanning lack reproducibility, rely on proprietary data, and use simplified models. Motivated by these gaps, we present the first end-to-end framework that integrates generative AI and DRL to enable autonomous and reproducible cardiac US scanning. The framework comprises two components: (i) a conditional generative simulator combining Generative Adversarial Networks (GANs) with Variational Autoencoders (VAEs), that models the cardiac US environment producing realistic action-conditioned images; and (ii) a DRL module that leverages this simulator to learn autonomous, accurate scanning policies. The proposed framework delivers AI-driven guidance through expert-validated models that classify image type and assess quality, supports conditional generation of realistic US images, and establishes a reproducible foundation extendable to other organs. To ensure reproducibility, a publicly available dataset of real cardiac US scans is released. The solution is validated through several experiments. The VAE-GAN is benchmarked against existing GAN variants, with performance assessed using qualitative and quantitative approaches, while the DRL-based scanning system is evaluated under varying configurations to demonstrate effectiveness.
Problem

Research questions and friction points this paper is trying to address.

Automating cardiac ultrasound scanning to reduce operator dependence
Integrating generative AI with deep reinforcement learning for reproducibility
Developing accessible autonomous imaging solutions for healthcare disparities
Innovation

Methods, ideas, or system contributions that make the work stand out.

Combines generative AI with deep reinforcement learning
Uses conditional GAN-VAE simulator for realistic images
Learns autonomous scanning policies through DRL module
🔎 Similar Papers
No similar papers found.
H
Hanae Elmekki
Concordia Institute for Information Systems Engineering, Concordia University, Montreal, Canada
A
Amanda Spilkin
Department of Mechanical, Industrial and Aerospace Engineering, Concordia University, Montreal, Canada
E
Ehsan Zakeri
Department of Mechanical, Industrial and Aerospace Engineering, Concordia University, Montreal, Canada
A
Antonela Mariel Zanuttini
Department of Medicine, Laval University, Quebec, Canada
Ahmed Alagha
Ahmed Alagha
Postdoctoral Fellow, FRQNT Scholar
Deep Reinforcement LearningImitation LearningRoboticsCrowdsourcing
H
Hani Sami
Department of Electrical, Computer, and Software Engineering, Ontario Tech University, Oshawa, Canada
Jamal Bentahar
Jamal Bentahar
Concordia University
Deep Reinforcement LearningFederated LearningMulti-Agent SystemsVerificationServices Computing and IoT
Lyes Kadem
Lyes Kadem
Concordia University
Cardiovascular Fluid Dynamics
W
Wen-Fang Xie
Department of Mechanical, Industrial and Aerospace Engineering, Concordia University, Montreal, Canada
Philippe Pibarot
Philippe Pibarot
Department of Medicine, Laval University, Quebec, Canada
Rabeb Mizouni
Rabeb Mizouni
Khalifa University
CrowdsensingMobile ComputingAISW engineering
Hadi Otrok
Hadi Otrok
Chair and Professor Computer Science, Khalifa University
Network & Computer SecurityBlockchain & Game TheoryReinforcement Learning
Azzam Mourad
Azzam Mourad
Lebanese American University - Khalifa University
CybersecurityFederated Machine LearningNetwork and Service ManagementApplied AIIoT and Fog
Sami Muhaidat
Sami Muhaidat
Professor, Khalifa University; Adjunct Professor, Carleton University
Wireless CommunicationsMachine LearningOptical Wireless CommunicationV2V