DiffuSETS: 12-lead ECG Generation Conditioned on Clinical Text Reports and Patient-Specific Information

📅 2025-01-10
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
To address the scarcity and privacy sensitivity of electrocardiogram (ECG) data, as well as weak semantic alignment and unsystematic evaluation in existing generative methods, this work proposes the first diffusion-based framework for generating 12-lead ECGs conditioned jointly on clinical text reports and structured patient metadata. We introduce a novel cross-modal conditional diffusion architecture that integrates BERT-encoded textual descriptions with patient-specific feature embeddings, and incorporate a multi-scale temporal reconstruction loss to enhance signal fidelity and medical plausibility. Furthermore, we establish the first standardized benchmark for ECG generation, evaluating models across three dimensions: signal quality, semantic consistency, and clinical interpretability. Experiments demonstrate significant improvements over state-of-the-art methods across multiple quantitative metrics; expert blind evaluation confirms the clinical validity of generated ECGs; and the framework successfully enables new applications—including data augmentation, medical education case generation, and exploratory analysis of abnormal patterns.

Technology Category

Application Category

📝 Abstract
Heart disease remains a significant threat to human health. As a non-invasive diagnostic tool, the electrocardiogram (ECG) is one of the most widely used methods for cardiac screening. However, the scarcity of high-quality ECG data, driven by privacy concerns and limited medical resources, creates a pressing need for effective ECG signal generation. Existing approaches for generating ECG signals typically rely on small training datasets, lack comprehensive evaluation frameworks, and overlook potential applications beyond data augmentation. To address these challenges, we propose DiffuSETS, a novel framework capable of generating ECG signals with high semantic alignment and fidelity. DiffuSETS accepts various modalities of clinical text reports and patient-specific information as inputs, enabling the creation of clinically meaningful ECG signals. Additionally, to address the lack of standardized evaluation in ECG generation, we introduce a comprehensive benchmarking methodology to assess the effectiveness of generative models in this domain. Our model achieve excellent results in tests, proving its superiority in the task of ECG generation. Furthermore, we showcase its potential to mitigate data scarcity while exploring novel applications in cardiology education and medical knowledge discovery, highlighting the broader impact of our work.
Problem

Research questions and friction points this paper is trying to address.

ECG signal generation
data privacy
medical resource constraints
Innovation

Methods, ideas, or system contributions that make the work stand out.

DiffuSETS
ECG signal generation
medical data augmentation
🔎 Similar Papers
No similar papers found.
Yongfan Lai
Yongfan Lai
Peking University
Jiabo Chen
Jiabo Chen
Nankai University
Computer VisionContinual LearningHealth Data ScienceAI for Healthcare
D
Deyun Zhang
HeartVoice Medical Technology, Hefei, China
Y
Yue Wang
HeartVoice Medical Technology, Hefei, China
Shijia Geng
Shijia Geng
University of Miami
Signal ProcessingArtificial IntelligenceMachine LearningNeural NetworkBrain Machine Interface
H
Hongyan Li
State Key Laboratory of General Artificial Intelligence, Beijing, China; School of Intelligence Science and Technology, Peking University, Beijing, China
Shenda Hong
Shenda Hong
Assistant Professor, Peking University
AI ECGBiosignalAI for Digital HealthHealth Data ScienceAI for Healthcare