A Reinforcement Learning Approach to Synthetic Data Generation

📅 2025-12-24
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
To address the challenges of strong training-data dependency and the privacy–utility trade-off in synthetic data generation (SDG) for few-shot biomedical applications, this paper pioneers a reinforcement learning (RL) formulation of SDG. We propose a stable training paradigm leveraging policy-gradient optimization with discriminator-based reward feedback, employing proximal policy optimization (PPO) to directly optimize stochastic generation policies—eliminating the need for large-scale pretraining or intricate loss design. Evaluated on the AI-READI few-shot benchmark, our method significantly outperforms both GANs and diffusion models; on MIMIC-IV, it achieves utility comparable to diffusion models and superior to GANs, while attaining optimal balance across privacy preservation, data utility, and fidelity. Our core contribution is establishing RL as a novel, principled paradigm for SDG and empirically demonstrating its effectiveness and robustness in resource-constrained clinical settings.

Technology Category

Application Category

📝 Abstract
Synthetic data generation (SDG) is a promising approach for enabling data sharing in biomedical studies while preserving patient privacy. Yet, state-of-the-art generative models often require large datasets and complex training procedures, limiting their applicability in small-sample settings. In this work, we reframe SDG as a reinforcement learning (RL) problem and introduce RLSyn, a novel framework that models the data generator as a stochastic policy over patient records and optimizes it using Proximal Policy Optimization with discriminator-derived rewards, yielding more stable and data-efficient training. We evaluate RLSyn on two biomedical datasets - AI-READI and MIMIC-IV- and benchmark it against state-of-the-art generative adversarial networks (GANs) and diffusion-based methods across extensive privacy, utility, and fidelity evaluations. RL-Syn performs comparably to diffusion models and outperforms GANs on MIMIC-IV, while outperforming both diffusion models and GANs on the smaller AI-READI dataset. These results demonstrate that reinforcement learning provides a principled and effective alternative for synthetic biomedical data generation, particularly in data-scarce regimes.
Problem

Research questions and friction points this paper is trying to address.

Develops a reinforcement learning framework for synthetic biomedical data generation
Addresses limitations of existing models in small-sample settings
Evaluates privacy, utility, and fidelity against GANs and diffusion models
Innovation

Methods, ideas, or system contributions that make the work stand out.

Reinforcement learning models data generator as policy
Proximal Policy Optimization with discriminator rewards for training
Framework yields stable data-efficient training for small datasets
🔎 Similar Papers
No similar papers found.
N
Natalia Espinosa-Dice
Department of Computer Science, Princeton University, Princeton, NJ, USA
N
Nicholas J. Jackson
Department of Biomedical Informatics, Vanderbilt University Medical Center, Nashville, TN, USA
Chao Yan
Chao Yan
Instructor at DBMI, VUMC; CS PhD from Vanderbilt U
AI for medicineSynthetic health dataPrivacyFairness
Aaron Lee
Aaron Lee
Assistant Director Health and Imaging Informatics
Computational ChemistryArtificial IntelligenceMedical Imaging
B
Bradley A. Malin
Department of Biomedical Informatics, Vanderbilt University Medical Center, Nashville, TN, USA; Department of Biostatistics, Vanderbilt University Medical Center, Nashville, TN, USA; Department of Computer Science, Vanderbilt University, Nashville, TN, USA