🤖 AI Summary
Existing image generation models often produce implausible virtual cells that violate fundamental physical and biological constraints. To address this limitation, this work proposes CellFluxRL—a reinforcement learning–based post-training framework that introduces, for the first time, a seven-dimensional biological reward mechanism encompassing functional fidelity, structural validity, and morphological correctness to steer the generation process from mere visual realism toward genuine biological plausibility. Experimental results demonstrate that CellFluxRL significantly outperforms the original CellFlux model across all metrics of biological reasonableness, with further performance gains achieved through test-time scaling.
📝 Abstract
Building virtual cells with generative models to simulate cellular behavior in silico is emerging as a promising paradigm for accelerating drug discovery. However, prior image-based generative approaches can produce implausible cell images that violate basic physical and biological constraints. To address this, we propose to post-train virtual cell models with reinforcement learning (RL), leveraging biologically meaningful evaluators as reward functions. We design seven rewards spanning three categories-biological function, structural validity, and morphological correctness-and optimize the state-of-the-art CellFlux model to yield CellFluxRL. CellFluxRL consistently improves over CellFlux across all rewards, with further performance boosts from test-time scaling. Overall, our results present a virtual cell modeling framework that enforces physically-based constraints through RL, advancing beyond "visually realistic" generations towards "biologically meaningful" ones.