Fine-tuning Pocket-Aware Diffusion Models via Denoising Policy Optimization

📅 2026-05-17
📈 Citations: 0
Influential: 0
📄 PDF

career value

212K/year
🤖 AI Summary
Existing pocket-aware 3D molecular generation methods struggle to simultaneously optimize multiple objectives, including binding affinity, drug-likeness, synthesizability, and diversity. This work addresses this challenge by formulating the denoising diffusion process as a multi-step Markov decision process and integrating reinforcement learning with a multi-objective reward mechanism. The authors propose a coarse-grained denoising scheduling strategy that enables fine-grained control over molecular properties while preserving generation efficiency. Evaluated on the CrossDocked2020 benchmark, the proposed method significantly outperforms current baselines, achieving a Vina docking score of −8.5 kcal/mol and demonstrating superior performance in drug-likeness, molecular diversity, and synthesizability.
📝 Abstract
Structure-based drug design has been accelerated by pocket-aware 3D generative models, yet most methods primarily fit the training distribution and may fall short of satisfying multiple properties required in real-world therapeutic drug discovery. Recently, increasing attention has focused on structure-based molecule optimization (SBMO), which targets fine-grained control over multiple specified molecular properties. In this paper, we present DEPPA, a novel SBMO approach building upon Denoising Diffusion Policy Optimization for fine-tuning a pre-trained pocket-aware diffusion model via reinforcement learning. DEPPA enables optimization over multiple properties, including binding affinity, drug-likeness, synthesizability and diversity. We formulate the reverse denoising process of the pretrained pocket-aware diffusion model as a multi-step Markov Decision Process, where the desired properties that serve as reward signals are evaluated on the final generated ligand molecules. DEPPA incorporates a coarse denoising scheduler during the RL fine-tuning to achieve efficient and effective molecule optimization. Experimental results on the CrossDocked2020 benchmark demonstrate that DEPPA outperforms baselines in binding affinity (Vina Score -8.5 kcal/mol), drug-likeness and diversity while exhibiting competitive performance in synthesizability. The source code is available at https://github.com/xy9485/DePPA .
Problem

Research questions and friction points this paper is trying to address.

structure-based drug design
pocket-aware generation
molecule optimization
multi-property optimization
binding affinity
Innovation

Methods, ideas, or system contributions that make the work stand out.

pocket-aware diffusion
reinforcement learning
structure-based molecule optimization
denoising policy optimization
multi-property optimization
🔎 Similar Papers
No similar papers found.