Genetic-guided GFlowNets for Sample Efficient Molecular Optimization

📅 2024-02-05
📈 Citations: 1
Influential: 0
📄 PDF
🤖 AI Summary
Molecular optimization in drug and materials discovery suffers from expensive reward function evaluations and low sample efficiency. Method: We propose a novel domain-knowledge-enhanced GFlowNets framework that distills prior knowledge from genetic algorithms—specifically selection, crossover, and mutation operators—into the GFlowNets generative policy. This yields an off-policy, amortized graph-structured molecular search framework, integrating graph neural networks with flow matching-based training. Contribution/Results: Our method achieves state-of-the-art performance on standard molecular optimization benchmarks. In SARS-CoV-2 inhibitor design, it reduces reward function calls by over 50%, significantly improving optimization efficiency and computational scalability. Crucially, it successfully embeds the inductive biases of classical evolutionary algorithms into a deep generative model, unifying high sample efficiency with strong generalization capability.

Technology Category

Application Category

📝 Abstract
The challenge of discovering new molecules with desired properties is crucial in domains like drug discovery and material design. Recent advances in deep learning-based generative methods have shown promise but face the issue of sample efficiency due to the computational expense of evaluating the reward function. This paper proposes a novel algorithm for sample-efficient molecular optimization by distilling a powerful genetic algorithm into deep generative policy using GFlowNets training, the off-policy method for amortized inference. This approach enables the deep generative policy to learn from domain knowledge, which has been explicitly integrated into the genetic algorithm. Our method achieves state-of-the-art performance in the official molecular optimization benchmark, significantly outperforming previous methods. It also demonstrates effectiveness in designing inhibitors against SARS-CoV-2 with substantially fewer reward calls.
Problem

Research questions and friction points this paper is trying to address.

Molecular Optimization
Computational Efficiency
Drug Discovery
Innovation

Methods, ideas, or system contributions that make the work stand out.

GFlowNets
Genetic Algorithm
Deep Learning
🔎 Similar Papers
H
Hyeon-Seob Kim
Korea Advanced Institute of Science and Technology (KAIST)
M
Minsu Kim
Korea Advanced Institute of Science and Technology (KAIST)
Sanghyeok Choi
Sanghyeok Choi
University of Edinburgh
Deep Learning
Jinkyoo Park
Jinkyoo Park
Department of Industrial and Systems Engineering, KAIST
Machine LearningGame TheoryOptimal Control