OrgFlow: Generative Modeling of Organic Crystal Structures from Molecular Graphs

📅 2026-02-22
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Organic crystal structure prediction has long suffered from the absence of efficient data-driven approaches due to challenges such as large unit cells and strict chemical bonding constraints. This work addresses this gap by introducing flow matching to the field for the first time, proposing a molecular graph-based generative model that explicitly incorporates periodic boundary conditions and crystallographic symmetries. To ensure chemically realistic local environments, the authors design a bond-aware loss function that enforces correct atomic connectivity. A dedicated organic crystal dataset and an efficient preprocessing pipeline are also developed to support model training. Experimental results demonstrate that the proposed method achieves over tenfold improvement in matching rate compared to existing baselines while requiring fewer sampling steps, substantially enhancing both the efficiency and accuracy of crystal structure prediction.

Technology Category

Application Category

📝 Abstract
Crystal structure prediction is a long-standing challenge in materials science, with most data-driven methods developed for inorganic systems. This leaves an important gap for organic crystals, which are central to pharmaceuticals, polymers, and functional materials, but present unique challenges, such as larger unit cells and strict chemical connectivity. We introduce a flow-matching model for predicting organic crystal structures directly from molecular graphs. The architecture integrates molecular connectivity with periodic boundary conditions while preserving the symmetries of crystalline systems. A bond-aware loss guides the model toward realistic local chemistry by enforcing distributions of bond lengths and connectivity. To support reliable and efficient training, we built a curated dataset of organic crystals, along with a preprocessing pipeline that precomputes bonds and edges, substantially reducing computational overhead during both training and inference. Experiments show that our method achieves a Match Rate more than 10 times higher than existing baselines while requiring fewer sampling steps for inference. These results establish generative modeling as a practical and scalable framework for organic crystal structure prediction.
Problem

Research questions and friction points this paper is trying to address.

organic crystal structure prediction
molecular graphs
crystal structure prediction
generative modeling
chemical connectivity
Innovation

Methods, ideas, or system contributions that make the work stand out.

flow-matching
organic crystal structure prediction
molecular graph
bond-aware loss
generative modeling
🔎 Similar Papers
No similar papers found.
M
Mohammadmahdi Vahediahmar
Department of Computer Science, Drexel University
M
Matthew A. McDonald
Department of Chemical and Biological Engineering, Drexel University
Feng Liu
Feng Liu
Assistant Professor, Drexel University
Computer VisionPattern RecognitionMachine LearningBiometrics