MCGM: Multi-stage Clustered Global Modeling for Long-range Interactions in Molecules

📅 2025-09-26
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Geometric graph neural networks (GNNs) struggle to capture long-range molecular interactions due to their inherent local message-passing mechanism. To address this, we propose a multi-stage dynamic hierarchical clustering framework that constructs multi-resolution atomic clusters, enabling lightweight global contextual modeling and iterative feature refinement. Our approach is the first to jointly integrate learnable clustering, residual connections, and geometric GNNs—without requiring enlarged cutoff radii, intricate physics-based kernels, or Fourier basis functions—ensuring plug-and-play compatibility. On the OE62 dataset, our method reduces energy prediction error by 26.2%; on AQM, it achieves state-of-the-art accuracy of 17.0 meV (energy) and 4.9 meV/Å (force), while reducing model parameters by 20%.

Technology Category

Application Category

📝 Abstract
Geometric graph neural networks (GNNs) excel at capturing molecular geometry, yet their locality-biased message passing hampers the modeling of long-range interactions. Current solutions have fundamental limitations: extending cutoff radii causes computational costs to scale cubically with distance; physics-inspired kernels (e.g., Coulomb, dispersion) are often system-specific and lack generality; Fourier-space methods require careful tuning of multiple parameters (e.g., mesh size, k-space cutoff) with added computational overhead. We introduce Multi-stage Clustered Global Modeling (MCGM), a lightweight, plug-and-play module that endows geometric GNNs with hierarchical global context through efficient clustering operations. MCGM builds a multi-resolution hierarchy of atomic clusters, distills global information via dynamic hierarchical clustering, and propagates this context back through learned transformations, ultimately reinforcing atomic features via residual connections. Seamlessly integrated into four diverse backbone architectures, MCGM reduces OE62 energy prediction error by an average of 26.2%. On AQM, MCGM achieves state-of-the-art accuracy (17.0 meV for energy, 4.9 meV/Å for forces) while using 20% fewer parameters than Neural P3M. Code will be made available upon acceptance.
Problem

Research questions and friction points this paper is trying to address.

Modeling long-range molecular interactions with geometric graph neural networks
Overcoming computational cost and parameter tuning limitations in existing methods
Enhancing molecular property prediction accuracy through hierarchical global context
Innovation

Methods, ideas, or system contributions that make the work stand out.

Hierarchical atomic clustering for global context
Dynamic clustering distills multi-resolution molecular information
Plug-and-play module enhances geometric GNNs via residuals
🔎 Similar Papers
No similar papers found.
H
Haodong Pan
State Key Laboratory of Human-Machine Hybrid Augmented Intelligence, National Engineering Research Center for Visual Information and Applications, and Institute of Artificial Intelligence and Robotics, Xi’an Jiaotong University, Xi’an, 710049, Shanxi, China
Yusong Wang
Yusong Wang
Tokyo Institute of Technology
Representation LearningAffective Computing
Nanning Zheng
Nanning Zheng
Xi'an Jiaotong University
C
Caijui Jiang
State Key Laboratory of Human-Machine Hybrid Augmented Intelligence, National Engineering Research Center for Visual Information and Applications, and Institute of Artificial Intelligence and Robotics, Xi’an Jiaotong University, Xi’an, 710049, Shanxi, China