🤖 AI Summary
This work addresses the challenge of accurately predicting physicochemical properties of chemical mixtures, a task hindered by the inability of existing methods to jointly model intra-molecular interactions and the dynamic influence of mixture composition—such as concentration and ratios—on multi-scale structural features. To overcome this limitation, we propose ChemFlow, a hierarchical graph neural network framework that integrates concentration and compositional information through an atom-level feature fusion module (Chem-embed) and a bidirectional attention mechanism spanning atomic, functional group, and molecular levels. This enables context-aware, multi-scale representation learning tailored to complex mixtures. Experimental results demonstrate that ChemFlow significantly outperforms state-of-the-art models in both concentration-sensitive and concentration-agnostic systems, achieving notable advances in prediction accuracy and computational efficiency.
📝 Abstract
Accurate prediction of the physicochemical properties of molecular mixtures using graph neural networks remains a significant challenge, as it requires simultaneous embedding of intramolecular interactions while accounting for mixture composition (i.e., concentrations and ratios). Existing approaches are ill-equipped to emulate realistic mixture environments, where densely coupled interactions propagate across hierarchical levels - from atoms and functional groups to entire molecules - and where cross-level information exchange is continuously modulated by composition. To bridge the gap between isolated molecules and realistic chemical environments, we present ChemFlow, a novel hierarchical framework that integrates atomic, functional group, and molecular-level features, facilitating information flow across these levels to predict the behavior of complex chemical mixtures. ChemFlow employs an atomic-level feature fusion module, Chem-embed, to generate context-aware atomic representations influenced by the mixture state and atomic characteristics. Next, bidirectional group-to-molecule and molecule-to-group attention mechanisms enable ChemFlow to capture functional group interactions both within and across molecules in the mixture. By dynamically adjusting representations based on concentration and composition, ChemFlow excels at predicting concentration-dependent properties and significantly outperforms state-of-the-art models in both concentration-sensitive and concentration-independent systems. Extensive experiments demonstrate ChemFlow's superior accuracy and efficiency in modeling complex chemical mixtures.