Hybrid Machine Learning for Enhanced Prediction of Diffusion Coefficients in Liquids

πŸ“… 2026-03-03
πŸ“ˆ Citations: 0
✨ Influential: 0
πŸ“„ PDF
πŸ€– AI Summary
Experimental data on diffusion coefficients in liquids remain scarce, creating an urgent need for highly accurate and physically consistent prediction methods. This work proposes ESE, the first hybrid model that rigorously adheres to fundamental physical constraints by integrating the Stokes-Einstein equation with machine learning. Requiring only molecular SMILES strings as input, ESE accurately predicts infinite-dilution diffusion coefficients of solutes in pure solvents. The model demonstrates high accuracy across a wide temperature range and diverse chemical systems, significantly outperforming the current state-of-the-art method, SEGWE, on a large-scale literature dataset. ESE combines broad applicability with open-source availability, ensuring reproducibility and facilitating further research in molecular transport properties.

Technology Category

Application Category

πŸ“ Abstract
Diffusion coefficients are key thermophysical properties for modeling mass transport in liquids, but experimental data are scarce, making reliable prediction methods indispensable. In the present work, we introduce a new method for predicting diffusion coefficients of molecular components at infinite dilution in pure liquid solvents by integrating the Stokes-Einstein (SE) equation with machine learning (ML). Unlike previous ML approaches, the resulting hybrid Enhanced Stokes-Einstein (ESE) model provides strictly physically consistent predictions for diffusion coefficients as a function of temperature across a broad range of binary mixtures. Trained and validated using an extensive compilation of literature data for infinite-dilution diffusion coefficients in binary liquid systems, ESE achieves significantly higher prediction accuracies than the previous state-of-the-art model, SEGWE, while requiring only the SMILES strings encoding of the molecular formulae of the components of interest as additional inputs, which are always available. This simplicity makes ESE broadly applicable, e.g., for process design and optimization. The ESE model and its source code are fully disclosed and are directly accessible via an interactive web interface at https://ml-prop.mv.rptu.de/.
Problem

Research questions and friction points this paper is trying to address.

diffusion coefficients
liquids
prediction
machine learning
thermophysical properties
Innovation

Methods, ideas, or system contributions that make the work stand out.

Hybrid Machine Learning
Stokes-Einstein Equation
Diffusion Coefficients
Infinite Dilution
SMILES Representation
J
Jens Wagner
Laboratory of Engineering Thermodynamics (LTD), RPTU Kaiserslautern, Germany
Z
Zeno Romero
Laboratory of Engineering Thermodynamics (LTD), RPTU Kaiserslautern, Germany
K
Kerstin MΓΌnnemann
Laboratory of Engineering Thermodynamics (LTD), RPTU Kaiserslautern, Germany
Sebastian Schmitt
Sebastian Schmitt
Honda Research Institute Europe GmbH
Real-world machine learning applicationsanomaly detectionoptimizationquantum physicsquantum computing
T
Thomas Specht
Laboratory of Engineering Thermodynamics (LTD), RPTU Kaiserslautern, Germany
Hans Hasse
Hans Hasse
University of Kaiserslautern
Chemical Engineering
Fabian Jirasek
Fabian Jirasek
Laboratory of Engineering Themodynamics (LTD), RPTU Kaiserslautern
Chemical EngineeringBioprocess EngineeringThermodynamicsMachine Learning