QT-Net: Rethinking Evaluation of AI Models in Atomic Chemical Space

📅 2026-05-11

🤖 AI Summary
This work addresses the lack of a principled out-of-distribution (OOD) evaluation protocol at the atomic level, which complicates assessing how well machine learning models generalize when predicting atomic properties such as partial charges and multipole moments. The authors propose a held-out evaluation scheme that clusters atomic environments by SOAP descriptors and scores models only on clusters unseen during training. Using this protocol, they compare E(3)-equivariant models against non-equivariant, rotationally augmented ones and introduce QT-Net, a rotationally augmented, non-equivariant graph neural network that predicts quantum topological atom (QTA) properties, namely electron populations and multipole moments, for H, C, N, and O atoms. QT-Net can infer these properties for QM9 molecules outside its training set, molecular dipole moments computed from its per-atom outputs recover the QM9 reference values, and the inferred properties can improve downstream molecular property prediction when used as input features.
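The held-out-cluster scoring described above can be sketched as follows. This is a minimal illustration rather than the authors' released code: cluster labels are assumed to be precomputed (for example by clustering SOAP descriptors, a step not shown here), and the function name `held_out_cluster_mae`, the choice of mean absolute error, and the toy values are all hypothetical.

```python
from statistics import mean


def held_out_cluster_mae(y_true, y_pred, cluster_ids, train_clusters):
    """Mean absolute error over atoms whose cluster label was not seen in training.

    y_true, y_pred : per-atom target and predicted values (floats)
    cluster_ids    : per-atom cluster label (assumed precomputed)
    train_clusters : cluster labels present in the training set
    """
    seen = set(train_clusters)
    # Keep only atoms that fall in clusters absent from training.
    errors = [
        abs(t - p)
        for t, p, c in zip(y_true, y_pred, cluster_ids)
        if c not in seen
    ]
    if not errors:
        raise ValueError("no atoms belong to unseen clusters")
    return mean(errors)


# Toy data (made up): four atoms, clusters 0 and 1 seen in training, cluster 2 held out.
y_true = [0.10, -0.20, 0.30, 0.05]
y_pred = [0.12, -0.25, 0.20, 0.05]
cluster_ids = [0, 1, 2, 2]
train_clusters = [0, 1]

ood_mae = held_out_cluster_mae(y_true, y_pred, cluster_ids, train_clusters)
print(f"held-out-cluster MAE: {ood_mae:.3f}")
```

Only the last two atoms contribute to the score here, because they are the only ones assigned to a cluster that never appears in `train_clusters`.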
📝 Abstract
Atomic properties such as partial charges or multipoles encode chemically meaningful information that can inform downstream molecular property prediction, but their evaluation as machine learning targets has been complicated by the absence of a principled out-of-distribution evaluation protocol at the atomic level. In this work, we propose a held-out evaluation protocol that clusters atomic environments by SOAP descriptors and computes metrics accounting only for cluster labels unseen during training. Following this procedure, we use 5$\times$5 cross-validation and Tukey's HSD to run a statistically rigorous comparison of E(3)-equivariant against non-equivariant, rotationally augmented models for predicting electron populations and multipoles of H, C, N, and O atoms. Building on our results, we introduce the Quantum Topological Neural Network (QT-Net), a rotationally augmented, non-equivariant graph neural network. We show that QT-Net can be used to infer properties of atoms in molecules from QM9 outside our training set, and that these inferred properties can yield improvement when used as input features for downstream molecular property prediction. To further validate the framework, molecular dipole moments computed from QT-Net's per-atom outputs recover the ground-truth values reported in QM9. We release all code and data, including a JAX implementation of QT-Net, to support the broader use of learned QTA properties as inductive biases for atomic-scale molecular machine learning.
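The dipole check mentioned in the abstract can be illustrated with a short sketch. The abstract does not state the exact formula, so this assumes the standard decomposition of a molecular dipole into a charge-transfer term and an atomic-polarization term, $\boldsymbol{\mu} = \sum_i (q_i \mathbf{r}_i + \boldsymbol{\mu}_i)$ with net atomic charge $q_i = Z_i - N_i$, in atomic units. The function name `molecular_dipole`, the sign convention for atomic dipoles, and the toy numbers are hypothetical.

```python
import math


def molecular_dipole(positions, nuclear_charges, electron_populations, atomic_dipoles):
    """Sum per-atom charge-transfer and polarization terms into a molecular dipole.

    positions            : per-atom (x, y, z) nuclear coordinates
    nuclear_charges      : per-atom nuclear charge Z_i
    electron_populations : per-atom electron population N_i
    atomic_dipoles       : per-atom (x, y, z) atomic dipole moment
    All quantities are assumed to be in atomic units.
    """
    total = [0.0, 0.0, 0.0]
    for r, z, n, mu in zip(positions, nuclear_charges, electron_populations, atomic_dipoles):
        q = z - n  # net atomic charge from the electron population
        for k in range(3):
            total[k] += q * r[k] + mu[k]
    return total


# Toy two-atom example (made-up numbers, not taken from QM9).
positions = [(0.0, 0.0, 0.0), (0.0, 0.0, 2.0)]
nuclear_charges = [1.0, 1.0]
electron_populations = [0.8, 1.2]
atomic_dipoles = [(0.0, 0.0, 0.1), (0.0, 0.0, -0.05)]

dipole = molecular_dipole(positions, nuclear_charges, electron_populations, atomic_dipoles)
magnitude = math.sqrt(sum(c * c for c in dipole))
print(dipole, magnitude)
```

For a neutral molecule the result does not depend on the choice of origin, which makes the magnitude a convenient quantity to compare against a reference value.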
Problem

Research questions and friction points this paper is trying to address.

atomic properties
out-of-distribution evaluation
machine learning
molecular property prediction
evaluation protocol
Innovation

Methods, ideas, or system contributions that make the work stand out.

out-of-distribution evaluation
atomic chemical space
rotationally augmented GNN
quantum topological properties
SOAP clustering
Pablo Martínez Crespo
Department of Computer Science and Engineering, Chalmers University of Technology and University of Gothenburg
Stefano Ribes
Department of Computer Science and Engineering, Chalmers University of Technology and University of Gothenburg
Martin Rahm
Associate Professor, Chalmers University of Technology
Theoretical Chemistry, Chemical Bonding, Astrobiology, High Pressure, Energetic Materials
Richard Beckmann
Department of Computer Science and Engineering, Chalmers University of Technology and University of Gothenburg
Robert S. Jordan
Technology Research, Intel Corporation
Marisa Gliege
Chief Technology Office, EMD Electronics
Santiago Miret
Lila Sciences
Vijay Kris Narasimhan
Chief Technology Office, M Ventures
Rocío Mercado
Chalmers University of Technology
molecular engineering, machine learning, deep generative models, drug discovery, materials discovery