Meta-Learning Linear Models for Molecular Property Prediction

๐Ÿ“… 2025-09-16
๐Ÿ“ˆ Citations: 0
โœจ Influential: 0
๐Ÿ“„ PDF
๐Ÿค– AI Summary
Chemical property prediction faces the dual challenge of limited small-sample data and the difficulty of balancing predictive performance with model interpretability. To address this, we propose a meta-learning-driven linear interpretable method: leveraging multi-task meta-learning to identify task-shared parameters and construct a universal functional manifold for initializing new tasksโ€”enabling knowledge transfer without sharing raw data. By integrating ridge regression with multi-task learning, the model preserves strict linearity, ensuring transparency and post-hoc explainability. Evaluated on multiple molecular property prediction benchmarks, our approach achieves 1.1ร—โ€“25ร— higher predictive accuracy than conventional linear models, markedly improving reliability and interpretability in low-data regimes. This work establishes a novel paradigm for eXplainable AI (XAI)-enabled chemical modeling, bridging the gap between data efficiency, generalization, and scientific interpretability.

Technology Category

Application Category

๐Ÿ“ Abstract
Chemists in search of structure-property relationships face great challenges due to limited high quality, concordant datasets. Machine learning (ML) has significantly advanced predictive capabilities in chemical sciences, but these modern data-driven approaches have increased the demand for data. In response to the growing demand for explainable AI (XAI) and to bridge the gap between predictive accuracy and human comprehensibility, we introduce LAMeL - a Linear Algorithm for Meta-Learning that preserves interpretability while improving the prediction accuracy across multiple properties. While most approaches treat each chemical prediction task in isolation, LAMeL leverages a meta-learning framework to identify shared model parameters across related tasks, even if those tasks do not share data, allowing it to learn a common functional manifold that serves as a more informed starting point for new unseen tasks. Our method delivers performance improvements ranging from 1.1- to 25-fold over standard ridge regression, depending on the domain of the dataset. While the degree of performance enhancement varies across tasks, LAMeL consistently outperforms or matches traditional linear methods, making it a reliable tool for chemical property prediction where both accuracy and interpretability are critical.
Problem

Research questions and friction points this paper is trying to address.

Addressing limited high-quality datasets for molecular property prediction
Bridging the gap between predictive accuracy and interpretability in chemistry
Leveraging meta-learning to improve performance across multiple chemical properties
Innovation

Methods, ideas, or system contributions that make the work stand out.

Meta-learning linear models for interpretability
Shared parameters across tasks without data sharing
Consistently outperforms traditional linear methods
๐Ÿ”Ž Similar Papers
No similar papers found.
Y
Yulia Pimonova
Computing and Artificial Intelligence Division, Los Alamos National Laboratory, Los Alamos, NM 87545, USA
Michael G. Taylor
Michael G. Taylor
Staff Scientist at Los Alamos National Laboratory
Computational ChemistryNovel MaterialsDensity Functional Theory
A
Alice Allen
Theoretical Division, Los Alamos National Laboratory, Los Alamos, NM 87545, USA; Center for Nonlinear Studies, Los Alamos National Laboratory, Los Alamos, NM 87545, USA; Max Planck Institute for Polymer Research, Ackermannweg 10, 55128 Mainz, Germany
P
Ping Yang
Theoretical Division, Los Alamos National Laboratory, Los Alamos, NM 87545, USA
Nicholas Lubbers
Nicholas Lubbers
Computer, Computational, and Statistical Sciences Division, Los Alamos National Laboratory