Perspective: Towards sustainable exploration of chemical spaces with machine learning

📅 2026-03-31
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This work addresses the sustainability challenges in AI-driven molecular and materials discovery, which are exacerbated by escalating computational and data demands. The authors propose a hierarchical intelligent workflow that integrates efficient modeling, multi-fidelity learning, active sampling, and physics-informed constraints to jointly optimize energy efficiency, reliability, synthesizability, and multiple performance objectives. By leveraging foundation models, knowledge distillation, and embedded physical priors, the framework substantially reduces the computational cost of quantum simulations and model training while maintaining predictive accuracy, thereby enhancing scientific output per unit of compute. This approach advances the responsible deployment of green AI in drug and materials discovery and promotes open data sharing and reusable pipelines, offering a sustainable pathway for domain-specific AI systems.
📝 Abstract
Artificial intelligence is transforming molecular and materials science, but its growing computational and data demands raise critical sustainability challenges. In this Perspective, we examine resource considerations across the AI-driven discovery pipeline--from quantum-mechanical (QM) data generation and model training to automated, self-driving research workflows--building on discussions from the ``SusML workshop: Towards sustainable exploration of chemical spaces with machine learning'' held in Dresden, Germany. In this context, the availability of large quantum datasets has enabled rigorous benchmarking and rapid methodological progress, while also incurring substantial energy and infrastructure costs. We highlight emerging strategies to enhance efficiency, including general-purpose machine learning (ML) models, multi-fidelity approaches, model distillation, and active learning. Moreover, incorporating physics-based constraints within hierarchical workflows, where fast ML surrogates are applied broadly and high-accuracy QM methods are used selectively, can further optimize resource use without compromising reliability. Equally important is bridging the gap between idealized computational predictions and real-world conditions by accounting for synthesizability and multi-objective design criteria, which is essential for practical impact. Finally, we argue that sustainable progress will rely on open data and models, reusable workflows, and domain-specific AI systems that maximize scientific value per unit of computation, enabling efficient and responsible discovery of technological materials and therapeutics.
Problem

Research questions and friction points this paper is trying to address.

sustainability
chemical space exploration
machine learning
computational cost
resource efficiency
Innovation

Methods, ideas, or system contributions that make the work stand out.

sustainable machine learning
multi-fidelity modeling
active learning
physics-informed AI
chemical space exploration
🔎 Similar Papers
No similar papers found.
L
Leonardo Medrano Sandonas
Institute for Materials Science and Max Bergmann Center of Biomaterials, TUD Dresden University of Technology, 01062, Dresden, Germany
D
David Balcells
Hylleraas Centre for Quantum Molecular Sciences, Department of Chemistry, University of Oslo, P.O. Box 1033, Blindern, 0315 Oslo, Norway
A
Anton Bochkarev
Interdisciplinary Centre for Advanced Materials Simulation (ICAMS), Ruhr-University Bochum, 44780 Bochum, Germany
J
Jacqueline M. Cole
Ray Dolby Centre, Cavendish Laboratory, Department of Physics, University of Cambridge, J. J. Thomson Avenue, Cambridge, CB3 0US, United Kingdom
V
Volker L. Deringer
Inorganic Chemistry Laboratory, Department of Chemistry, University of Oxford, Oxford OX1 3QR, United Kingdom
W
Werner Dobrautz
Center for Advanced Systems Understanding, Helmholtz-Zentrum Dresden-Rossendorf, Germany; Center for Scalable Data Analytics and Artificial Intelligence Dresden/Leipzig, TU Dresden, Germany
Adrian Ehrenhofer
Adrian Ehrenhofer
Research Group Leader, Technische Universität Dresden
Modelling and simulation of active materialsMaterials Informatics
T
Thorben Frank
Machine Learning Group, Technische Universität Berlin, 10587 Berlin, Germany
Pascal Friederich
Pascal Friederich
Karlsruhe Institute of Technology
Machine LearningMaterials designGraph Neural NetworksComputational chemistryMultiscale modeling
R
Rico Friedrich
Theoretical Chemistry, TUD Dresden University of Technology, Germany; Institute of Ion Beam Physics and Materials Research, Helmholtz-Zentrum Dresden-Rossendorf, Germany
J
Janine George
Federal Institute for Materials Research and Testing (BAM), Unter den Eichen 87, 12205 Berlin, Germany; Friedrich-Schiller-University Jena, Institute of Condensed Matter Theory and Optics, Max-Wien-Platz 1, 07743 Jena, Germany
L
Luca Ghiringhelli
Scientific Computing Center, Karlsruhe Institute of Technology, 76131 Karlsruhe, Germany
A
Alejandra Hinostroza Caldas
Faculty of Computer Science, TUD Dresden University of Technology, 01062 Dresden, Germany
V
Veronika Juraskova
Chemistry Research Laboratory, Department of Chemistry, University of Oxford, Oxford, OX1 3TA, United Kingdom
H
Hannes Kneiding
Hylleraas Centre for Quantum Molecular Sciences, Department of Chemistry, University of Oslo, P.O. Box 1033, Blindern, 0315 Oslo, Norway
Yury Lysogorskiy
Yury Lysogorskiy
ICAMS, Ruhr University Bochum
interatomic potentialsmachine learning
J
Johannes T. Margraf
University of Bayreuth, Bavarian Center for Battery Technology (BayBatt), Bayreuth, Germany
H
Hanna Türk
École Polytechnique Fédérale de Lausanne, Institute of Materials, Lausanne, 1015 Switzerland
A
Anatole von Lilienfeld
Department of Chemistry, Chemical Physics Theory Group, University of Toronto, St. George Campus, Toronto, ON M5R 0A3, Canada
M
Milica Todorović
Department of Mechanical and Materials Engineering, University of Turku, FI-20014 Turku, Finland
Alexandre Tkatchenko
Alexandre Tkatchenko
Professor of Physics, University of Luxembourg; Visiting Professor, TU Berlin; APS Fellow; FRSC
Intermolecular InteractionsAI for ScienceChemical PhysicsMaterials Physics
Mariana Rossi
Mariana Rossi
Head of Independent Lise Meitner Group, MPI for Structure and Dynamics of Matter, Hamburg
Condensed matter physicsChemical PhysicsHydrogen Dynamics2D Materials
G
Gianaurelio Cuniberti
Institute for Materials Science and Max Bergmann Center of Biomaterials, TUD Dresden University of Technology, 01062, Dresden, Germany; Cluster of Excellence CARE, TU Dresden and RWTH Aachen, Germany; Cluster of Excellence CeTI, TU Dresden, Germany