Mitigating Molecular Aggregation in Drug Discovery with Predictive Insights from Explainable AI

πŸ“… 2023-06-03
πŸ›οΈ Angewandte Chemie
πŸ“ˆ Citations: 2
✨ Influential: 0
πŸ“„ PDF
πŸ€– AI Summary
Small-molecule colloidal aggregate-mediated false positives remain a persistent challenge in high-throughput screening (HTS), impeding reliable hit identification. Method: This study introduces MEGAN, an interpretable AI framework that models molecular aggregation propensity using graph neural networks (GNNs), employs SHAP for explainable feature attribution, and pioneers a molecule-level counterfactual generation algorithm enabling atomic- or functional-group–scale structural modifications to mitigate aggregation. Contribution/Results: MEGAN transcends conventional medicinal chemistry intuition by accurately identifying non-intuitive aggregation behavior and delivering actionable, structure-based optimization strategies. Experimental validation via UV-Vis spectroscopy and dynamic light scattering (DLS) confirms both predictive accuracy and efficacy of designed modifications. The approach significantly improves false-positive detection rates and accelerates the identification of high-quality lead compounds.
πŸ“ Abstract
Herein, we present the application of MEGAN, our explainable AI (xAI) model, for the identification of small colloidally aggregating molecules (SCAMs). This work offers solutions to the long-standing problem of false positives caused by SCAMs in high throughput screening for drug discovery and demonstrates the power of xAI in the classification of molecular properties that are not chemically intuitive based on our current understanding. We leverage xAI insights and molecular counterfactuals to design alternatives to problematic compounds in drug screening libraries. Additionally, we experimentally validate the MEGAN prediction classification for one of the counterfactuals and demonstrate the utility of counterfactuals for altering the aggregation properties of a compound through minor structural modifications. The integration of this method in high-throughput screening approaches will help combat and circumvent false positives, providing better lead molecules more rapidly and thus accelerating drug discovery cycles.
Problem

Research questions and friction points this paper is trying to address.

Identifying small colloidally aggregating molecules in drug discovery
Reducing false positives in high throughput screening
Designing non-aggregating compounds using explainable AI insights
Innovation

Methods, ideas, or system contributions that make the work stand out.

Explainable AI model MEGAN identifies aggregating molecules
Leverages xAI insights for designing alternative compounds
Validates predictions with experimental counterfactual modifications
πŸ”Ž Similar Papers
No similar papers found.
H
Hunter Sturm
University of Manitoba, Winnipeg, Canada
Jonas Teufel
Jonas Teufel
Karlsruher Institut of Technology
Machine Learning and Optimization
K
Kaitlin A Isfeld
University of Manitoba, Winnipeg, Canada
Pascal Friederich
Pascal Friederich
Karlsruhe Institute of Technology
Machine LearningMaterials designGraph Neural NetworksComputational chemistryMultiscale modeling
R
Rebecca Davis
University of Manitoba, Winnipeg, Canada