Mitigating Molecular Aggregation in Drug Discovery with Predictive Insights from Explainable AI

📅 2023-06-03

🏛️ Angewandte Chemie

📈 Citations: 2

✨ Influential: 0

career value

204K/year

🤖 AI Summary

Small-molecule colloidal aggregate-mediated false positives remain a persistent challenge in high-throughput screening (HTS), impeding reliable hit identification. Method: This study introduces MEGAN, an interpretable AI framework that models molecular aggregation propensity using graph neural networks (GNNs), employs SHAP for explainable feature attribution, and pioneers a molecule-level counterfactual generation algorithm enabling atomic- or functional-group–scale structural modifications to mitigate aggregation. Contribution/Results: MEGAN transcends conventional medicinal chemistry intuition by accurately identifying non-intuitive aggregation behavior and delivering actionable, structure-based optimization strategies. Experimental validation via UV-Vis spectroscopy and dynamic light scattering (DLS) confirms both predictive accuracy and efficacy of designed modifications. The approach significantly improves false-positive detection rates and accelerates the identification of high-quality lead compounds.

📝 Abstract

Herein, we present the application of MEGAN, our explainable AI (xAI) model, for the identification of small colloidally aggregating molecules (SCAMs). This work offers solutions to the long-standing problem of false positives caused by SCAMs in high throughput screening for drug discovery and demonstrates the power of xAI in the classification of molecular properties that are not chemically intuitive based on our current understanding. We leverage xAI insights and molecular counterfactuals to design alternatives to problematic compounds in drug screening libraries. Additionally, we experimentally validate the MEGAN prediction classification for one of the counterfactuals and demonstrate the utility of counterfactuals for altering the aggregation properties of a compound through minor structural modifications. The integration of this method in high-throughput screening approaches will help combat and circumvent false positives, providing better lead molecules more rapidly and thus accelerating drug discovery cycles.

Problem

Research questions and friction points this paper is trying to address.

Identifying small colloidally aggregating molecules in drug discovery

Reducing false positives in high throughput screening

Designing non-aggregating compounds using explainable AI insights

Innovation

Methods, ideas, or system contributions that make the work stand out.

Explainable AI model MEGAN identifies aggregating molecules

Leverages xAI insights for designing alternative compounds

Validates predictions with experimental counterfactual modifications

🔎 Similar Papers

No similar papers found.