Structural Reasoning Improves Molecular Understanding of LLM

📅 2024-10-08
📈 Citations: 2 (influential: 0)
🤖 AI Summary
Problem: Current large language models (LLMs) show clear limitations in reasoning about molecular structure, particularly in using structural features such as functional groups to predict molecular properties. Method: The authors propose Molecular Structural Reasoning (MSR), a framework that explicitly incorporates molecular structural sketches into LLM-based reasoning. MSR provides two reasoning paths, one for known and one for unknown target molecules, and integrates SMILES and graph-based structural encodings, structure-aware prompt engineering, and a multi-stage reasoning chain to produce an interpretable structure-to-language mapping. Contribution/Results: Across multiple molecular property prediction and functional group identification tasks, MSR is reported to improve accuracy consistently over baseline LLMs. These results support the claim that explicit structural modeling strengthens LLMs’ chemical understanding, narrowing the gap between symbolic chemical knowledge and neural language reasoning.

📝 Abstract
Recently, large language models (LLMs) have shown significant progress, approaching human perception levels. In this work, we demonstrate that despite these advances, LLMs still struggle to reason using molecular structural information. This gap is critical because many molecular properties, including functional groups, depend heavily on such structural details. To address this limitation, we propose an approach that sketches molecular structures for reasoning. Specifically, we introduce the Molecular Structural Reasoning (MSR) framework, which enhances the molecular understanding of LLMs by explicitly incorporating key structural features. We present two frameworks: one for scenarios where the target molecule is known and one for scenarios where it is unknown. Extensive experiments verify that MSR improves molecular understanding.
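To make the idea of "sketching" structure before reasoning concrete, here is a minimal illustration in Python. It is not the authors' implementation: the structural features chosen (heavy-atom counts, ring-closure count, aromaticity), the function names `sketch_structure` and `build_prompt`, and the prompt wording are all assumptions made for this example. The tokenizer is a toy that handles only common SMILES tokens, where a real pipeline would use a cheminformatics toolkit.

```python
import re
from collections import Counter

# Toy SMILES tokenizer (an assumption for illustration, NOT a full parser):
# bracket atoms, two-letter halogens, organic-subset atoms, aromatic atoms,
# and ring-closure labels (%nn or a single digit).
TOKEN = re.compile(r"\[[^\]]+\]|Cl|Br|[BCNOPSFI]|[bcnops]|%\d{2}|\d")


def sketch_structure(smiles: str) -> dict:
    """Extract a few coarse structural features from a SMILES string."""
    atoms = Counter()
    closure_labels = 0
    aromatic = False
    for tok in TOKEN.findall(smiles):
        if tok.isdigit() or tok.startswith("%"):
            closure_labels += 1          # each ring bond contributes two labels
        elif tok.startswith("["):
            atoms[tok] += 1              # keep bracket atoms verbatim
        else:
            if tok.islower():
                aromatic = True          # lowercase atom symbol = aromatic atom
            atoms[tok.capitalize()] += 1
    return {
        "heavy_atoms": dict(atoms),
        "ring_closures": closure_labels // 2,
        "aromatic": aromatic,
    }


def build_prompt(smiles: str, question: str) -> str:
    """Prepend the structural sketch to a question (hypothetical prompt format)."""
    s = sketch_structure(smiles)
    atoms = ", ".join(f"{el}: {n}" for el, n in sorted(s["heavy_atoms"].items()))
    return (
        f"Molecule (SMILES): {smiles}\n"
        f"Structural sketch: heavy atoms ({atoms}); "
        f"ring closures: {s['ring_closures']}; "
        f"aromatic: {'yes' if s['aromatic'] else 'no'}\n"
        f"Question: {question}"
    )


if __name__ == "__main__":
    print(build_prompt("c1ccccc1O", "Which functional groups are present?"))
```

In this sketch the structural facts are computed outside the model and stated explicitly in the prompt, so the LLM does not have to infer them from the raw SMILES string.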
Problem

Research questions and friction points this paper is trying to address.

LLMs struggle with molecular structural reasoning
Molecular properties depend on structural details
Propose MSR framework to enhance LLM understanding
Innovation

Methods, ideas, or system contributions that make the work stand out.

Proposes Molecular Structural Reasoning (MSR) framework
Sketches molecular structures for enhanced reasoning
Explicitly incorporates key structural features
Yunhui Jang
Pohang University of Science and Technology (POSTECH)
Jaehyung Kim
Yonsei University
Sungsoo Ahn
KAIST
Machine Learning