Logos: An evolvable reasoning engine for rational molecular design

📅 2026-03-10
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This work addresses the challenge that existing AI models struggle to simultaneously ensure chemical validity and interpretable reasoning in molecular design, thereby limiting their reliability in scientific practice. To this end, the authors propose Logos—a compact molecular reasoning model that uniquely integrates explicit multi-step logical inference with strict chemical constraints through joint optimization. By employing a staged training strategy, Logos aligns reasoning chains with molecular representations and embeds domain-specific chemical rules directly into the optimization objective. Experimental results demonstrate that Logos achieves or surpasses the performance of large general-purpose language models in terms of structural accuracy and chemical validity across multiple benchmarks. Moreover, it exhibits robustness in multi-constraint molecular optimization tasks while maintaining high parameter efficiency and strong interpretability.

Technology Category

Application Category

📝 Abstract
The discovery and design of functional molecules remain central challenges across chemistry,biology, and materials science. While recent advances in machine learning have accelerated molecular property prediction and candidate generation, existing models tend to excel either in physical fidelity without transparent reasoning, or in flexible reasoning without guarantees of chemical validity. This imbalance limits the reliability of artificial intelligence systems in real scientific design workflows.Here we present Logos, a compact molecular reasoning model that integrates multi-step logical reasoning with strict chemical consistency. Logos is trained using a staged strategy that first exposes the model to explicit reasoning examples linking molecular descriptions to structural decisions, and then progressively aligns these reasoning patterns with molecular representations. In a final training phase, chemical rules and invariants are incorporated directly into the optimization objective, guiding the model toward chemically valid outputs. Across multiple benchmark datasets, Logos achieves strong performance in both structural accuracy and chemical validity, matching or surpassing substantially larger general-purpose language models while operating with a fraction of their parameters. Beyond benchmark evaluation, the model exhibits stable behaviour in molecular optimization tasks involving multiple, potentially conflicting constraints. By explicitly exposing intermediate reasoning steps, Logos enables human inspection and assessment of the design logic underlying each generated structure. These results indicate that jointly optimizing for reasoning structure and physical consistency offers a practical pathway toward reliable and interpretable AI systems for molecular science, supporting closer integration of artificial intelligence into scientific discovery processes.
Problem

Research questions and friction points this paper is trying to address.

molecular design
chemical validity
reasoning
interpretability
AI reliability
Innovation

Methods, ideas, or system contributions that make the work stand out.

molecular reasoning
chemical validity
interpretable AI
staged training
constraint-aware generation
🔎 Similar Papers
No similar papers found.
H
Haibin Wen
City University of Hong Kong, Hong Kong 999077, China
Zhe Zhao
Zhe Zhao
Renmin university of China
natural language processing
F
Fanfu Wang
Lanzhou University, Lanzhou 730000, China
Tianyi Xu
Tianyi Xu
Tulane University
Reinforcement LearningNetwork OptimizaitonStatisticsNLP(LLM)Operations research
H
Hao Zhang
Riltide Medicines, Beijing, China
C
Chao Yang
Shanghai Jiao Tong University, Shanghai 200240, China
Ye Wei
Ye Wei
City University of Hong Kong; Max Planck Institute
Learning of complex systemsData-driven optimization