MolRecBench-Wild: A Real-World Benchmark for Optical Chemical Structure Recognition

📅 2026-05-07
📈 Citations: 0
Influential: 0
📄 PDF

career value

213K/year
🤖 AI Summary
Existing optical chemical structure recognition (OCSR) systems exhibit poor performance on real-world scientific literature due to complex visual interferences and challenging chemical semantics. To address this gap, this work proposes MolRecBench-Wild—the first benchmark for chemical structure recognition in the wild, grounded in the two-dimensional difficulty framework MOSAIC. It comprises 5,029 structures annotated with 37 fine-grained attributes. The study introduces CARBON, a novel representation language capable of expressing unconventional chemical semantics, and devises a dual-track evaluation protocol compatible with both CARBON and SMILES. Systematic evaluation of 18 state-of-the-art OCSR models reveals a significant performance drop in authentic academic settings, underscoring a substantial discrepancy between current methodologies and practical deployment requirements.
📝 Abstract
Optical Chemical Structure Recognition (OCSR) aims to translate molecular diagrams in scientific literature into machine-readable formats, but current systems remain unreliable on real-world images due to substantial visual and chemical complexity. We introduce MOSAIC, a dual-dimensional difficulty framework with 37 fine-grained labels that jointly characterize visual interference and chemical semantic challenges in molecular diagrams. Based on this framework, we construct MolRecBench-Wild, a benchmark of 5,029 structures from 820 recent chemistry papers, covering the full difficulty spectrum observed in real publications. To enable faithful semantic evaluation beyond SMILES and MolFile, we propose CARBON, a representation language capable of expressing valence variations, icon-based groups, and other non-standard chemical semantics. We further adopt a dual-track evaluation protocol supporting both CARBON and SMILES outputs for broad model compatibility. Comprehensive experiments over 18 OCSR-capable models reveal severe performance degradation on MolRecBench-Wild, exposing a large gap between previous patent benchmarks and real-world academic scenarios.
Problem

Research questions and friction points this paper is trying to address.

Optical Chemical Structure Recognition
real-world benchmark
visual interference
chemical semantic complexity
molecular diagrams
Innovation

Methods, ideas, or system contributions that make the work stand out.

MOSAIC
MolRecBench-Wild
CARBON
Optical Chemical Structure Recognition
dual-dimensional difficulty
Haote Yang
Haote Yang
PJLab
CVLLMMLLMAI4S
H
Hui Wang
King's College London
C
Chen Zhu
East China University of Science and Technology
Jingchao Wang
Jingchao Wang
East China Normal University
AI
L
Linye Li
Tongji University
H
Hongbin Lai
Peking University
H
Huijie Ao
Fudan University
Y
Yongxuan Lyu
University of Science and Technology of China
Jiang Wu
Jiang Wu
Shanghai Artificial Intelligence Laboratory
large language modelvision language model
J
Jiaxing Sun
Shanghai Artificial Intelligence Laboratory
L
Lua Chen
Shanghai Artificial Intelligence Laboratory
Y
Yuanyuan Cao
Shanghai Artificial Intelligence Laboratory
R
Ruijie Zhang
Shanghai Artificial Intelligence Laboratory
S
Shengxin Lu
Shanghai Artificial Intelligence Laboratory
Lijun Wu
Lijun Wu
Shanghai AI Laboratory
MLLLMAI4Science
Bin Wang
Bin Wang
Pengcheng Laboratory
Cloud ComputingIIoTGreen ComputingComputer Architecture
Conghui He
Conghui He
Shanghai AI Laboratory
Data-centric AILLMDocument Intelligence