🤖 AI Summary
Graph Neural Networks (GNNs) struggle to effectively model long-range interactions (>10 hops), limiting their applicability in molecular science and related domains. To address this, we propose ECHO—a dedicated, structured benchmark for evaluating long-range information propagation. ECHO comprises three synthetic graph tasks with controllable topologies (shortest path, eccentricity, diameter) and two density functional theory (DFT)-level real-world molecular property prediction tasks (atomic charge distribution and total energy). It introduces structured information-bottleneck graphs and chemically interpretable tasks to rigorously assess mainstream GNNs—including GCN, GAT, GIN, and PNA. Experimental results reveal a 40–70% performance degradation on long-range tasks across all models; message aggregation mechanisms and skip-connection designs emerge as primary bottlenecks. ECHO provides a reproducible diagnostic tool and a new evaluation standard for assessing long-range reasoning capabilities in AI for Science.
📝 Abstract
Effectively capturing long-range interactions remains a fundamental yet unresolved challenge in graph neural network (GNN) research, critical for applications across diverse fields of science. To systematically address this, we introduce ECHO (Evaluating Communication over long HOps), a novel benchmark specifically designed to rigorously assess the capabilities of GNNs in handling very long-range graph propagation. ECHO includes three synthetic graph tasks, namely single-source shortest paths, node eccentricity, and graph diameter, each constructed over diverse and structurally challenging topologies intentionally designed to introduce significant information bottlenecks. ECHO also includes two real-world datasets, ECHO-Charge and ECHO-Energy, which define chemically grounded benchmarks for predicting atomic partial charges and molecular total energies, respectively, with reference computations obtained at the density functional theory (DFT) level. Both tasks inherently depend on capturing complex long-range molecular interactions. Our extensive benchmarking of popular GNN architectures reveals clear performance gaps, emphasizing the difficulty of true long-range propagation and highlighting design choices capable of overcoming inherent limitations. ECHO thereby sets a new standard for evaluating long-range information propagation, also providing a compelling example for its need in AI for science.