🤖 AI Summary
Is explicit equivariant modeling strictly necessary for high-quality 3D molecular generation? This work challenges the prevailing assumption that architectural equivariance is indispensable.
Method: We investigate whether non-equivariant convolutional neural networks (CNNs) can implicitly learn rotational equivariance through extensive rotation-based data augmentation. We introduce a novel loss decomposition framework to quantitatively analyze how model size, dataset scale, and training duration affect the emergence of equivariance—first such systematic study in molecular generation. Experiments span denoising, conformational generation, and property prediction.
Results: With sufficient rotational augmentation, non-equivariant CNNs match or surpass state-of-the-art equivariant graph neural networks (GNNs) across all tasks, while exhibiting greater training stability, lower computational overhead, and superior scalability. Our findings establish a lightweight, efficient paradigm for 3D molecular generation without architectural equivariance constraints.
📝 Abstract
Deep generative models are increasingly used for molecular discovery, with most recent approaches relying on equivariant graph neural networks (GNNs) under the assumption that explicit equivariance is essential for generating high-quality 3D molecules. However, these models are complex, difficult to train, and scale poorly.
We investigate whether non-equivariant convolutional neural networks (CNNs) trained with rotation augmentations can learn equivariance and match the performance of equivariant models. We derive a loss decomposition that separates prediction error from equivariance error, and evaluate how model size, dataset size, and training duration affect performance across denoising, molecule generation, and property prediction. To our knowledge, this is the first study to analyze learned equivariance in generative tasks.