🤖 AI Summary
To address the poor scalability and inferior generation quality of autoregressive models compared to diffusion models in 3D molecular generation, this paper introduces Quetzal—the first efficient and scalable autoregressive framework for 3D molecule generation. Its core innovation is atom serialization modeling: a causal Transformer predicts discrete atom types, while a lightweight continuous-coordinate diffusion MLP jointly models 3D atomic position distributions. The method natively supports variable-length generation tasks (e.g., hydrogen addition, scaffold completion) and enables exact likelihood computation and KL-divergence evaluation without architectural modification. Experiments demonstrate that Quetzal significantly outperforms existing autoregressive baselines and matches state-of-the-art diffusion models in generation quality—while achieving substantially faster inference. Quetzal thus establishes a highly competitive new pathway for autoregressive modeling in 3D molecular generation.
📝 Abstract
Generative models of 3D molecular structure play a rapidly growing role in the design and simulation of molecules. Diffusion models currently dominate the space of 3D molecule generation, while autoregressive models have trailed behind. In this work, we present Quetzal, a simple but scalable autoregressive model that builds molecules atom-by-atom in 3D. Treating each molecule as an ordered sequence of atoms, Quetzal combines a causal transformer that predicts the next atom's discrete type with a smaller Diffusion MLP that models the continuous next-position distribution. Compared to existing autoregressive baselines, Quetzal achieves substantial improvements in generation quality and is competitive with the performance of state-of-the-art diffusion models. In addition, by reducing the number of expensive forward passes through a dense transformer, Quetzal enables significantly faster generation speed, as well as exact divergence-based likelihood computation. Finally, without any architectural changes, Quetzal natively handles variable-size tasks like hydrogen decoration and scaffold completion. We hope that our work motivates a perspective on scalability and generality for generative modelling of 3D molecules.