Suiren-1.0 Technical Report: A Family of Molecular Foundation Models

📅 2026-03-23
📈 Citations: 0
Influential: 0
📄 PDF

career value

205K/year
🤖 AI Summary
This work addresses the modeling gap between three-dimensional molecular conformations and two-dimensional statistical representations by introducing the Suiren-1.0 family of molecular foundation models. Built upon an SE(3)-equivariant architecture, Suiren-1.0 integrates spatial self-supervised learning with large-scale pretraining on density functional theory data and features a novel Conformational Compression Distillation (CCD) framework—leveraging diffusion models for the first time to compress 3D structures into efficient 2D representations. The model encompasses monomers, dimers, and conformational ensembles, establishing a multimodal molecular representation system that achieves state-of-the-art performance across multiple molecular property prediction tasks. All models and evaluation benchmarks are publicly released.

Technology Category

Application Category

📝 Abstract
We introduce Suiren-1.0, a family of molecular foundation models for the accurate modeling of diverse organic systems. Suiren-1.0 comprising three specialized variants (Suiren-Base, Suiren-Dimer, and Suiren-ConfAvg) is integrated within an algorithmic framework that bridges the gap between 3D conformational geometry and 2D statistical ensemble spaces. We first pre-train Suiren-Base (1.8B parameters) on a 70M-sample Density Functional Theory dataset using spatial self-supervision and SE(3)-equivariant architectures, achieving robust performance in quantum property prediction. Suiren-Dimer extends this capability through continued pre-training on 13.5M intermolecular interaction samples. To enable efficient downstream application, we propose Conformation Compression Distillation (CCD), a diffusion-based framework that distills complex 3D structural representations into 2D conformation-averaged representations. This yields the lightweight Suiren-ConfAvg, which generates high-fidelity representations from SMILES or molecular graphs. Our extensive evaluations demonstrate that Suiren-1.0 establishes state-of-the-art results across a range of tasks. All models and benchmarks are open-sourced.
Problem

Research questions and friction points this paper is trying to address.

molecular modeling
3D conformational geometry
2D statistical ensemble
quantum property prediction
intermolecular interactions
Innovation

Methods, ideas, or system contributions that make the work stand out.

molecular foundation models
SE(3)-equivariant architecture
conformation compression distillation
3D-to-2D representation learning
diffusion-based distillation
💼 Related Jobs
Postdoctoral Fellow – AI-Driven Multi-Omics Integration for Predictive Toxicology
Pfizer
The annual base salary for this position ranges from $64,600.00 to $107,600.00. In addition, this position is eligible for participation in Pfizer’s Global Performance Plan with a bonus target of 7.5% of the base salary. We offer comprehensive and generous benefits and programs to help our colleagues lead healthy lives and to support each of life’s moments. Benefits offered include a 401(k) plan with Pfizer Matching Contributions and an additional Pfizer Retirement Savings Contribution, paid vacation, holiday and personal days, paid caregiver/parental and medical leave, and health benefits to include medical, prescription drug, dental and vision coverage. Learn more at Pfizer Candidate Site – U.S. Benefits | (uscandidates.mypfizerbenefits.com). Pfizer compensation structures and benefit packages are aligned based on the location of hire. The United States salary range provided does not apply to Tampa, FL or any location outside of the United States. Relocation assistance may be available based on business needs and/or eligibility.
Hybrid
J
Junyi An
Shanghai Academy of AI for Science (SAIS), Golab
X
Xinyu Lu
Shanghai Academy of AI for Science (SAIS), Golab
Y
Yun-Fei Shi
Shanghai Academy of AI for Science (SAIS), Golab
L
Li-Cheng Xu
Shanghai Academy of AI for Science (SAIS), Golab
Nannan Zhang
Nannan Zhang
research scientist, NIH
C
Chao Qu
Shanghai Academy of AI for Science (SAIS), Golab
Y
Yuan Qi
Shanghai Academy of AI for Science (SAIS), Golab
F
Fenglei Cao
Shanghai Academy of AI for Science (SAIS), Golab