- EMNLP 2023: BioT5: Enriching Cross-modal Integration in Biology with Chemical Knowledge and Natural Language Associations
- ACL 2024 (Findings): BioT5+: Towards Generalized Biological Understanding with IUPAC Integration and Multi-task Tuning
- ICLR 2025: 3D-MolT5: Leveraging Discrete Structural Information for Molecule-Text Modeling
- NeurIPS 2023: FABind: Fast and Accurate Protein-Ligand Binding
- CIKM 2024: Exploiting Pre-trained Models for Drug Target Affinity Prediction with Nearest Neighbors
- Briefings in Bioinformatics 2023: SSM-DTA: Breaking the Barriers of Data Scarcity in Drug-Target Affinity Prediction
- KDD 2025: FABind+: Enhancing Molecular Docking through Improved Pocket Prediction and Pose Generation
- Nature Communications 2024: TamGen: drug design with target-aware molecule generation through a chemical language model
Awards
- 2024 ACL Workshop/Competition: 1st Place (Text-based Molecule Generation Track), 2nd Place (Molecular Captioning Track)
Research Experience
- Internship: OpenDataLab, Shanghai Artificial Intelligence Laboratory, mentored by Dr. Lijun Wu
- Core contributor: OpenDataArena
Education
- Ph.D. degree: Gaoling School of Artificial Intelligence (GSAI), Renmin University of China, supervised by Prof. Rui Yan (2022-present)
- B.S. degree: School of Computer Science and Technology, University of Science and Technology of China (USTC) (2022)
Background
He is currently a fourth-year Ph.D. student in the ALOHA group at the Gaoling School of Artificial Intelligence (GSAI), Renmin University of China, supervised by Prof. Rui Yan. His research interests include AI for science (AI4Science) and data-centric LLMs.