Published the paper 'Defeating the Training-Inference Mismatch via FP16' (preprint, 2025); contributed to the project GEM: A Gym for Agentic LLMs (preprint, 2025).
Research Experience
Xiaohongshu Hi Lab, RedStar Intern, Aug. 2025 - Present; Sea AI Lab, Associate Member, Jul. 2025 - Aug. 2025; ByteDance Seed, Research Intern, May 2025 - Jul. 2025; ByteDance AI Lab, Research Intern, May 2023 - May 2025; ByteDance AML, Research Intern, Sep. 2022 - May 2023.
Education
Institute of Automation, Chinese Academy of Sciences / School of Artificial Intelligence, University of Chinese Academy of Sciences, Ph.D. Student, Sep. 2021 - Present (Advisor: Liang Wang); Tsinghua University, B.Eng. in Electronic Engineering, Sep. 2016 - Jul. 2021.
Background
Research focuses on reinforcement learning for enhancing large language models (LLMs), improving their reasoning abilities and making their responses more accurate, reliable, trustworthy, and interpretable. Also studies long-term memory for LLMs, aiming to let them personalize their behavior through continual interaction with users. Additionally, works on AI for Drug Discovery (AIDD), developing generative models and algorithms for designing small molecules and proteins.