Publications
'Exploring the Impact of Model Scaling on Parameter-efficient Tuning Methods' accepted at EMNLP 2023; 'ChatDev' accepted at ACL 2024; 'AgentVerse' and 'ChatEval' accepted at ICLR 2024.
Projects
Created APRIL and released it on GitHub; created the rlsys Docker Hub organization to support RL training frameworks on AMD MI-series GPUs; integrated AMD ROCm support into slime and verl.
Research Experience
Postdoctoral researcher at CMU/MBZUAI, hosted by Eric Xing. Currently a research scientist on the AMD GenAI team.
Education
Ph.D. from the Department of Computer Science and Technology at Tsinghua University (2019 to 2023), advised by Zhiyuan Liu as part of the THUNLP Lab led by Maosong Sun.
Background
LLM researcher and engineer with research interests in foundation models, focusing on pre-training and post-training frameworks and training-efficiency optimization. Currently a research scientist on AMD's GenAI team.