Defa Zhu
Scholar

Google Scholar ID: v4ySl6MAAAAJ
ByteDance
AGI
Citations & Impact
All-time
  • Citations: 192
  • H-index: 5
  • i10-index: 4
  • Publications: 9
  • Co-authors: 0
Resume (English only)
Academic Achievements
  • 'Expert Race: A Flexible Routing Strategy for Scaling Diffusion Transformer with Mixture of Experts', ICML 2025
  • 'Frac-Connections: Fractional Extension of Hyper-Connections', Tech Report
  • 'Over-Tokenized Transformer: Vocabulary is Generally Worth Scaling', ICML 2025
  • 'Ultra-Sparse Memory Network', ICLR 2025
Research Experience
  • Researcher at ByteDance, focusing on Large Language Models (LLMs)
Education
  • Master's Degree, Chinese Academy of Sciences, 2020, focused on Generative AI research
  • Bachelor's Degree, Northeastern University, 2017
Background
  • Research Interests: Developing stronger architectures for Large Language Models (LLMs)
  • Professional Field: AI Research
Co-authors
0 total