Liangyu Chen
Scholar

Liangyu Chen

Google Scholar ID: vi5Zt9oAAAAJ
Stanford University
Machine LearningComputer VisionHealth InformaticsNatural Language Processing
Citations & Impact
All-time
Citations
1,264
 
H-index
11
 
i10-index
12
 
Publications
17
 
Co-authors
7
list available
Publications
17 items
Browse publications on Google Scholar (top-right) ↗
Resume (English only)
Academic Achievements
  • Terminal-Bench: A Benchmark for AI Agents in Terminal Environments
  • The Impact of Image Resolution on Biomedical Multimodal Large Language Models
  • Open Thoughts: The first open-source model trained on public reasoning data to match DeepSeek-R1-Distill's performance through 1000+ systematic data curation/synthesis experiments
  • BIOMEDICA: An Open Biomedical Image-Caption Archive, Dataset, and Vision-Language Models Derived from Scientific Literature
  • Diversify and Conquer: Diversity-Centric Data Selection with Iterative Refinement
  • MMLongBench-Doc: Benchmarking Long-context Document Understanding with Visualizations
Research Experience
  • Currently working as a research scientist intern at Meta Superintelligence Labs.
Education
  • Ph.D. student at Stanford University, advised by Serena Yeung and Ludwig Schmidt; Undergraduate at Nanyang Technological University, advised by Ziwei Liu; Worked with Alan Yuille and Zongwei Zhou at Johns Hopkins University.
Background
  • A Computer Science Ph.D. student, with research interests in developing scalable frameworks for training and evaluating AI agents, multimodal reasoning agents, data-efficient training methodologies, data collection pipelines, and practical architectures that enable agents to perform complex reasoning tasks. Passionate about bridging the gap between research and real-world applications of AI systems.
Miscellany
  • Passionate about connecting with people working on synthetic data, exploring collaborative research opportunities, or developing practical reasoning applications to discuss potential synergies.