Browse publications on Google Scholar (top-right) ↗
Resume (English only)
Academic Achievements
Terminal-Bench: A Benchmark for AI Agents in Terminal Environments
The Impact of Image Resolution on Biomedical Multimodal Large Language Models
Open Thoughts: The first open-source model trained on public reasoning data to match DeepSeek-R1-Distill's performance through 1000+ systematic data curation/synthesis experiments
BIOMEDICA: An Open Biomedical Image-Caption Archive, Dataset, and Vision-Language Models Derived from Scientific Literature
Diversify and Conquer: Diversity-Centric Data Selection with Iterative Refinement
MMLongBench-Doc: Benchmarking Long-context Document Understanding with Visualizations
Research Experience
Currently working as a research scientist intern at Meta Superintelligence Labs.
Education
Ph.D. student at Stanford University, advised by Serena Yeung and Ludwig Schmidt. Undergraduate at Nanyang Technological University, advised by Ziwei Liu. Worked with Alan Yuille and Zongwei Zhou at Johns Hopkins University.
Background
Computer Science Ph.D. student whose research interests include scalable frameworks for training and evaluating AI agents, multimodal reasoning agents, data-efficient training methods, data collection pipelines, and practical architectures that enable agents to perform complex reasoning tasks. Passionate about bridging the gap between research and real-world applications of AI systems.
Miscellany
Always glad to connect with people who work on synthetic data, are exploring collaborative research opportunities, or are developing practical reasoning applications, and to discuss potential synergies.