Min Woo Sun
Scholar

Min Woo Sun

Google Scholar ID: SmQuNpEAAAAJ
Stanford University
Machine LearningMultimodal LearningStatistics
Citations & Impact
All-time
Citations
335
 
H-index
11
 
i10-index
11
 
Publications
20
 
Co-authors
5
list available
Resume (English only)
Academic Achievements
  • - [Aug. 2025] Gave invited talk at Cohere Labs
  • - [Aug. 2025] Named Affiliate Member of Stanford CORES
  • - [May. 2025] Delivered keynote at Stats4Onc conference
  • - [Apr. 2025] Gave invited talk at Stanford Data Science Conference
  • - [Mar. 2025] Received Nvidia research grant
  • - [Feb. 2025] Paper accepted to CVPR 2025
  • - [Aug. 2024] Selected as Stanford Data Science Scholar
  • - [Jun. 2024] Began internship at Hugging Face (ML Engineer)
  • - Selected Publications:
  • - No Tokens Wasted: Leveraging Long Context in Biomedical Vision-Language Models
  • - BIOMEDICA: An Open Biomedical Image-Caption Archive, Dataset, and Vision-Language Models Derived from Scientific Literature
  • - Can Large Language Models Match the Conclusions of Systematic Reviews?
  • - regionalpcs: improved discovery of DNA methylation associations with complex traits
Research Experience
  • Most recently, a Machine Learning Engineer intern at Hugging Face. Previously, worked on NGS lab workflows at Invitae and early cancer detection at Guardant Health.
Education
  • Pursuing a PhD at Stanford, co-advised by Serena Yeung-Levy and Robert Tibshirani. Supported by the National Library of Medicine T15 grant, ARPA-H, and the Stanford Data Science Scholars fellowship.
Background
  • A PhD candidate at Stanford, developing machine learning methods to advance biomedical research and clinical care. My work focuses on training biomedical vision-language models and building large-scale open-source datasets to create reproducible, generalizable, and clinically meaningful AI for applications in areas like precision oncology.
Miscellany
  • Personal interests not mentioned