- [Aug. 2025] Named Affiliate Member of Stanford CORES
- [May. 2025] Delivered keynote at Stats4Onc conference
- [Apr. 2025] Gave invited talk at Stanford Data Science Conference
- [Mar. 2025] Received Nvidia research grant
- [Feb. 2025] Paper accepted to CVPR 2025
- [Aug. 2024] Selected as Stanford Data Science Scholar
- [Jun. 2024] Began internship at Hugging Face (ML Engineer)
- Selected Publications:
- No Tokens Wasted: Leveraging Long Context in Biomedical Vision-Language Models
- BIOMEDICA: An Open Biomedical Image-Caption Archive, Dataset, and Vision-Language Models Derived from Scientific Literature
- Can Large Language Models Match the Conclusions of Systematic Reviews?
- regionalpcs: improved discovery of DNA methylation associations with complex traits
Research Experience
Most recently, a Machine Learning Engineer intern at Hugging Face. Previously, worked on NGS lab workflows at Invitae and early cancer detection at Guardant Health.
Education
Pursuing a PhD at Stanford, co-advised by Serena Yeung-Levy and Robert Tibshirani. Supported by the National Library of Medicine T15 grant, ARPA-H, and the Stanford Data Science Scholars fellowship.
Background
A PhD candidate at Stanford, developing machine learning methods to advance biomedical research and clinical care. My work focuses on training biomedical vision-language models and building large-scale open-source datasets to create reproducible, generalizable, and clinically meaningful AI for applications in areas like precision oncology.