De-An Huang
Scholar

De-An Huang

Google Scholar ID: HEY3UzgAAAAJ
Stanford University
Computer VisionRoboticsMachine LearningBioinformatics
Citations & Impact
All-time
Citations
6,563
 
H-index
37
 
i10-index
45
 
Publications
20
 
Co-authors
152
list available
Contact
Resume (English only)
Academic Achievements
  • Authored multiple high-impact papers at top venues including CVPR 2025, ICLR 2025, and arXiv, such as:
  • - FRAG: Frame Selection Augmented Generation for Long Video and Long Document Understanding
  • - Eagle series (Eagle 2, Eagle 2.5): Post-training data strategies for frontier vision-language models
  • - QLIP: Text-Aligned Visual Tokenization
  • - Omni-RGPT: Unified region-level image and video understanding
  • - NVILA: Efficient frontier visual language models
  • - T-Stitch: Accelerating sampling in pre-trained diffusion models
  • - X-VILA: Cross-modality alignment for large language models
  • - ARDuP: Active region video diffusion for universal policies
Research Experience
  • Research Scientist at NVIDIA
  • Summer internships at leading research labs:
  • - NVIDIA Seattle Robotics Lab (with Dieter Fox)
  • - Facebook Applied Machine Learning (with Vignesh Ramanathan and Dhruv Mahajan)
  • - Microsoft Research Redmond (with Zicheng Liu)
  • - Disney Research Pittsburgh (with Leonid Sigal)