Ziyuan Huang
Scholar

Ziyuan Huang

Google Scholar ID: A9D-disAAAAJ
Ant Group
Multimodal LLMVisual GenerationUnified Model
Citations & Impact
All-time
Citations
2,578
 
H-index
20
 
i10-index
29
 
Publications
20
 
Co-authors
15
list available
Resume (English only)
Academic Achievements
  • 2024, 'Accelerating Pre-training of Multimodal LLMs via Chain-of-Sight', Tech report.
  • 2024, 'Towards Better Vision-Inspired Vision-Language Models', CVPR.
  • 2024, 'Res-tuning: A flexible and efficient tuning paradigm via unbinding tuner from backbone', NeurIPS.
  • 2023, 'Towards Real-World Visual Tracking with Temporal Contexts', TPAMI.
  • 2023, 'Temporally-Adaptive Models for Efficient Video Understanding', Tech report.