Scholar
Yuhang Zang
Google Scholar ID: hW23VKIAAAAJ
Shanghai AI Laboratory
Natural Language Processing
Vision Language Model
Follow
Homepage
↗
Google Scholar
↗
Citations & Impact
All-time
Citations
6,046
H-index
26
i10-index
42
Publications
20
Co-authors
76
list available
Contact
Twitter
Open ↗
GitHub
Open ↗
LinkedIn
Open ↗
Publications
50 items
From Sparse to Dense: Multi-View GRPO for Flow Models via Augmented Condition Space
2026
Cited
0
Visual-ERM: Reward Modeling for Visual Equivalence
2026
Cited
0
Trust Your Critic: Robust Reward Modeling and Reinforcement Learning for Faithful Image Editing and Generation
2026
Cited
0
EndoCoT: Scaling Endogenous Chain-of-Thought Reasoning in Diffusion Models
2026
Cited
0
Visual Self-Refine: A Pixel-Guided Paradigm for Accurate Chart Parsing
2026
Cited
0
Demo-ICL: In-Context Learning for Procedural Video Knowledge Acquisition
2026
Cited
0
Unified Personalized Reward Model for Vision Generation
2026
Cited
1
EMemBench: Interactive Benchmarking of Episodic Memory for VLM Agents
2026
Cited
0
Load more
Resume (English only)
Academic Achievements
Multiple papers accepted by top international conferences and journals such as NeurIPS 2025, ICCV 2025, Findings of ACL 2025, ICML 2025, CVPR 2025, ICLR 2025, NeurIPS 2024, ACM MM 2024, ECCV 2024, CVPR 2024, IJCV. Notable works include UnifiedReward-Think, Hi-Flow, Visual-RFT, MM-IFEngine, X-Prompt, Bootstrap3D, Grounded CoT Highlight, Light-A-Video, MIR, SAM2Long, IXC-2.5-Reward, Light-ColPali, VideoRoPE, SongGen, ByTheWay, OVO-Bench, Dispider, PyramidDrop, WildAvatar, MIA-DPO, MotionClone, MMLongbench-Doc, ShareGPT4Video, MMDU, InternLM-XC2-4khd, VideoStreaming, MMStar, VLMEvalKit, Long-CLIP, MVSGaussian, Alpha-CLIP, CascadeMatch, OV-DETR.
Research Experience
Joined Apple (AI/ML) as a research intern in June 2023.
Education
Obtained Bachelor's degree from UESTC in 2019; obtained PhD from Nanyang Technological University in 2023, supervised by Prof. Chen Change Loy.
Background
Current research focuses on 1) post-training for multimodal LLMs (reinforcement fine-tuning, reward models), and 2) vision-language pre-training.
Miscellany
Hobbies and interests not mentioned
Co-authors
76 total
Xiaoyi Dong
Microsoft GenAI
Jiaqi Wang
Shanghai AI Laboratory
Dahua Lin
The Chinese University of Hong Kong
Haodong Duan
Shanghai AI Lab | CUHK | PKU
Yuhang Cao
MMLab The Chinese University of Hong Kong
Chen Change Loy
President's Chair Professor, MMLab@NTU, S-Lab, Nanyang Technological University
Kai Chen
Shanghai AI Laboratory
Kaiyang Zhou
Assistant Professor, Hong Kong Baptist University
×
Welcome back
Sign in to Agora
Welcome back! Please sign in to continue.
Email address
Password
Forgot password?
Continue
Do not have an account?
Sign up