Scholar
Huanyu Zhang
Google Scholar ID: mtI1oVQAAAAJ
Institute of Automation, Chinese Academy of Sciences
Multimodal Reasoning
MLLM
Time Series Analysis
Follow
Homepage
↗
Google Scholar
↗
Citations & Impact
All-time
Citations
211
H-index
6
i10-index
4
Publications
9
Co-authors
14
list available
Contact
No contact links provided.
Publications
16 items
PEARL: Personalized Streaming Video Understanding Model
2026
Cited
0
CoCo: Code as CoT for Text-to-Image Preview and Rare Concept Generation
2026
Cited
0
GEBench: Benchmarking Image Generation Models as GUI Environments
2026
Cited
0
How Well Do Models Follow Visual Instructions? VIBE: A Systematic Benchmark for Visual Instruction-Driven Image Editing
2026
Cited
2
Thinking in Frames: How Visual Context and Test-Time Scaling Empower Video Reasoning
2026
Cited
2
Latent Sketchpad: Sketching Visual Thoughts to Elicit Multimodal Reasoning in MLLMs
2025
Cited
0
Items Proxy Bridging: Enabling Frictionless Critiquing in Knowledge Graph Recommendations
2025
Cited
0
OpenGPT-4o-Image: A Comprehensive Dataset for Advanced Image Generation and Editing
2025
Cited
0
Load more
Resume (English only)
Co-authors
14 total
Yi-Fan Zhang
Institute of Automation, Chinese Academy of Sciences
Tieniu Tan
Institute of Automation, Chinese Academy of Sciences
Haochen Tian
Institute of Automation, Chinese Academy of Sciences
Chaoyou Fu
Nanjing University
Zhang Zhang
Institute of Automation, Chinese Academy of Sciences
Chengzu Li
University of Cambridge
Wenshan Wu
Senior Research SDE, Microsoft Research Asia
Furu Wei
Distinguished Scientist, Microsoft Research
×
Welcome back
Sign in to Agora
Welcome back! Please sign in to continue.
Email address
Password
Forgot password?
Continue
Do not have an account?
Sign up