Chuanhao Li (李川皓)

Google Scholar ID: s6xK5NUAAAAJ
Shanghai AI Laboratory
vision-and-language, MLLMs, compositional generalization
Citations & Impact (all-time)
  • Citations: 146
  • H-index: 6
  • i10-index: 3
  • Publications: 20
  • Co-authors: 6
Resume (English only)
Academic Achievements
  • arXiv 2025: From Pixels to Paths: A Multi-Agent Framework for Editable Scientific Illustration
  • arXiv 2025: Dialogue as Discovery: Navigating Human Intent Through Principled Inquiry
  • arXiv 2025: Multi-Step Reasoning for Embodied Question Answering via Tool Augmentation
  • arXiv 2025: MM-BrowseComp: A Comprehensive Benchmark for Multimodal Browsing Agents
  • arXiv 2025: YUME: An Interactive World Generation Model
  • arXiv 2025: IA-T2I: Internet-Augmented Text-to-Image Generation
  • arXiv 2025: A High-Quality Dataset and Reliable Evaluation for Interleaved Image-Text Generation
  • arXiv 2025: ARMOR: Empowering Autoregressive Multimodal Understanding Model with Interleaved Multimodal Generation via Asymmetric Synergy
  • arXiv 2025: SridBench: Benchmark of Scientific Research Illustration Drawing of Image Generation Model
  • AAAI 2026: MDK12-Bench: A Multi-Discipline Benchmark for Evaluating Reasoning in Multimodal Large Language Models
  • NeurIPS 2025: Sekai: A Video Dataset towards World Exploration
  • EMNLP 2025: InMind: Evaluating LLMs in Capturing
Research Experience
  • Researcher, Shanghai AI Lab
Background
  • Research interests: vision-and-language, image/video generation, internet-augmented generation, and compositional generalization.
  • Currently a researcher at Shanghai AI Lab, collaborating closely with Dr. Kaipeng Zhang and Dr. Wenqi Shao.