Scholar
Min Dou
Google Scholar ID: w9fTWKQAAAAJ
Shanghai AI Laboratory
Autonomous Driving
MLLM
Embodied AI
Citations & Impact
Citations (all-time): 3,192
H-index: 15
i10-index: 16
Publications: 20
Co-authors: 0
Contact
No contact links provided.
Publications
7 items
Training-Free Acceleration for Document Parsing Vision-Language Model with Hierarchical Speculative Decoding
2026 · Cited: 0
InternSpatial: A Comprehensive Dataset for Spatial Reasoning in Vision-Language Models
2025 · Cited: 0
InternVideo2.5: Empowering Video MLLMs with Long and Rich Context Modeling
2025 · Cited: 0
Expanding Performance Boundaries of Open-Source Multimodal Models with Model, Data, and Test-Time Scaling
2024 · Cited: 6
DreamForge: Motion-Aware Autoregressive Video Generation for Multi-View Driving Scenes
arXiv.org · 2024 · Cited: 1
Is Sora a World Simulator? A Comprehensive Survey on General World Models and Beyond
arXiv.org · 2024 · Cited: 63
ChartX & ChartVLM: A Versatile Benchmark and Foundation Model for Complicated Chart Reasoning
arXiv.org · 2024 · Cited: 30