Scholar

Min Dou

Google Scholar ID: w9fTWKQAAAAJ

Shanghai AI Laboratory

Autonomous DrivingMLLMEmbodied AI

Google Scholar↗

Citations & Impact

All-time

Citations

3,192

H-index

15

i10-index

16

Publications

20

Co-authors

0

Contact

No contact links provided.

Publications

7 items

Training-Free Acceleration for Document Parsing Vision-Language Model with Hierarchical Speculative Decoding

2026

Cited

0

InternSpatial: A Comprehensive Dataset for Spatial Reasoning in Vision-Language Models

2025

Cited

0

InternVideo2.5: Empowering Video MLLMs with Long and Rich Context Modeling

2025

Cited

0

Expanding Performance Boundaries of Open-Source Multimodal Models with Model, Data, and Test-Time Scaling

2024

Cited

6

DreamForge: Motion-Aware Autoregressive Video Generation for Multi-View Driving Scenes

arXiv.org · 2024

Cited

1

Is Sora a World Simulator? A Comprehensive Survey on General World Models and Beyond

arXiv.org · 2024

Cited

63

ChartX & ChartVLM: A Versatile Benchmark and Foundation Model for Complicated Chart Reasoning

arXiv.org · 2024

Cited

30

Resume (English only)

Co-authors

0 total

Co-authors: 0 (list not available)