Bohan Li

Google Scholar ID: V-YdQiAAAAAJ

Shanghai Jiao Tong University

3D Vision, stereo matching, disparity regression

Citations & Impact

All-time citations: 196

Publications

8 items

TokAN: Accent Normalization Using Self-Supervised Speech Tokens

2026

Bridging 3D Gaussians and Semantic Occupancy for Comprehensive Open-Vocabulary Scene Understanding from Unposed Images

2026

Read What You Hear: Reference-Free Hypotheses Evaluation with Acoustic Discrepancy

2026

HoliTok: A Continuous Holistic Tokenization with Robust Dual Capabilities of Speech Generation and Understanding

2026

From Articulated Kinematics to Routed Visual Control for Action-Conditioned Surgical Video Generation

2026

RAS: a Reliability Oriented Metric for Automatic Speech Recognition

2026

PAM: A Pose-Appearance-Motion Engine for Sim-to-Real HOI Video Generation

2026

The Interspeech 2026 Audio Reasoning Challenge: Evaluating Reasoning Process Quality for Audio Reasoning Models and Agents

2026

Resume

Academic Achievements

Published papers:
- OccScene: Semantic Occupancy-based Cross-task Mutual Learning for 3D Scene Generation (IEEE TPAMI)
- OmniNWM: Omniscient Driving Navigation World Models (arXiv)
- UniScene: Unified Occupancy-centric Driving Scene Generation (CVPR 2025)
- NaviNeRF++: Towards Interpretable 3D Reconstruction via Unsupervised Disentangled Representation Learning (IEEE TPAMI)
- Scaling Up Occupancy-centric Driving Scene Generation: Dataset and Method (arXiv)
- Is Sora a World Simulator? A Comprehensive Survey on General World Models and Beyond (arXiv)
- ORV: 4D Occupancy-centric Robot Video Generation (arXiv)

Research Experience

Worked as a Computer Vision Algorithm Engineer at Tencent AI Lab, focusing on large-scale 3D city scene generation and reconstruction. Contributed to multiple research projects, including OmniNWM and UniScene.

Education

Ph.D. student at Shanghai Jiao Tong University (SJTU) and Eastern Institute of Technology (EIT), Ningbo, advised by Prof. Xin Jin, Prof. Wenjun Zeng, Prof. Chao Ma, and Prof. Xiaokang Yang. Master's degree from South China University of Technology (SCUT); Bachelor's degree from Northeastern University (NEU). Also a Visiting Scholar at the National University of Singapore (NUS), in the Department of Biomedical Engineering and the Department of Electrical and Computer Engineering, working with Prof. Yueming Jin, and a close collaborator of Prof. Hao Zhao at Tsinghua University.

Background

Research interests lie in 3D computer vision, with a focus on 3D scene comprehension and multi-modal generation. Previously a Computer Vision Algorithm Engineer at Tencent AI Lab, working on large-scale 3D city scene generation and reconstruction.
