Scholar
Haoning Wu
Google Scholar ID: ia4M9mMAAAAJ
Shanghai Jiao Tong University
Computer Vision
Multi-modal Learning
Generative Models
Follow
Homepage
↗
Google Scholar
↗
Citations & Impact
All-time
Citations
194
H-index
6
i10-index
5
Publications
11
Co-authors
0
Contact
No contact links provided.
Publications
26 items
PerceptionComp: A Video Benchmark for Complex Perception-Centric Reasoning
2026
Cited
0
OmniStream: Mastering Perception, Reconstruction and Action in Continuous Streams
2026
Cited
0
WorldVQA: Measuring Atomic World Knowledge in Multimodal Large Language Models
2026
Cited
0
Towards Pixel-Level VLM Perception via Simple Points Prediction
2026
Cited
1
BabyVision: Visual Reasoning Beyond Language
arXiv.org · 2026
Cited
4
SoccerMaster: A Vision Foundation Model for Soccer Understanding
2025
Cited
0
VQualA 2025 Challenge on Visual Quality Comparison for Large Multimodal Models: Methods and Results
2025
Cited
0
A Survey on the Techniques and Tools for Automated Requirements Elicitation and Analysis of Mobile Apps
2025
Cited
0
Load more
Resume (English only)
Co-authors
0 total
Co-authors: 0 (list not available)
×
Welcome back
Sign in to Agora
Welcome back! Please sign in to continue.
Email address
Password
Forgot password?
Continue
Do not have an account?
Sign up