Scholar
Haoning Wu
Google Scholar ID: ia4M9mMAAAAJ
Shanghai Jiao Tong University
Computer Vision
Multi-modal Learning
Generative Models
Citations & Impact
All-time
Citations: 194
H-index: 6
i10-index: 5
Publications: 11
Co-authors: 0
Contact
No contact links provided.
Publications
29 items
Count Anything at Any Granularity · 2026 · Cited: 0
Improving Human Image Animation via Semantic Representation Alignment · 2026 · Cited: 0
GenTac: Generative Modeling and Forecasting of Soccer Tactics · 2026 · Cited: 0
PerceptionComp: A Video Benchmark for Complex Perception-Centric Reasoning · 2026 · Cited: 0
OmniStream: Mastering Perception, Reconstruction and Action in Continuous Streams · 2026 · Cited: 0
WorldVQA: Measuring Atomic World Knowledge in Multimodal Large Language Models · 2026 · Cited: 0
Towards Pixel-Level VLM Perception via Simple Points Prediction · 2026 · Cited: 1
BabyVision: Visual Reasoning Beyond Language · arXiv.org · 2026 · Cited: 4
Co-authors
0 total