Scholar
Xiaobin Hu
Google Scholar ID: 3lMuodUAAAAJ
Tencent Youtu Lab;Technische Universität München (TUM)
Deep learning
Computer vision
VLM
Agents
Follow
Homepage
↗
Google Scholar
↗
Citations & Impact
All-time
Citations
5,641
H-index
20
i10-index
28
Publications
20
Co-authors
25
list available
Contact
Email
xbhunanu@126.com
GitHub
Open ↗
Publications
68 items
Future Forcing: Future-aware Training-free KV Cache Policy for Autoregressive Video Generation
2026
Cited
0
What Semantics Survive the Connector? Diagnosing VLM-to-DiT Alignment in Video Editing
2026
Cited
0
Mega-ASR: Towards In-the-wild^2 Speech Recognition via Scaling up Real-world Acoustic Simulation
2026
Cited
0
PixVerve: Advancing Native UHR Image Generation to 100MP with a Large-Scale High-Quality Dataset
2026
Cited
0
SPIKE: An Adaptive Dual Controller Framework for Cost-Efficient Long-Horizon Game Agents
2026
Cited
0
VPD-100K: Towards Generalizable and Fine-grained Visual Privacy Protection
2026
Cited
0
Anisotropic Modality Align
2026
Cited
0
4DThinker: Thinking with 4D Imagery for Dynamic Spatial Understanding
2026
Cited
0
Load more
Resume (English only)
Academic Achievements
Recipient of Shanghai Overseas Talents Award (Baiyulan Young Talent Program), 2023
Oct 2025: PointSeg awarded ICCV Workshop Best Demonstration Award
Jul 2025: IPVG accepted by ACM MM 2025 and runner-up in the 2025 ACM Multimedia Challenge (Identity-preserving Video Generation)
Jul 2025: DICE-Talk and StrandDesigner accepted by ACM MM 2025
Jun 2025: OracleFusion and UniCombine accepted by ICCV 2025
Feb 2025: Eight papers (Sonic, VTON-HandFit, FTEdit, CustAny, GroundingFace, SVFR, DVHGNN, Mobilemamba) accepted by CVPR 2025 (ranked 92nd globally)
Jul 2024: 3Diffusion accepted by ACM MM 2024
Jul 2024: RLR and DiffuMatting accepted by ECCV 2024
Jul 2024: One paper accepted by IEEE TCSVT
May 2024: One paper accepted by Pattern Recognition (PR)
Selected publications include: Sonic (CVPR 2025), OracleFusion (ICCV 2025), VTON-HandFit (arXiv 2024), DiffuMatting (ECCV 2024), RLR (ECCV 2024), Manipvqa (IROS 2024), HitNet (AAAI 2023), Plug-and-Play 3D (TPAMI 2022), among others
Co-authors
25 total
Bjoern Menze
Universität Zürich
Donghao Luo
Youtu lab@Tencent, Shanghai Jiao Tong University
Jiangning Zhang (张江宁)
Youtu Lab, Tencent | Zhejiang University
Wenqi Ren
Sun Yat-sen University
Chengjie WANG(汪铖杰)
Tencent Youtu Lab, Shanghai Jiao Tong University
Co-author 6
Xiaochun Cao
Sun Yat-sen University
Hongwei Bran Li
Martinos Center, MGH, Harvard Medical School
×
Welcome back
Sign in to Agora
Welcome back! Please sign in to continue.
Email address
Password
Forgot password?
Continue
Do not have an account?
Sign up