Scholar
Xiaobin Hu
Google Scholar ID: 3lMuodUAAAAJ
Tencent Youtu Lab; Technische Universität München (TUM)
Deep learning
Computer vision
VLM
Agents
Citations & Impact
All-time
Citations
5,641
H-index
20
i10-index
28
Publications
20
Co-authors
25
list available
Contact
Email
xbhunanu@126.com
Publications
57 items
PASK: Toward Intent-Aware Proactive Agents with Long-Term Memory
2026
Cited
0
The Latent Space: Foundation, Evolution, Mechanism, Ability, and Outlook
2026
Cited
0
UniICL: Systematizing Unified Multimodal In-context Learning through a Capability-Oriented Taxonomy
2026
Cited
0
CLEAR: Context-Aware Learning with End-to-End Mask-Free Inference for Adaptive Video Subtitle Removal
2026
Cited
0
TheraAgent: Multi-Agent Framework with Self-Evolving Memory and Evidence-Calibrated Reasoning for PET Theranostics
2026
Cited
0
MedMASLab: A Unified Orchestration Framework for Benchmarking Multimodal Medical Multi-Agent Systems
2026
Cited
0
The Trinity of Consistency as a Defining Principle for General World Models
2026
Cited
0
Modality Gap-Driven Subspace Alignment Training Paradigm For Multimodal Large Language Models
2026
Cited
0
Resume (English only)
Academic Achievements
Recipient of the Shanghai Overseas Talents Award (Baiyulan Young Talent Program), 2023
Oct 2025: PointSeg awarded ICCV Workshop Best Demonstration Award
Jul 2025: IPVG accepted by ACM MM 2025 and runner-up in the 2025 ACM Multimedia Challenge (Identity-preserving Video Generation)
Jul 2025: DICE-Talk and StrandDesigner accepted by ACM MM 2025
Jun 2025: OracleFusion and UniCombine accepted by ICCV 2025
Feb 2025: Eight papers (Sonic, VTON-HandFit, FTEdit, CustAny, GroundingFace, SVFR, DVHGNN, MobileMamba) accepted by CVPR 2025 (ranked 92nd globally)
Jul 2024: 3Diffusion accepted by ACM MM 2024
Jul 2024: RLR and DiffuMatting accepted by ECCV 2024
Jul 2024: One paper accepted by IEEE TCSVT
May 2024: One paper accepted by Pattern Recognition (PR)
Selected publications include: Sonic (CVPR 2025), OracleFusion (ICCV 2025), VTON-HandFit (arXiv 2024), DiffuMatting (ECCV 2024), RLR (ECCV 2024), ManipVQA (IROS 2024), HitNet (AAAI 2023), Plug-and-Play 3D (TPAMI 2022), among others
Co-authors
25 total
Bjoern Menze
Universität Zürich
Donghao Luo
Youtu lab@Tencent, Shanghai Jiao Tong University
Jiangning Zhang (张江宁)
Youtu Lab, Tencent | Zhejiang University
Wenqi Ren
Sun Yat-sen University
Chengjie WANG (汪铖杰)
Tencent Youtu Lab, Shanghai Jiao Tong University
Xiaochun Cao
Sun Yat-sen University
Hongwei Bran Li
Martinos Center, MGH, Harvard Medical School