Scholar
Haozhe Qi
Google Scholar ID: BajLgxUAAAAJ
EPFL
MLLM
3D
pose estimation
motion generation
video understanding
Follow
Google Scholar
↗
Citations & Impact
All-time
Citations
237
H-index
3
i10-index
2
Publications
6
Co-authors
0
Contact
No contact links provided.
Publications
6 items
AdaptToken: Entropy-based Adaptive Token Selection for MLLM Long Video Understanding
2026
Cited
0
Loc3R-VLM: Language-based Localization and 3D Reasoning with Vision-Language Models
2026
Cited
0
Proto-Former: Unified Facial Landmark Detection by Prototype Transformer
2025
Cited
0
EPFL-Smart-Kitchen-30: Densely annotated cooking dataset with 3D kinematics to challenge video and language models
2025
Cited
0
LLaVAction: evaluating and training multi-modal large language models for action recognition
2025
Cited
0
MammAlps: A multi-view video behavior monitoring dataset of wild mammals in the Swiss Alps
2025
Cited
0
Resume (English only)
Co-authors
0 total
Co-authors: 0 (list not available)
×
Welcome back
Sign in to Agora
Welcome back! Please sign in to continue.
Email address
Password
Forgot password?
Continue
Do not have an account?
Sign up