Scholar

Haozhe Qi

Google Scholar ID: BajLgxUAAAAJ

EPFL

MLLM3Dpose estimationmotion generationvideo understanding

Google Scholar↗

Citations & Impact

All-time

Citations

237

H-index

3

i10-index

2

Publications

6

Co-authors

0

Contact

No contact links provided.

Publications

6 items

AdaptToken: Entropy-based Adaptive Token Selection for MLLM Long Video Understanding

2026

Cited

0

Loc3R-VLM: Language-based Localization and 3D Reasoning with Vision-Language Models

2026

Cited

0

Proto-Former: Unified Facial Landmark Detection by Prototype Transformer

2025

Cited

0

EPFL-Smart-Kitchen-30: Densely annotated cooking dataset with 3D kinematics to challenge video and language models

2025

Cited

0

LLaVAction: evaluating and training multi-modal large language models for action recognition

2025

Cited

0

MammAlps: A multi-view video behavior monitoring dataset of wild mammals in the Swiss Alps

2025

Cited

0

Resume (English only)

Co-authors

0 total

Co-authors: 0 (list not available)