Scholar
Zhi-Qi Cheng
Google Scholar ID: uB2He2UAAAAJ
Assistant Professor @ UW | Graduate Faculty | Ex-CMU, Google, Microsoft | Intel & IBM PhD Fellowship
multimedia processing
multimedia understanding
multimodal foundation model
Follow
Homepage
↗
Google Scholar
↗
Citations & Impact
All-time
Citations
2,419
H-index
27
i10-index
45
Publications
20
Co-authors
20
list available
Contact
No contact links provided.
Publications
24 items
ATSS: Detecting AI-Generated Videos via Anomalous Temporal Self-Similarity
2026
Cited
0
Language-Conditioned World Modeling for Visual Navigation
2026
Cited
0
FlexMap: Generalized HD Map Construction from Flexible Camera Configurations
2026
Cited
0
Emotion-LLaMAv2 and MMEVerse: A New Framework and Benchmark for Multimodal Emotion Understanding
2026
Cited
0
Cell Behavior Video Classification Challenge, a benchmark for computer vision methods in time-lapse microscopy
2026
Cited
0
Overcoming Joint Intractability with Lossless Hierarchical Speculative Decoding
arXiv.org · 2026
Cited
0
Unified World Models: Memory-Augmented Planning and Foresight for Visual Navigation
2025
Cited
0
GoViG: Goal-Conditioned Visual Navigation Instruction Generation
2025
Cited
0
Load more
Resume (English only)
Co-authors
20 total
Alex Hauptmann
Carnegie Mellon University
Qi Dai
Microsoft Research
Yuxuan Zhou
University of Mannheim
Yu-Gang Jiang
Professor, Fudan University. IEEE & IAPR Fellow
Teruko Mitamura
Research Professor of Language Technologies Institute, School of Computer Science, Carnegie Mellon
Xian-Sheng Hua (华先胜)(IEEE Fellow)
Tongji University & Terminus Group
Chong-Wah Ngo
Singapore Management University
Margret Keuper
University of Mannheim
×
Welcome back
Sign in to Agora
Welcome back! Please sign in to continue.
Email address
Password
Forgot password?
Continue
Do not have an account?
Sign up