Scholar

Zhi-Qi Cheng

Google Scholar ID: uB2He2UAAAAJ

Assistant Professor @ UW | Graduate Faculty | Ex-CMU, Google, Microsoft | Intel & IBM PhD Fellowship

multimedia processing, multimedia understanding, multimodal foundation model

Citations & Impact

All-time

Citations

2,419

Publications

25 items

Sat3R: Satellite DSM Reconstruction via RPC-Aware Depth Fine-tuning

2026

ATSS: Detecting AI-Generated Videos via Anomalous Temporal Self-Similarity

2026

Language-Conditioned World Modeling for Visual Navigation

2026

FlexMap: Generalized HD Map Construction from Flexible Camera Configurations

2026

Emotion-LLaMAv2 and MMEVerse: A New Framework and Benchmark for Multimodal Emotion Understanding

2026

Cell Behavior Video Classification Challenge, a benchmark for computer vision methods in time-lapse microscopy

2026

Overcoming Joint Intractability with Lossless Hierarchical Speculative Decoding

arXiv.org · 2026

Unified World Models: Memory-Augmented Planning and Foresight for Visual Navigation

2025

Co-authors

20 total

Alex Hauptmann

Carnegie Mellon University

Qi Dai

Microsoft Research

Yuxuan Zhou

University of Mannheim

Yu-Gang Jiang

Professor, Fudan University. IEEE & IAPR Fellow

Teruko Mitamura

Research Professor of Language Technologies Institute, School of Computer Science, Carnegie Mellon

Xian-Sheng Hua (华先胜) (IEEE Fellow)

Tongji University & Terminus Group

Chong-Wah Ngo

Singapore Management University

Margret Keuper

University of Mannheim