Scholar
Kai Kang
Google Scholar ID: brEBMIkAAAAJ
Apple
computer vision
deep learning
video analysis
object detection
multimodal LLM
Follow
Homepage
↗
Google Scholar
↗
Citations & Impact
All-time
Citations
1,593
H-index
10
i10-index
10
Publications
13
Co-authors
28
list available
Contact
No contact links provided.
Publications
7 items
KoopmanFlow: Spectrally Decoupled Generative Control Policy via Koopman Structural Bias
2026
Cited
0
Evaluation and LLM-Guided Learning of ICD Coding Rationales
2025
Cited
0
LAMIC: Layout-Aware Multi-Image Composition via Scalability of Multimodal Diffusion Transformer
2025
Cited
0
Rooms from Motion: Un-posed Indoor 3D Object Detection as Localization and Mapping
2025
Cited
0
SlowFast-LLaVA-1.5: A Family of Token-Efficient Video Large Language Models for Long-Form Video Understanding
2025
Cited
0
MM-Spatial: Exploring 3D Spatial Understanding in Multimodal LLMs
2025
Cited
0
MapComp: A Secure View-based Collaborative Analytics Framework for Join-Group-Aggregation
arXiv.org · 2024
Cited
1
Resume (English only)
Co-authors
28 total
Xiaogang Wang
Professor of Electronic Engineering, the Chinese University of Hong Kong
Hongsheng Li (李鸿升)
The Chinese University of Hong Kong
Wanli Ouyang (欧阳万里)
Shanghai AI Lab & CUHK
Tong Xiao
Facebook Inc
Chen Change Loy
President's Chair Professor, MMLab@NTU, S-Lab, Nanyang Technological University
Xingyu Zeng
Shenzhen University of Advanced Technology
Ruohui Wang
SenseTime Research
Afshin Dehghan
AI/ML @Apple
×
Welcome back
Sign in to Agora
Welcome back! Please sign in to continue.
Email address
Password
Forgot password?
Continue
Do not have an account?
Sign up