Scholar
Kaichen Zhang
Google Scholar ID: 4_mo8-oAAAAJ
Nanyang Technological University
VLMs
Computer Vision
Multi-modality
Follow
Homepage
↗
Google Scholar
↗
Citations & Impact
All-time
Citations
1,747
H-index
6
i10-index
6
Publications
8
Co-authors
6
list available
Contact
No contact links provided.
Publications
13 items
LLaVA-OneVision-2: Towards Next-Generation Perceptual Intelligence
2026
Cited
0
ParaVT: Taming the Tool Prior Paradox for Parallel Tool Use in Agentic Video Reinforcement Learning
2026
Cited
0
Senses Wide Shut: A Representation-Action Gap in Omnimodal LLMs
2026
Cited
0
Visual Generation in the New Era: An Evolution from Atomic Mapping to Agentic World Modeling
2026
Cited
0
DINO-RotateMatch: A Rotation-Aware Deep Framework for Robust Image Matching in Large-Scale 3D Reconstruction
2025
Cited
0
LongVT: Incentivizing "Thinking with Long Videos" via Native Tool Calling
2025
Cited
0
OpenMMReasoner: Pushing the Frontiers for Multimodal Reasoning with an Open and General Recipe
2025
Cited
0
UniME-V2: MLLM-as-a-Judge for Universal Multimodal Embedding Learning
2025
Cited
0
Load more
Resume (English only)
Co-authors
6 total
Brian (Bo) Li
PhD Student@NTU, Singapore
Ziwei Liu
Associate Professor, Nanyang Technological University
Yuanhan Zhang
PhD Candidate, MMLab@NTU
Peiyuan Zhang
University of California, San Diego
Jingkang Yang
PhD, MMLab@NTU
Fanyi Pu
MMLab@NTU, Singapore
×
Welcome back
Sign in to Agora
Welcome back! Please sign in to continue.
Email address
Password
Forgot password?
Continue
Do not have an account?
Sign up