Scholar
Weitai Kang
Google Scholar ID: hDl0MkwAAAAJ
University of Illinois Chicago
Large Multimodal Model
Visual Grounding
AI Agent
Follow
Homepage
↗
Google Scholar
↗
Citations & Impact
All-time
Citations
106
H-index
8
i10-index
5
Publications
13
Co-authors
21
list available
Contact
No contact links provided.
Publications
9 items
Inline Critic Steers Image Editing
2026
Cited
0
From Particles to Fields: Reframing Photon Mapping with Continuous Gaussian Photon Fields
2025
Cited
0
VGent: Visual Grounding via Modular Design for Disentangling Reasoning and Prediction
2025
Cited
0
Investigating the Design Space of Visual Grounding in Multimodal Large Language Model
2025
Cited
0
GuirlVG: Incentivize GUI Visual Grounding via Empirical Exploration on Reinforcement Learning
2025
Cited
0
InfantAgent-Next: A Multimodal Generalist Agent for Automated Computer Interaction
2025
Cited
0
3DResT: A Strong Baseline for Semi-Supervised 3D Referring Expression Segmentation
2025
Cited
0
Robin3D: Improving 3D Large Language Model via Robust Instruction Tuning
arXiv.org · 2024
Cited
2
Load more
Resume (English only)
Co-authors
21 total
Yan Yan
University of Illinois Chicago
Mubarak Shah
Trustee Chair Professor of Computer Science, University of Central Florida
Junyi Wu
University of Illinois Chicago
Yunchao Wei
Professor, Beijing Jiaotong University, UTS, UIUC, NUS
Mengxue Qu
Beijing Jiaotong University
Hao Tang
Peking University | CMU | ETH Zurich | University of Oxford | University of Trento | NEU | IIAI
Yuzhang Shang
Assistant Professor at University of Central Florida
Bin lei
University of Minnesota
×
Welcome back
Sign in to Agora
Welcome back! Please sign in to continue.
Email address
Password
Forgot password?
Continue
Do not have an account?
Sign up