Scholar

Weitai Kang

Google Scholar ID: hDl0MkwAAAAJ

University of Illinois Chicago

Large Multimodal Model, Visual Grounding, AI Agent

Citations & Impact

All-time

Citations

106

H-index

8

i10-index

5

Publications

13

Co-authors

21

Publications

9 items

Inline Critic Steers Image Editing

2026

Cited

0

From Particles to Fields: Reframing Photon Mapping with Continuous Gaussian Photon Fields

2025

Cited

0

VGent: Visual Grounding via Modular Design for Disentangling Reasoning and Prediction

2025

Cited

0

Investigating the Design Space of Visual Grounding in Multimodal Large Language Model

2025

Cited

0

GuirlVG: Incentivize GUI Visual Grounding via Empirical Exploration on Reinforcement Learning

2025

Cited

0

InfantAgent-Next: A Multimodal Generalist Agent for Automated Computer Interaction

2025

Cited

0

3DResT: A Strong Baseline for Semi-Supervised 3D Referring Expression Segmentation

2025

Cited

0

Robin3D: Improving 3D Large Language Model via Robust Instruction Tuning

arXiv.org · 2024

Cited

2

Co-authors

21 total

University of Illinois Chicago

Trustee Chair Professor of Computer Science, University of Central Florida

University of Illinois Chicago

Professor, Beijing Jiaotong University, UTS, UIUC, NUS

Beijing Jiaotong University

Peking University | CMU | ETH Zurich | University of Oxford | University of Trento | NEU | IIAI

Assistant Professor at University of Central Florida

University of Minnesota