AgoraResearch hub
ExploreLibraryProfile
Account
Shilong Liu
Scholar

Shilong Liu

Google Scholar ID: nkSVY3MAAAAJ
RS@ByteDance, PhD@THU
Computer VisionObject DetectionVisual GroundingMulti-ModalityMultimodal Agent
Homepage↗Google Scholar↗
Citations & Impact
All-time
Citations
12,342
 
H-index
30
 
i10-index
44
 
Publications
20
 
Co-authors
40
list available
Contact
No contact links provided.
Publications
29 items
LocateAnything: Fast and High-Quality Vision-Language Grounding with Parallel Box Decoding
2026
Cited
0
MemEye: A Visual-Centric Evaluation Framework for Multimodal Agent Memory
2026
Cited
0
Learning Agent Routing From Early Experience
2026
Cited
0
Wan-R1: Verifiable-Reinforcement Learning for Video Reasoning
2026
Cited
0
UI-Mem: Self-Evolving Experience Memory for Online Reinforcement Learning in Mobile GUI Agents
2026
Cited
0
Avenir-Web: Human-Experience-Imitating Multimodal Web Agents with Mixture of Grounding Experts
2026
Cited
0
MiLDEdit: Reasoning-Based Multi-Layer Design Document Editing
arXiv.org · 2026
Cited
0
Web World Models
2025
Cited
0
Resume (English only)
Co-authors
40 total
Lei Zhang
Lei Zhang
International Digital Economy Academy (IDEA)
Feng Li
Feng Li
PhD student, Hong Kong University of Science and Technology
Hao Zhang
Hao Zhang
NVIDIA Research
Tianhe Ren
Tianhe Ren
PhD student of Electrical and Electronic Engineering, The University of Hong Kong
Jun Zhu
Jun Zhu
Professor of Computer Science, Tsinghua University
Hang Su
Hang Su
Associated Professor, Tsinghua University
Zhaoyang Zeng
Zhaoyang Zeng
International Digital Economy Academy
Jianwei Yang
Jianwei Yang
Research Scientist, Meta SuperIntelligence Lab

Welcome back

Sign in to Agora

Welcome back! Please sign in to continue.

Do not have an account?