Scholar
Shilong Liu
Google Scholar ID: nkSVY3MAAAAJ
RS@ByteDance, PhD@THU
Computer Vision
Object Detection
Visual Grounding
Multi-Modality
Multimodal Agent
Citations & Impact
All-time
Citations: 12,342
H-index: 30
i10-index: 44
Publications: 20
Co-authors: 40 (list available)
Publications
26 items
Wan-R1: Verifiable-Reinforcement Learning for Video Reasoning
2026
Cited
0
UI-Mem: Self-Evolving Experience Memory for Online Reinforcement Learning in Mobile GUI Agents
2026
Cited
0
Avenir-Web: Human-Experience-Imitating Multimodal Web Agents with Mixture of Grounding Experts
2026
Cited
0
MiLDEdit: Reasoning-Based Multi-Layer Design Document Editing
arXiv.org · 2026
Cited
0
Web World Models
2025
Cited
0
CubeBench: Diagnosing Interactive, Long-Horizon Spatial Reasoning Under Partial Observations
2025
Cited
0
AMS-IO-Bench and AMS-IO-Agent: Benchmarking and Structured Reasoning for Analog and Mixed-Signal Integrated Circuit Input/Output Design
2025
Cited
0
Zoom in, Click out: Unlocking and Evaluating the Potential of Zooming for GUI Grounding
2025
Cited
0
Co-authors
40 total
Lei Zhang
International Digital Economy Academy (IDEA)
Feng Li
PhD student, Hong Kong University of Science and Technology
Hao Zhang
NVIDIA Research
Tianhe Ren
PhD student of Electrical and Electronic Engineering, The University of Hong Kong
Jun Zhu
Professor of Computer Science, Tsinghua University
Hang Su
Associate Professor, Tsinghua University
Zhaoyang Zeng
International Digital Economy Academy
Jianwei Yang
Research Scientist, Meta SuperIntelligence Lab