Jian Luan
Google Scholar ID: 6Z8RUi4AAAAJ
Toshiba, Microsoft, Xiaomi
LLM
VLM
TTS
Singing Synthesis
Citations & Impact (all-time)
Citations: 1,354
H-index: 18
i10-index: 27
Publications: 20
Co-authors: 6 (list available)
Publications
77 items
Scaling, Benchmarking, and Reasoning of Vision-Language Agents for Mobile GUI Navigation (2026). Cited: 0
Dasheng AudioGen: A Unified Model for Generating Coherent Audio Scenes from Text (2026). Cited: 0
PixelWizard: Towards Efficient High-Fidelity Video Generation at Ultra-Large Spatial Resolution (2026). Cited: 0
SimuWoB: Simulating Real-World Mobile Apps for Fast and Faithful GUI Agent Benchmarking (2026). Cited: 0
PROVE: A Perceptual RemOVal cohErence Benchmark for Visual Media (2026). Cited: 0
Beyond Binary: Reframing GUI Critique as Continuous Semantic Alignment (2026). Cited: 0
StreamPro: From Reactive Perception to Proactive Decision-Making in Streaming Video (2026). Cited: 0
How Mobile World Model Guides GUI Agents? (2026). Cited: 0
Co-authors
6 total
Xu Tan
Principal Researcher and Research Manager, Microsoft
Zhiyong WU (吴志勇)
Associate Professor, Tsinghua University
Shengchen Li
Xi'an Jiaotong-Liverpool University
YUANBO HOU
University of Oxford