Scholar
Wenxuan Song
Google Scholar ID: jtFoCpwAAAAJ
The Hong Kong University of Science and Technology (Guangzhou)
Vision-language-action Model
Robotics
Follow
Google Scholar
↗
Citations & Impact
All-time
Citations
144
H-index
7
i10-index
6
Publications
12
Co-authors
9
list available
Contact
No contact links provided.
Publications
31 items
IPIBench: Evaluating Interactive Proactive Intelligence of MLLMs under Continuous Streams
2026
Cited
0
SEDualVLN: A Spatially-Enhanced Dual-System for Vision-Language Navigation
2026
Cited
0
CapVector: Learning Transferable Capability Vectors in Parametric Space for Vision-Language-Action Models
2026
Cited
0
RoboMemArena: A Comprehensive and Challenging Robotic Memory Benchmark
2026
Cited
0
DFM-VLA: Iterative Action Refinement for Robot Manipulation via Discrete Flow Matching
2026
Cited
0
Fast-dVLA: Accelerating Discrete Diffusion VLA to Real-Time Performance
2026
Cited
0
MMaDA-VLA: Large Diffusion Vision-Language-Action Model with Unified Multi-Modal Instruction and Generation
2026
Cited
0
VAMPO: Policy Optimization for Improving Visual Dynamics in Video Action Models
2026
Cited
0
Load more
Resume (English only)
Co-authors
9 total
Pengxiang Ding
Zhejiang University
Han Zhao
Zhejiang University | Westlake University
Donglin Wang
Westlake University
Siteng Huang
Alibaba DAMO Academy | ZJU | Westlake University
Can Cui
Shanghai Jiao Tong University
Zongyuan (Tony) Ge
Associate Prof | Director of AIM for Health Lab | NVIDIA AI Fellowship
Haoang Li
Assistant Professor, Hong Kong University of Science and Technology (Guangzhou)
JIAYI CHEN
Tongji University
×
Welcome back
Sign in to Agora
Welcome back! Please sign in to continue.
Email address
Password
Forgot password?
Continue
Do not have an account?
Sign up