Scholar
Wenxuan Song
Google Scholar ID: jtFoCpwAAAAJ
The Hong Kong University of Science and Technology (Guangzhou)
Vision-language-action Model
Robotics
Citations & Impact
All-time
Citations
144
H-index
7
i10-index
6
Publications
12
Co-authors
9
Publications
27 items
DFM-VLA: Iterative Action Refinement for Robot Manipulation via Discrete Flow Matching
2026
Cited
0
Fast-dVLA: Accelerating Discrete Diffusion VLA to Real-Time Performance
2026
Cited
0
MMaDA-VLA: Large Diffusion Vision-Language-Action Model with Unified Multi-Modal Instruction and Generation
2026
Cited
0
VAMPO: Policy Optimization for Improving Visual Dynamics in Video Action Models
2026
Cited
0
S-VAM: Shortcut Video-Action Model by Self-Distilling Geometric and Semantic Foresight
2026
Cited
0
PROSPECT: Unified Streaming Vision-Language Navigation via Semantic–Spatial Fusion and Latent Predictive Representation
2026
Cited
0
Rethinking the Practicality of Vision-language-action Model: A Comprehensive Benchmark and An Improved Baseline
2026
Cited
0
FRAPPE: Infusing World Modeling into Generalist Policies via Multiple Future Representation Alignment
2026
Cited
0
Co-authors
9 total
Pengxiang Ding
Zhejiang University
Han Zhao
Zhejiang University | Westlake University
Donglin Wang
Westlake University
Siteng Huang
Alibaba DAMO Academy | ZJU | Westlake University
Can Cui
Shanghai Jiao Tong University
Zongyuan (Tony) Ge
Associate Prof | Director of AIM for Health Lab | NVIDIA AI Fellowship
Haoang Li
Assistant Professor, Hong Kong University of Science and Technology (Guangzhou)
JIAYI CHEN
Tongji University